I have developed a new codec for my signal processing learning.

Topic: I have developed a new codec for my signal processing learning. (Read 563 times) previous topic - next topic

0 Members and 1 Guest are viewing this topic.

I have developed a new codec for my signal processing learning.

2024-03-10 21:26:20

I have developed a new codec for my signal processing learning.

It can handle quality values from 0: 80kbps-110kbps to quality value 10: 300kbps-350kbps, and the values can be specified as a decimal point or an integer.

I can assure you that the quality is not as good as the current codecs, but it was very well referenced for my learning.

The code is written in Python.

Below I paste a sample.

I hope you find it of interest.

MOD edit: Please keep music clips under 30 seconds

Re: I have developed a new codec for my signal processing learning.

Reply #1 – 2024-03-10 21:33:09

xxx.y.flac
where y is the quality value.
A bit rate of 0 is 107 kbps
A bit rate of 2 is 132 kbps
A bit rate of 5 is 183 kbps
is the quality value.

Re: I have developed a new codec for my signal processing learning.

Reply #2 – 2024-03-10 23:07:07

Can you give us original flac so we can compare?

Re: I have developed a new codec for my signal processing learning.

Reply #3 – 2024-03-10 23:13:52

Okey

Re: I have developed a new codec for my signal processing learning.

Reply #4 – 2024-03-10 23:17:26

This is the sound source for comparison.
And this is my learning codec, so don't get your hopes up.
However, I can receive and teach questions about the technical details of the codecs.

Re: I have developed a new codec for my signal processing learning.

Reply #5 – 2024-03-10 23:35:31

The main algorithms are as follows
This codec basically handles M/S processing; all L/R is converted to the M/S domain.
Then, a 3072-point MDCT is performed next, which is divided into 32 bands before quantization is performed in the mode appropriate for each band.
Four modes of quantization exist.
The first mode, normal quantization, is a type of quantization that adds strain. The allowable strain is adaptively limited according to the volume of the original band, and the resolution is quantized accordingly.
The second mode, logarithmic quantization, is a mode of quantization that is performed after conversion to logarithmic. It converts to log/db and controls the width of quantization to match the band and volume. This mode also allows downsampling in the frequency domain. This process is based on the intuition that waveforms with little vertical width and noise are less likely to be affected by downsampling to some extent.
The third mode, prediction during decoding, this mode is mainly used for higher frequencies. This mode is used in bands where overtone structure is not so important, such as in the upper registers.
The fourth mode, no data, is used when the data is deemed unnecessary to human hearing; when either the M/S data is masked at a certain volume level, the mode changes to no data or prediction.
These are combined and compressed.

Notice