HydrogenAudio

Lossy Audio Compression => Ogg Vorbis => Ogg Vorbis - Tech => Topic started by: kumarsvinod20 on 2013-01-25 09:34:48

Title: documentation for vorbis encoder
Post by: kumarsvinod20 on 2013-01-25 09:34:48
Hi all,
I am new to this forum and currently working on developing vorbis encoder, downloaded the libvorbis-aotuv_b6.03 as i have many doubts.  i would like to know where i can get related documents for vorbis compression algorithm.

Thanks in advance,
Regards
Vinod
Title: documentation for vorbis encoder
Post by: ktf on 2013-01-25 10:10:41
Have you tried this one?

http://xiph.org/vorbis/doc/ (http://xiph.org/vorbis/doc/)
Title: documentation for vorbis encoder
Post by: kumarsvinod20 on 2013-01-25 12:10:28
Have you tried this one?

http://xiph.org/vorbis/doc/ (http://xiph.org/vorbis/doc/)


hi thanks for your reply,
yes i tried that, but i found docs related only to specifications and the compression algorithm is not explined much. Basically i am searching for some documenation which explains the whole compression algorithm (used in refernce encoder) in detail for eg: psychoacoustic model, channeel coupling and encoding of residue etc...

Thanks in advance
Vinod
Title: documentation for vorbis encoder
Post by: Dynamic on 2013-01-28 02:45:50
Why not read the Wikipedia Vorbis page's Technical (http://en.wikipedia.org/wiki/Vorbis#Technical_details) section and follow the references to technical web pages and scholarly papers describing things further. These give a lot more detail. There's also some information in Xiph's CELT and Opus blogs regarding certain techniques (e.g. Constrained Band Energy, Band Folding, Pyramidal Vector Quantization etc.).

Actually, most of that's in the Specification (http://www.xiph.org/vorbis/doc/Vorbis_I_spec.html). If you want to know about Psychoacoustic Models, the general principles are explained in numerous places with reference to other codecs, and most of these techniques are applicable to the tools available in Vorbis. You might also want to look into AoTuV source code versus libvorbis to spot differences in tuning. I think the libvorbis source code is probably pretty good documentation, then search Wikipedia, Google Scholar, and finally Google generally, to understand terms you don't recognise.
Title: documentation for vorbis encoder
Post by: kumarsvinod20 on 2013-01-28 06:28:39
Why not read the Wikipedia Vorbis page's Technical (http://en.wikipedia.org/wiki/Vorbis#Technical_details) section and follow the references to technical web pages and scholarly papers describing things further. These give a lot more detail. There's also some information in Xiph's CELT and Opus blogs regarding certain techniques (e.g. Constrained Band Energy, Band Folding, Pyramidal Vector Quantization etc.).

Actually, most of that's in the Specification (http://www.xiph.org/vorbis/doc/Vorbis_I_spec.html). If you want to know about Psychoacoustic Models, the general principles are explained in numerous places with reference to other codecs, and most of these techniques are applicable to the tools available in Vorbis. You might also want to look into AoTuV source code versus libvorbis to spot differences in tuning. I think the libvorbis source code is probably pretty good documentation, then search Wikipedia, Google Scholar, and finally Google generally, to understand terms you don't recognise.


Hi Thank you very much for your suggestions..

Regards
Vinod
Title: documentation for vorbis encoder
Post by: testyou on 2013-01-28 07:12:20
downloaded the libvorbis-aotuv_b6.03 as i have many doubts

What do you mean?
Title: documentation for vorbis encoder
Post by: kumarsvinod20 on 2013-01-28 08:50:20
downloaded the libvorbis-aotuv_b6.03 as i have many doubts

What do you mean?

I mean , I need some documents for better understanding of the complete compression algorithm used in this code.
Title: documentation for vorbis encoder
Post by: Dynamic on 2013-01-28 16:30:56
Hi Vinod,

I was just checking something out about how SILK works, and was reminded that another good resource is the PDF version (http://gstreamer.freedesktop.org/data/events/gstreamer-conference/2012/opus.pdf) of a LibreOffice or OpenOffice Impress presentation which is probably the one used by Jean-Marc Valin in his LinuxConf.au talk "Opus - The Swiss Army Knife Of Audio Codecs". (That PDF was for a gstreamer-related presentation)

This contains links to relevant Wikipedia articles and illustrative graphics. You can also find the video under Presentations (http://opus-codec.org/presentations/) on the opus-codec.org website. (The specific presentation materials for that talk are linked in that page and are very similar to the gstreamer link I gave above)

You'll find a good bit of detail including the LP filters coefficients that mirror the vocal tract, the way they're converted to Line Spectral Frequencies before quantization, and the combination of fixed codebooks and scalar and vector quantization, all with helpful diagrams and a fair number of equations and wikipedia links to further info, plus references that you can follow up.
Title: documentation for vorbis encoder
Post by: Dynamic on 2013-01-28 18:06:05

OOPS!


I just realised I posted links to Opus documentation and you want Vorbis. Some of it is relevant (e.g. Pyramid Vector Quantization) but a lot of it is different and reflects the lessons learned in Vorbis development (e.g. Opus/CELT explicitly encodes the band energy, Vorbis indirectly reflects this by the noise curve) and specifics relating to speech coding and low latency that Vorbis doesn't have to worry about.
Title: documentation for vorbis encoder
Post by: db1989 on 2013-01-28 18:15:37
downloaded the libvorbis-aotuv_b6.03 as i have many doubts
What do you mean?

I’m not aware of the linguistic basis, or how lessons (or is it online translators?) don’t seem to correct it very often, but many non-native English speakers use the word doubt when the concept they’re actually thinking of is question.
Title: documentation for vorbis encoder
Post by: kumarsvinod20 on 2013-01-29 04:25:04
Hi Vinod,

I was just checking something out about how SILK works, and was reminded that another good resource is the PDF version (http://gstreamer.freedesktop.org/data/events/gstreamer-conference/2012/opus.pdf) of a LibreOffice or OpenOffice Impress presentation which is probably the one used by Jean-Marc Valin in his LinuxConf.au talk "Opus - The Swiss Army Knife Of Audio Codecs". (That PDF was for a gstreamer-related presentation)

This contains links to relevant Wikipedia articles and illustrative graphics. You can also find the video under Presentations (http://opus-codec.org/presentations/) on the opus-codec.org website. (The specific presentation materials for that talk are linked in that page and are very similar to the gstreamer link I gave above)

You'll find a good bit of detail including the LP filters coefficients that mirror the vocal tract, the way they're converted to Line Spectral Frequencies before quantization, and the combination of fixed codebooks and scalar and vector quantization, all with helpful diagrams and a fair number of equations and wikipedia links to further info, plus references that you can follow up.


Hi, Thank you very much for the information provided above.

Regards
Vinod