Vorbis Technology Demonstration 2002-07-28 18:36:20 I had a little fun with the Vorbis 1.0 libs, hacked a bit with it, and made some demonstration oggs of what it is capable of at the really low end.Have a listen to:http://sjeng.org/ftp/vorbis/scar.ogg (6.4kbps)http://sjeng.org/ftp/vorbis/queen.ogg (5.3kbps)http://sjeng.org/ftp/vorbis/ss.ogg (6.3kbps)http://sjeng.org/ftp/vorbis/waitress.ogg (6.7kbps)Edit: alsohttp://sjeng.org/ftp/vorbis/daan.ogg (6.6kbps)And keep in mind this is 1/20'th of an 128kbps file.-- GCP
Vorbis Technology Demonstration Reply #1 – 2002-07-28 19:22:24 :jawdrop:Holy f'n $#!+!!!! That's incredible!Don't get me wrong, they sound like ass. They sound remarkably good for 8kb/s though.How much encoder tuning have you done, and is more possible?
Vorbis Technology Demonstration Reply #2 – 2002-07-28 19:28:08 QuoteOriginally posted by Ardax They sound remarkably good for 8kb/s though.How much encoder tuning have you done, and is more possible?6kbps, not 8kbps There is no real tuning in here; I spent about an hour re-familiarizing myself with the 1.0 infrastructure and made the new encoding mode on guesswork. Getting everything to work took another two hours. I need to talk to monty to get a few things cleared up before more can be done.I would like to put a CD on a floppy. If you have 2.8M floppies, you can do so already.-- GCP
Vorbis Technology Demonstration Reply #3 – 2002-07-28 19:34:01 and make Vorbis Floppy-Transparent to 99% users over 99% samples:"OGG Vorbis is the only format to give You floppy-quality at ~6kbps"
Vorbis Technology Demonstration Reply #4 – 2002-07-28 20:03:11 you can imagine what can be possible for 64 kbps :eek:
Vorbis Technology Demonstration Reply #5 – 2002-07-28 20:05:08 Impressive Garf indeed! I think most of the pure tones in Scar sound particularly very impressive at 8 kHz telephony fidelity. Have you tested this out with any sort of speech signal yet? You didn't happen to by any chance use floor 0 did you for testing purposes?
Vorbis Technology Demonstration Reply #6 – 2002-07-28 20:09:11 Setup an ogg stream (q0 40~45kb/s 22khz )rexit2.ath.cx:8000/q0.ogg (seems to be a GREAT alternative to mp3 streaming @ lower bitrates)
Vorbis Technology Demonstration Reply #7 – 2002-07-28 20:27:49 QuoteOriginally posted by HotshotGG Impressive Garf indeed! I think most of the pure tones in Scar sound particularly very impressive at 8 kHz telephony fidelity. Have you tested this out with any sort of speech signal yet? You didn't happen to by any chance use floor 0 did you for testing purposes?I've tested it with speech against Speex 5.1.The size comes around 5-6kbps, which is lower than Speex quality 1 (goes from 1-10) and higher than Speex quality 0. Speex quality 0 artifacts quite badly and I wouldn't know if this mode or Speex is better. At quality 1, Speex wins hands down. The problem of this mode is that it doesn't window-switch, which (I think) is needed to prevent sibilants from echoing. Except for the echo's, the mode sounds quite reasonable on voice.Based on this result, I think you could make a Vorbis 1.0 encoder that handles voice at least as well as the current Speex. Interesting result, considering Speex was designed for voice and Vorbis wasn't. You'd want to have someone with more knowledge about voice coding than me working on it though I would have liked to try floor0 with this, as it _possibly_ could be a little more efficient, but the 1.0 code doesn't have any other mode that uses it! Without an example or prior codebooks to work from it is impossible for me to make one. I already had enough problems with the floor1 codebooks now. -- GCP
Vorbis Technology Demonstration Reply #8 – 2002-07-28 20:41:09 QuoteWithout an example or prior codebooks to work from it is impossible for me to make one. I already had enough problems with the floor1 codebooks now.yes, in deed I had been looking over the floor 1 backend implementation myself and I still need to look over it a few more times to figure it out. I wouldn't argue with you about the codebooks as they look they are a major task to figure out. I had wanted to point that out with Floor 0, because floor 0 as you know uses a IIR LSP representation. This might be especially useful in speech coding. It seems as though floor 0 is deprecated for the time being though like you said. I had been looking through the documentation for it though anyway.
Vorbis Technology Demonstration Reply #9 – 2002-07-28 21:20:36 QuoteOriginally posted by HotshotGG I had wanted to point that out with Floor 0, because floor 0 as you know uses a IIR LSP representation. This might be especially useful in speech coding. The usage is so different that success of LSP/LPC in speech coding has no bearing on Vorbis. I'd prefer floor0 because it may be more efficient for uncoupled modes (like mono) than floor1, even when not encoding speech.-- GCP
Vorbis Technology Demonstration Reply #10 – 2002-07-28 21:38:24 QuoteThe usage is so different that success of LSP/LPC in speech coding has no bearing on Vorbis. I'd prefer floor0 because it may be more efficient for uncoupled modes (like mono) than floor1, even when not encoding speech.I see where you coming from. It might be useful from an experimental point. I had figured that LSP/LPC representation to have no bearing for speech coding in Vorbis. I had just been quite confused when it had been implemented and I was reading it was popular in speech coding. Now I understand everyone has there own flavors of it.
Vorbis Technology Demonstration Reply #11 – 2002-07-28 22:31:21 Uhm...could you give us the exact command line for these ultra-low settings? I'd like to do previews of entire albums for my band at cdstreet.com. We have 10 MB of space of samples of the CDs and with that bitrate (which sound damn good) I can fit a ton of stuff in there...so, what q- setting is this? -1? -2? -3?
Vorbis Technology Demonstration Reply #12 – 2002-07-28 22:40:08 OMG! LOOK HOW SMALL AND CUTE THEY ARE! 6KBPS!!! WOW! THEY ARE LIKE LITTLE BABIES!
Vorbis Technology Demonstration Reply #13 – 2002-07-28 22:47:28 QuoteOriginally posted by Zaraza Uhm...could you give us the exact command line for these ultra-low settings? I'd like to do previews of entire albums for my band at cdstreet.com. We have 10 MB of space of samples of the CDs and with that bitrate (which sound damn good) I can fit a ton of stuff in there...so, what q- setting is this???? -1? -2? -3? This is a custom version of the encoder libs. If I can get monty to help sort out a few remaining issues, I'll make it available.-- GCP
Vorbis Technology Demonstration Reply #14 – 2002-07-28 23:04:55 QuoteOriginally posted by Zaraza We have 10 MB of space of samples of the CDs and with that bitrate (which sound damn good) I can fit a ton of stuff in there...oggenc -q -1 --downmix --resample 8000 with the normal Vorbis 1.0 oggenc will already fit an entire CD in 10M-- GCP
Vorbis Technology Demonstration Reply #15 – 2002-07-29 00:00:07 QuoteIf I can get monty to help sort out a few remaining issuesWhat would you need sorted out might I ask?
Vorbis Technology Demonstration Reply #16 – 2002-07-29 00:06:53 QuoteOriginally posted by HotshotGG What would you need sorted out might I ask?Why I get massively differing performance from one codebook training run to another.Why some samples completely fall apart (as in: not remotely comparable to original) even though they're not particularly hard.Why the codebook trainers sometimes don't produce all the residue training files I'd expect them to.What the correct parameters for floor training are (I guessed them).The codebook training part of Vorbis is one of the least intiutive (at least to me) and it's almost undocumented (the spec doesn't help either)-- GCP
Vorbis Technology Demonstration Reply #17 – 2002-07-29 00:18:06 QuoteThe codebook training part of Vorbis is one of the least intiutive (at least to me) and it's almost undocumented (the spec doesn't help either)yes, when I was looking over the source code myself I didn't even make an effort to look over the codebooks as well I had the same basic idea in my mind. What codebooks are you using just the regular scalar huffman codebooks?
Vorbis Technology Demonstration Reply #18 – 2002-07-29 00:49:41 QuoteOriginally posted by HotshotGG What codebooks are you using just the regular scalar huffman codebooks?IIRC, floor codebooks are basic huffman and residue is lattice VQ-- GCP
Vorbis Technology Demonstration Reply #19 – 2002-07-29 02:58:40 QuoteIIRC, floor codebooks are basic huffman and residue is lattice VQThank you for bringing that to my attention again. I was aware of the residue using lattice VQ codebooks, however I never really thought that the floor codebooks used huffman coding. I see the puzzle fits together now.
Vorbis Technology Demonstration Reply #20 – 2002-07-29 05:13:33 I was fooling around with this extreme low bitrate stuff last week. Mono, -q -1, 8KHz resampling, etc. I encoded George Winston's December CD, which is all solo piano. Even at 8kbps, it didn't sound half bad. I guess you could use this to stream music over a 28.8kbps modem connection...but who has those anymore?<- 1.5Mbps broadband pipe
Vorbis Technology Demonstration Reply #21 – 2002-07-29 05:16:41 Quotewhich is all solo pianoyes, strong tones sound particularly good it's unfortunate there is no window switching like Garf mentioned above to eliminate any echo's etc. Quote<- 1.5Mbps broadband pipeMy dad is heavenly into Broadband and from a growing response in that area I think we are moving in just the opposite direction. I can't even get my friend back on his 56.6 V. 90 dial-up. AT&T is ripping everyone off in this city because there is no competition, but that's another story that is more or less important for a discussion on this forum because physcoacoustics audio compression is more important.]
Vorbis Technology Demonstration Reply #22 – 2002-08-12 17:04:35 I've shaved off another 40-50%, so they're around 3.8kbps now.http://sjeng.org/ftp/vorbis/scarres2.ogg (4.0kbps)http://sjeng.org/ftp/vorbis/ssres2.ogg (4.3kbps)http://sjeng.org/ftp/vorbis/qnres2.ogg (3.2kps)Not much worse than the previous ones due to better tuning & some hints from Klemm.-- GCP
Vorbis Technology Demonstration Reply #23 – 2002-08-12 17:13:44 It's hard to believe ! Absolutely amazing (but really awful, too) Can you go below ?
Vorbis Technology Demonstration Reply #24 – 2002-08-12 17:29:40 Amazing. Though there are a few codecs designed for voice transmission in the 1000-6000bps range, I don't recall any of them ever being able to handle a musical signal this well when playing around with them (voxware, 1-bit ADPCM, etc). An application for such a low bitrate could include streaming audio for low-bandwidth digital cell phones (9-14.4kbps). Is it possible to get an 8Khz stereo encode at bitrates under 10kbps?If this demonstration were run by Micro$oft, they would probably claim "CD Quality At 3kbps!"