Hydrogenaudio Forums

Hydrogenaudio Forum => Polls => Topic started by: hyaudio on 2019-03-05 12:58:12

Poll
Question: What audio format do you usually use for speech contents?
Option 1: AMR-NB votes: 0
Option 2: AMR-WB votes: 0
Option 3: AMR-WB+ votes: 0
Option 4: Speex votes: 1
Option 5: WMA Voice votes: 0
Option 6: Opus votes: 12
Option 7: EVS votes: 0
Option 8: Codec2 votes: 0
Option 9: MP3 votes: 1
Option 10: Vorbis votes: 0
Option 11: AAC votes: 5
Option 12: Other lossy compressed formats votes: 0
Option 13: Lossless compressed formats votes: 2
Option 14: Uncompressed formats votes: 0
Title: What audio format do you usually use for speech contents?
Post by: hyaudio on 2019-03-05 12:58:12
What audio format do you usually use for speech contents?
Title: Re: What audio format do you usually use for speech contents?
Post by: KozmoNaut on 2019-03-05 13:54:58
Is there really any reason to use anything other than Opus, unless you are locked in by legacy devices that only support a limited set of formats, or your connection has extremely restricted bandwidth? That goes for both music and speech content.
Title: Re: What audio format do you usually use for speech contents?
Post by: Case on 2019-03-05 14:02:09
There is. Using it makes no sense until this (https://hydrogenaud.io/index.php/topic,116605.0.html) is fixed.
Title: Re: What audio format do you usually use for speech contents?
Post by: magicgoose on 2019-03-06 09:46:15
It's not as bad as to make it "no sense to use". Especially with speech, one rarely needs to have track boundaries right in the middle of a phrase. Even with music, the glitch on track transition is not always noticeable.
Title: Re: What audio format do you usually use for speech contents?
Post by: Zarggg on 2019-03-08 20:49:36
When I encode audiobooks from CD, I use HE-AAC at CVBR 32kbps.
Title: Re: What audio format do you usually use for speech contents?
Post by: lvqcl on 2019-03-08 20:58:15
Speech + background music: WMA standard, 64 kbps
Title: Re: What audio format do you usually use for speech contents?
Post by: ThaCrip on 2019-09-19 04:11:51
If we are strictly talking speech... Opus v1.3.1 @ 13kbps (or Opus v1.2 @ 14kbps) is what I consider THE minimum I would personally use (although if one likes to play it a bit safer you could bump up the bit rate a bit). I can notice the difference from a higher quality source file but it's not that significant of a hit (say going from a 128kbps MP3 and thereabouts to Opus @ 13kbps) and saves you a lot of storage space and it is speech at the end of the day and not music, so I don't mind sacrificing some speech quality to save storage space.

so while I have not played with it too much, I can't imaging needing more than around 32-48kbps tops for speech only on Opus, even for those who prefer to have a bit higher quality speech but are not too concerned with total or near transparency.
Title: Re: What audio format do you usually use for speech contents?
Post by: Triza on 2019-09-19 09:48:44
There is. Using it makes no sense until this (https://hydrogenaud.io/index.php/topic,116605.0.html) is fixed.

That will never be fixed. Monty and the other prima donnas are already onto something new. I use and I will keep using Vorbis, which is really good in several aspects, but I will stay away any other work of Monty and Xiph. Only one other successful project they have: FLAC. But they took it over long after it became a success. So it is not their achievement. Every Vorbis user including myself uses Aotuv Vorbis. It seems to be proved and tested over here for years, but they refused to incorporate that hard work. No reasons. Nothing. And now one has to know where to turn to have the source code. It is all disappearing from the internet. 5 more years and it is all gone.

It is a shame how Xiph codecs are managed.

Maybe it is part of the push towards Opus. I did not compared them, but it is possible that Aotuv Vorbis would be at minimum on par with Opus.
Title: Re: What audio format do you usually use for speech contents?
Post by: ani_Jackal3 on 2019-09-19 13:16:05
Speech: 80kbps ABR Lame

Title: Re: What audio format do you usually use for speech contents?
Post by: AndyH-ha on 2019-09-19 22:09:01
I've done a few hundred audiobooks from cassette and CD to LAME  with this setting
-V 8 -m m --vbr-new --resample 22 --lowpass 11 --noreplaygain
The result is easy to  understand, easy to listen to, the files are relatively small, and they play pretty much anywhere.

Being VBR, it sometimes produces a bit rate as high as 256kbps, according to foobar2000 or Razorlame, where I do most of the encoding. Possibly a close ABX could detect some differences from the original or relative to "better" encodings, but I could not care less as there are no playback or listening problems.
SimplePortal 1.0.0 RC1 © 2008-2019