HydrogenAudio

Hydrogenaudio Forum => Listening Tests => Topic started by: Gabriel on 2004-11-21 13:05:01

Title: A new recommendation for listening tests?
Post by: Gabriel on 2004-11-21 13:05:01
I would like to suggest a little change to the recommended practices in listening tests.

Most of modern codecs are working based on the recent context. They usually have a way to adapt the bitrate to the content that take into consideration the past recent bitrate (a window). Many encoders also have a psychoacoustic model that take into consideration the previous psychoacoustic parameters/results.

Right now, when listening to samples, we usually encode a short sample with the encoder and listen to the result.
But the encoder needs some time to adapt its models (bitrate and psychoacoustic), and of course will not be able to properly adapt at the very beginning of the sample. If the sample hasn't been extracted from the full track, the encoder would have some time to adapt its models. It means that encoding a short sample is not totally representative of how this portion would be encoded in a "real" encode.

That is why I am proposing the following:
When encoding a short sample, allow a 1 second margin at the beginning and at the end  of the sample so the encoder can adapt its models. This should not be 1s of silence, but a real 1s of content.
For ease of use, this could even be taken into consideration by the testing tools.


For video, the vqeg already has a similar recommendation: 1s at the beginning and 1s at the end should not be considered for tests, in order to let encoders stabilize themselves.
Title: A new recommendation for listening tests?
Post by: rjamorim on 2004-11-21 13:14:56
http://www.hydrogenaudio.org/forums/index....ndpost&p=127328 (http://www.hydrogenaudio.org/forums/index.php?showtopic=12471&view=findpost&p=127328)  (second paragraph)

Not really the same issue, but it's somewhat similar.
Title: A new recommendation for listening tests?
Post by: Gabriel on 2004-11-21 13:53:38
What you are pointing to would be ideal, especially for 2 pass codecs targetting bitrate.

The problem is that it is sometimes impractical for various reasons (download time, impossibility to easily cut some formats,....)
Title: A new recommendation for listening tests?
Post by: rjamorim on 2004-11-21 14:04:01
Quote
The problem is that it is sometimes impractical for various reasons (download time, impossibility to easily cut some formats,....)
[a href="index.php?act=findpost&pid=255305"][{POST_SNAPBACK}][/a]


Yes, what draw me to give up that approach was precisely the difficulty of cutting some formats. MPC is impossible to cut, WMA too as far as I know. Atrac3 and Real Audio too, of course, since they are completely undocumented.

Also, the only available AAC cutter (BeSplit) can't cope with HE AAC and PS AAC, as far as I know.

That leaves me with MP3, Vorbis and LC AAC as cuttable formats...


And, of course, you can imagine what would happen to a tester that would cut only "cuttable" formats and just encode small samples in other formats. It's either all or nothing for fairness purposes.
Title: A new recommendation for listening tests?
Post by: Gabriel on 2004-11-21 14:16:31
Well, for wma, atrac3 and real cutting should not usually be an issue, as in group listening tests you were distributing thos samples as decoded files.
We could imagine encoding the full track, decoding, cutting, encode the cut part as flac.
Title: A new recommendation for listening tests?
Post by: 2Bdecided on 2004-11-22 10:37:12
I'm going on an unreliable memory, but isn't there a chance that even 1s extra isn't "fair" to WMA? Maybe 5 or 10 seconds is more appropriate? But them with the 30 second copyright limit, you're rather limiting the length of "useable" sample! Maybe 2 or 5 seconds would be OK.

If you want to do this, you'll either have to modify the test tools (like ABC/HR), or else include a simple .wav cutter, called from the .bat file which creates+decodes the test samples.

I think cutting encoded bitstreams is a non starter anyway - even if you could, wouldn't the frame boundaries be all over the place? Meaning the test tool has to hide the small extra/missing parts of each sample?

Cheers,
David.
Title: A new recommendation for listening tests?
Post by: Gabriel on 2004-11-22 14:33:58
Well, I suggested 1s because we have to find a reasonable value.
I do not know about wma encoders, but even 1s is not optimal to Lame, as the ATH adjustement might need more than 1s to stabilise.
But 1s is still way better than nothing and does not reduces the sample that much.

Regarding the testings themselves, I think that it would be very nice to have the tools automatically restrict the default time by 1s at both ends.
Title: A new recommendation for listening tests?
Post by: ChangFest on 2004-11-22 14:48:38
Quote
If you want to do this, you'll either have to modify the test tools (like ABC/HR), or else include a simple .wav cutter, called from the .bat file which creates+decodes the test samples.


Why not encode as whole files and when the test is distributed have an internal .wav cutter randomly select 30 seconds from the whole song file. That way from a statistical standpoint, sample would be more random and add a validity to the test. It would also solve Gabriel's suggestion as well.
Title: A new recommendation for listening tests?
Post by: freakngoat on 2004-11-23 20:13:05
Quote
Quote
If you want to do this, you'll either have to modify the test tools (like ABC/HR), or else include a simple .wav cutter, called from the .bat file which creates+decodes the test samples.


Why not encode as whole files and when the test is distributed have an internal .wav cutter randomly select 30 seconds from the whole song file. That way from a statistical standpoint, sample would be more random and add a validity to the test. It would also solve Gabriel's suggestion as well.
[a href="index.php?act=findpost&pid=255479"][{POST_SNAPBACK}][/a]


I suppose this would work in certain cases, but in general, I don't think that's a good idea, because usually there is a particular artifact at a particular time in the song that people would like to point out to others, and usually there are certain parts of the song that have problems while others don't. When doing listening tests, its common to discuss things like, "I hear a weird sound at 4.3 seconds" for example. Plus 30 seconds is a very long duration for a listening test.