Delayed short to long switching of 14496-3

Hi all,

I am working on psychoacoustic model II as described in ISO 14496-3, Subpart 4. I have been reading about the discrepancies in 14496-3, and I feel that even the clarifications of those discrepancies are somewhat gray. So I want to ask what is probably not a new question, but in a different way.

From my experiments, I have observed that the window switching decision is slow to switch back from the short window to the long window.

For the following time signal:
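    % t is a time vector in seconds, defined elsewhere (not shown in this post)
    TimeSignal = zeros(1, 1024*3.5);                                % 3.5 blocks of silence
    TimeSignal(1024*3.5+1 : 1024*4.5) = sin(2*pi*10000*t(1:1024));  % one block of a 10 kHz tone
    TimeSignal = [TimeSignal, TimeSignal];                          % repeat the pattern twice ...
    TimeSignal = [TimeSignal, TimeSignal];                          % ... and twice again (4 bursts in total)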


I have observed that the short window is triggered by the first attack (i.e. instantly), so long-to-short window switching seems fine.

But when the attack is removed, the short-to-long switch is not instant. It takes one sample block to switch back to the long window.

The equation of perceptual entropy in 14496-3 says,

perceptual_entropy = perceptual_entropy - ((w_high(b) - w_low(b)) * log10(part_energy_thr(b)/(part_energy(b)+1)));

The above equation says that the window switching decision is a function of the partition energy and of the partition energy threshold, where the threshold is the result of the 'pre-echo control' and 'threshold in quiet' calculations applied to the actual energy threshold of each partition.
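To make the equation concrete, here is a minimal Python sketch of the accumulation over partitions. It only transcribes the formula quoted above; the function name, the two-partition layout and all numeric values are made up for illustration and do not come from the standard.

```python
import math

def perceptual_entropy(w_low, w_high, part_energy, part_energy_thr):
    """Accumulate perceptual entropy over partitions, following the quoted formula.

    All thresholds must be positive, otherwise log10 is undefined.
    """
    pe = 0.0
    for b in range(len(w_low)):
        width = w_high[b] - w_low[b]                        # partition width in spectral lines
        ratio = part_energy_thr[b] / (part_energy[b] + 1.0)
        pe -= width * math.log10(ratio)
    return pe

# Hypothetical two-partition example (values chosen only to keep the arithmetic simple).
w_low = [0, 4]
w_high = [4, 12]
energy = [99.0, 999.0]

# Threshold equal to energy + 1: the ratio is 1, so nothing is added to the PE.
pe_high_thr = perceptual_entropy(w_low, w_high, energy, [100.0, 1000.0])

# Threshold far below the energy: the ratio is small, so the PE grows.
pe_low_thr = perceptual_entropy(w_low, w_high, energy, [1.0, 1.0])
```

The point of the sketch is that the PE depends only on the ratio of threshold to energy in each partition: anything that holds the threshold down relative to the energy raises the PE, and with it the tendency to select short windows.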

It appears that the effect of the previous sample(s) on the perceptual entropy is larger than it should be!
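One way to see how earlier blocks can leak into the current threshold is to sketch the pre-echo control step on its own. The form used below (limit the threshold to a multiple of the previous block's threshold, but never below the threshold in quiet) is my reading of that step, not a quotation from the standard. The factor `rpelev = 2.0`, the choice to store the already-limited threshold for the next block, and the numbers are all assumptions for illustration.

```python
def pre_echo_control(thr_now, thr_prev, thr_quiet, rpelev=2.0):
    """Limit each partition threshold to rpelev times the previous block's
    threshold, but never let it fall below the threshold in quiet.
    (Assumed form of the step; rpelev = 2.0 is an illustrative value.)"""
    return [max(q, min(t, rpelev * p))
            for t, p, q in zip(thr_now, thr_prev, thr_quiet)]

# One partition followed over five blocks: quiet, attack, loud, loud, quiet.
raw_thr = [1.0, 1000.0, 1000.0, 1000.0, 1.0]   # hypothetical unlimited thresholds
quiet = [0.5]                                  # hypothetical threshold in quiet

prev = raw_thr[0]
history = []
for t in raw_thr:
    cur = pre_echo_control([t], [prev], quiet)[0]
    history.append(cur)
    prev = cur   # assumption: the limited threshold is what gets stored for the next block

# history is [1.0, 2.0, 4.0, 8.0, 1.0]
```

Under these assumptions the threshold can only double from one block to the next, so after an attack it stays far below its unlimited value for several blocks, which keeps the threshold-to-energy ratio low in the perceptual entropy formula. Whether this is what delays the switch back in a given implementation depends on which threshold is stored as the previous one, and that is exactly the kind of detail where the readings of the standard differ.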


My question is: what are the reasons for the delay in switching from the short window to the long window?

Regards,
Nabsha

 

Reply #1
You will find a much better psychoacoustic model in 3GPP TS 26.403.