HydrogenAudio

Hydrogenaudio Forum => Scientific Discussion => Topic started by: theorist1 on 2013-06-14 04:32:58

Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: theorist1 on 2013-06-14 04:32:58
As a scientist (biophysics) and an audiophile, I have my feet in both worlds, and have thus enjoyed this (http://www.hydrogenaudio.org/forums/index.php?showtopic=57406) discussion.  Here are some thoughts:

I.  Open Access. 

Several have written it’s unfortunate that this paper, which is of general public interest, is not available to those without an AES subscription (that includes myself).  Agreed; and note that the landscape for scientific publication has been changing: researchers are increasingly realizing the value of “open access” papers – ones available freely, without a subscription.  Some scientific journals have gone entirely open access, while others (for an additional fee) offer the option of open access publication of your paper.  Let me suggest that, if AES doesn’t offer this option, it should, and that those publishing papers of broad interest should endeavor (if financially possible) to avail themselves of this.


II. ABX testing.

I assume from the discussion that this paper makes use of sequential ABX testing (by “sequential” I mean: present A, then B, then X, in sequence). I’d like to see basic research done into sequential ABX testing itself.  In particular, sequential ABX testing (double-blinding will be assumed throughout this discussion) is highly regarded for assessing perceptual discrimination, because (with proper statistical analysis) it eliminates false positives – subjects can’t “pass” (i.e., achieve successful discrimination in) a sequential ABX test unless they truly can reliably distinguish A from B.  But what about the converse?  Achieving accurate discrimination in sequential ABX testing is hard, because it requires accurate memory of both A and B when judging X.  Therefore I’d like to offer the hypothesis that subjects can “fail” sequential ABX testing even when they can successfully discriminate between A and B, and suggest a way of testing this.  Essentially, one would use simultaneous ABX testing to eliminate the memory requirement, and then compare the results to those for sequential ABX testing.  I can’t think of how you’d do simultaneous ABX testing with hearing, but it could certainly be done with vision (using color tiles).  The experiment would be divided into three phases:

1)   Find the smallest color difference that can be reliably distinguished during simultaneous ABX testing (present all three tiles – A, B, and X – simultaneously, so that the subjects can line them up next to each other for comparison).
2)   Repeat the experiment, using that same color difference, with sequential ABX testing.
3)   If the subjects fail phase 2 (thus confirming my hypothesis), find the smallest color difference that can be reliably distinguished using sequential ABX testing, and compare to the results of phase 1.

If it is found that subjects indeed fail to reliably distinguish colors when presented sequentially that they can reliably distinguish when presented simultaneously, this will suggest that sequential ABX testing may not be a good method for assessing our perceptual limits (i.e., for determining transparency) --  and not just for vision, but possibly for hearing as well.


III.  High-resolution audio and the Shannon sampling theorem.

Here I’ll apologize in advance if my questions are naïve – I’m no expert at signal processing.  Anyways:

Those who are dismissive of high-res audio based on theory alone typically cite the Shannon sampling theorem (typically misattributed to Nyquist), noting correctly that we can’t hear over 20 kHz (if that), and that we only need >40 kHz sampling to accurately reproduce this.  But 2 x max. frequency is not the theorem’s only requirement.  It’s my understanding that it also assumes an infinite signal, perfect sampling, and perfect interpolation.  I’ve never seen any of these assumptions discussed in this context, so I’d like to ask how much practical effect these requirements would have on Redbook (16 bit/44.1 kHz) vs. high-res conversion:

1) Infinite signal.  I assume we can effectively satisfy this with signal length >> 1/frequency, and that this is thus a non-issue.

2) Perfect sampling.  Clearly, sampling need not be perfect, but simply close enough to perfect to be transparent in amplitude and time.  Timing errors lead to jitter (right?).  So (and this is an engineering question): what’s the relationship between sampling rate and how easy it is to eliminate audible jitter errors?

3) Perfect interpolation.  Another engineering question: Naively, unless implementing near-perfect (i.e., transparent) interpolation is trivial, I would think it would be easier to achieve transparent interpolation with a higher sampling rate, because the points are more closely spaced.  Is transparent interpolation so easily achievable that the sampling rate has no practical effect?


IV.  The effect of mastering.

Many have mentioned that the reason high-res  disks do indeed sound better than CDs of the same performance is that they are mastered differently – as labors of love, and without the usual commercial pressures to alter the sound.  Given this, I think it would be a pubic service if someone could produce a Blu-Ray disk corresponding to the songs tested in this study, containing both the high-res and Redbook versions of each, and make it available for sale.  That way people could easily experiment for themselves.


V.  General thoughts.

I think the reason for the continued controversy about digital audio performance is that we don’t completely understand the biophysics of human hearing (which is why it continues to be an active area of research).  If we did, we would know, a priori, what constitutes a complete specification set sufficient to determine transparency, and thus could engineer transparent electronic gear (I say electronic because I am excluding transducers) without listening to it.  To the best of my understanding, this is not yet the case, since the errors that our auditory system is capable of detecting can be extraordinarily subtle, and what would constitute a complete set of scalar specifications sufficient to ensure transparency thus remains an open question.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: pdq on 2013-06-14 05:17:29
I don't know what you mean by "sequential" ABX testing. Any ABX test that I am aware of allows the subject to listen to any of the three samples, in any order, as many times as he/she pleases until ready to make a choice.

Also, you cannot "eliminate" false positives just as you cannot eliminate false negatives. You can only make the null hypothesis statistically improbable.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: theorist1 on 2013-06-14 05:34:57
I don't know what you mean by "sequential" ABX testing. Any ABX test that I am aware of allows the subject to listen to any of the three samples, in any order, as many times as he/she pleases until ready to make a choice.

Also, you cannot "eliminate" false positives just as you cannot eliminate false negatives. You can only make the null hypothesis statistically improbable.


The point is that it's sequential regardless of the order, and thus relies on memory -- you can't listen to all three tracks simultaneously.    If you reread that section of my post carefully I think you'll see what I'm getting at.

And yes, I could (and probably should) have written that sentence less casually and said "reduce the probability of a false positive (where a false positive is defined in statistics as 'the incorrect rejection of a true null hypothesis'*) to a very low level" instead of "eliminate," but the difference between those two statements has entirely no bearing on any of the arguments I've made.

*source: http://en.wikipedia.org/wiki/Type_I_and_type_II_errors (http://en.wikipedia.org/wiki/Type_I_and_type_II_errors)
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-06-14 06:15:36
The point is that it's sequential regardless of the order, and thus relies on memory

Memory is always required in order to asses audible differences; not just for testing.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: 2Bdecided on 2013-06-14 09:38:34
The point is that it's sequential regardless of the order, and thus relies on memory

Memory is always required in order to asses audible differences; not just for testing.
Except comparing two mono signals simultaneously, one fed to each ear via headphones. A niche pass-time, but possible.

...but we have plenty of ABX threads that theorist1 could read, where it's explained in painful detail how all the "problems" with ABX testing are either easily avoided, or also apply to any kind of listening test that involves humans listening.

Tests that don't involve humans listening may avoid such problems, but also have less relevance.

Cheers,
David.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Kees de Visser on 2013-06-14 10:33:52
(by “sequential” I mean: present A, then B, then X, in sequence).
No need to listen to each sample from beginning to end (like with an orchestra audition). AFAIK in audio ABX tests the subject is allowed to switch freely between A, B and X as fast and as many times (s)he wants. Since our auditory memory is short, I think this is even a requirement, although I can imagine situations where it can't work, like different tempi.

Except comparing two mono signals simultaneously, one fed to each ear via headphones.
I've thought about that as well. It's e.g. very handy for time-aligning two signals. But there's a risk of false positives, just imagine a 180° phase difference. When heard in isolation this is probably inaudible, but when compared directly with the original, it's evident
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-06-14 13:55:22
I hate to do this, since drawing allusions to sight usually doesn't work well and this desire to test simultaneously is no exception, but here goes:
http://en.m.wikipedia.org/wiki/Checker_shadow_illusion (http://en.m.wikipedia.org/wiki/Checker_shadow_illusion)
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: drewfx on 2013-06-14 17:54:58
Achieving accurate discrimination in sequential ABX testing is hard, because it requires accurate memory of both A and B when judging X.  Therefore I’d like to offer the hypothesis that subjects can “fail” sequential ABX testing even when they can successfully discriminate between A and B, and suggest a way of testing this.

Are you making the common error here of making a hypothesis and then asserting that your hypothesis must be disproven else ABX testing is flawed? Where is the evidence that someone can "successfully discriminate between A and B" but fail to ABX it? Because they "know" they heard a difference in sighted testing?

My recollection is that studies have shown audio memory starts to lose information at something on the order of 200ms, so ideally ABX tests try to keep switching below this (while not creating audible artifacts in the process). Search for "fast switching" - it has been discussed here.

Quote
3) Perfect interpolation.  Another engineering question: Naively, unless implementing near-perfect (i.e., transparent) interpolation is trivial, I would think it would be easier to achieve transparent interpolation with a higher sampling rate, because the points are more closely spaced.  Is transparent interpolation so easily achievable that the sampling rate has no practical effect?

Moving the sampling rate higher effectively shifts the interpolation errors to a higher frequency. This could indeed be beneficial if they are audible at a lower sampling rate. At HA, the TOS lay out certain requirements for discussing audibility.

Quote
Given this, I think it would be a pubic service if someone could produce a Blu-Ray disk corresponding to the songs tested in this study, containing both the high-res and Redbook versions of each, and make it available for sale.  That way people could easily experiment for themselves.

The standard test for this sort of thing only requires a higher res source and SW to convert it to a lower res and back. Various SW is readily available that can do this. IOW, people can indeed easily experiment for themselves.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: [JAZ] on 2013-06-14 18:32:26
I will try to address the theory and reasoning of your audio related questions:

1) Infinite signal.  I assume we can effectively satisfy this with signal length >> 1/frequency, and that this is thus a non-issue.

I don't recall if the sampling theorem talks about an infinite signal, but ... why would it be easier to sample a 2 hour video than a 3 minute song?.
On the other hand, when doing a transform to the frequency domain, the signal is assumed to repeat itself infinitely on each side.


2) Perfect sampling.  Clearly, sampling need not be perfect, but simply close enough to perfect to be transparent in amplitude and time.  Timing errors lead to jitter (right?).  So (and this is an engineering question): what’s the relationship between sampling rate and how easy it is to eliminate audible jitter errors?

We can be incorrect in two ways when sampling:
1) The time we do the sampling
2) The value we obtain for that sample.

Jitter affects 1 (concretely it means being inexact in determining the moment you have to sample, and how it varies between samples), and studies have demonstrated that nowadays jitter is a non-issue. Think about it... CD = 44Khz, DVD= 96Khz, PC computer > 1GHz. (A bad clock on a PC wouldn't mean that it swings its speed, but would completely break the expected response time of internal buses and update of RAM memory).
It is much easier to get a very exagerated jitter with a Vinyl disc than with digital audio.

On 2, the effect is the noise floor of the signal. It basically causes noise (different types depending on how the incorrect value is obtained, but no more than that).

I assume you already understand the sampling rate. ( amount of samples obtained in a second, or its inverse, the period: how much time elapsed between getting each sample).


3) Perfect interpolation.  Another engineering question: Naively, unless implementing near-perfect (i.e., transparent) interpolation is trivial, I would think it would be easier to achieve transparent interpolation with a higher sampling rate, because the points are more closely spaced.  Is transparent interpolation so easily achievable that the sampling rate has no practical effect?


Interpolation only applies on resampling. What is done in a DAC is a reconstruction filter. A reconstruction filter can (and usually does) oversample to allow using a less complex filter with the same results, but has no other complication.

A perfect reconstruction filter would be a brickwall filter with the frequency just below samplerate/2 ( samplerate/2 in fact cannot be sampled perfectly by the theorem). Such a filter is also expected on the input, so what is sampled already is not a perfect signal due to the inability to have a perfect filter.

Said that, the not-exactness of the filters does not make invalid the theorem. What implies is that there is a band of frequencies that cannot be sampled and/or reconstructed exactly due to the progressivity of the filter ( a decaying line, instead of an abrupt cut).
What is before the filter frequency (not all frequencies, but the ones where the filter starts) is just attenuated, while what is after the frequency is considered aliasing (Which represents a mirroring of the frequencies).

Since it's not difficult to find reconstruction filters working 20.5Khz on 44Khz sampling rate, and at 22Khz on 48Khz, we are talking about a reduced imperfection.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: pdq on 2013-06-14 18:44:13

1) Infinite signal.  I assume we can effectively satisfy this with signal length >> 1/frequency, and that this is thus a non-issue.

I don't recall if the sampling theorem talks about an infinite signal, but ...

The sampling theorem specifies a band-limited signal less than f/2, and I suspect there is some theory that says that a signal that has a beginning and an end must have components greater than f/2.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: splice on 2013-06-15 00:10:41
... The sampling theorem specifies a band-limited signal less than f/2, and I suspect there is some theory that says that a signal that has a beginning and an end must have components greater than f/2.


A signal with a beginning and an end will contain components in addition to those of the signal. They need not exceed f/2.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: theorist1 on 2013-06-15 03:11:33
Thanks for all your comments!

The point is that it's sequential regardless of the order, and thus relies on memory

Memory is always required in order to asses audible differences; not just for testing.


Not necessarily.  In principle, one could play a pristine recording, and that same recording to which very low levels of a very irritating form of audio distortion had been added, while monitoring a specific physiological attribute.  With a sufficiently large N, and with the right attribute, one might find significant differences in physiological response to the pristine and non-pristine versions, even if the subjects subsequently failed to distinguish them based on ABX testing. 

Now some might dismiss this as too theoretical, and if you're doing engineering or applied research it might be; but for one with a basic research mindset, these questions are intriguing.


Achieving accurate discrimination in sequential ABX testing is hard, because it requires accurate memory of both A and B when judging X.  Therefore I’d like to offer the hypothesis that subjects can “fail” sequential ABX testing even when they can successfully discriminate between A and B, and suggest a way of testing this.

Are you making the common error here of making a hypothesis and then asserting that your hypothesis must be disproven else ABX testing is flawed?


No, I've done the opposite: I've made a hypothesis and then asserted that, if my hypothesis is confirmed, it suggests that "sequential ABX testing may not be a good method for assessing our perceptual limits (i.e., for determining transparency) -- and not just for vision, but possibly for hearing as well."

Where is the evidence that someone can "successfully discriminate between A and B" but fail to ABX it? Because they "know" they heard a difference in sighted testing?

I never said that evidence exists -- I made a plausibility argument, and proposed an experiment that would test this.



3) Perfect interpolation.  Another engineering question: Naively, unless implementing near-perfect (i.e., transparent) interpolation is trivial, I would think it would be easier to achieve transparent interpolation with a higher sampling rate, because the points are more closely spaced.  Is transparent interpolation so easily achievable that the sampling rate has no practical effect?

Moving the sampling rate higher effectively shifts the interpolation errors to a higher frequency. This could indeed be beneficial if they are audible at a lower sampling rate. At HA, the TOS lay out certain requirements for discussing audibility.


Interesting.



Given this, I think it would be a pubic service if someone could produce a Blu-Ray disk corresponding to the songs tested in this study, containing both the high-res and Redbook versions of each, and make it available for sale.  That way people could easily experiment for themselves.

The standard test for this sort of thing only requires a higher res source and SW to convert it to a lower res and back. Various SW is readily available that can do this. IOW, people can indeed easily experiment for themselves.


Yes, but it would broaden the number of people who would participate in such a test, which I find to have general public education value, since there are consumers that would be happy to buy the Blu-Ray and do the test, but that don't want to mess with downloading and learning how to use conversion software.  Also, I understand down-conversion is not trivial, so the Blu-Ray eliminates the possibility that some will hear differences because they purchased flawed software or used it incorrectly.  In addition, there are many consumers that own high-res capable Blu-Ray players but that don't have high-res capable outboard DACs, or players with that can act as such (and either of the latter are required for the approach that you suggest) .
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: theorist1 on 2013-06-15 03:12:52

I will try to address the theory and reasoning of your audio related questions:


1) Infinite signal.  I assume we can effectively satisfy this with signal length >> 1/frequency, and that this is thus a non-issue.

I don't recall if the sampling theorem talks about an infinite signal, but ... why would it be easier to sample a 2 hour video than a 3 minute song?.
On the other hand, when doing a transform to the frequency domain, the signal is assumed to repeat itself infinitely on each side.


This is entirely beyond the scope of my expertise, but the short response to your question about 2 hours vs. 3 minutes is that, based on the reading I just did, it appears not to be about length per se, but the extent to which a signal is time-varying (and essentially all music signals are time-varying).  This nicely-written Wikipedia article helps (http://en.wikipedia.org/wiki/Time–frequency_analysis):

"The practical motivation for time–frequency analysis is that classical Fourier analysis assumes that signals are infinite in time or periodic, while many signals in practice are of short duration, and change substantially over their duration. For example, traditional musical instruments do not produce infinite duration sinusoids, but instead begin with an attack, then gradually decay. This is poorly represented by traditional methods, which motivates time–frequency analysis."

If you read further you will see that the various formulations used to obtain a time-frequency distribution function each have strengths and weaknesses--none is prefect.  And if you read further still, the article seems to indicate that doing a Shannon reconstruction on a time-varying audio signal also requires implementing time-frequency analysis, which means that even with perfect sampling and a perfectly bandwidth-limited signal, we can't perfectly reconstruct the analog audio signal.  So this would motivate the question: what is the effect of sampling rate and bit depth on the nature of the errors introduced by this time-frequency analysis?





3) Perfect interpolation.  Another engineering question: Naively, unless implementing near-perfect (i.e., transparent) interpolation is trivial, I would think it would be easier to achieve transparent interpolation with a higher sampling rate, because the points are more closely spaced.  Is transparent interpolation so easily achievable that the sampling rate has no practical effect?


Interpolation only applies on resampling. What is done in a DAC is a reconstruction filter. A reconstruction filter can (and usually does) oversample to allow using a less complex filter with the same results, but has no other complication.

A perfect reconstruction filter would be a brickwall filter with the frequency just below samplerate/2 ( samplerate/2 in fact cannot be sampled perfectly by the theorem). Such a filter is also expected on the input, so what is sampled already is not a perfect signal due to the inability to have a perfect filter.

Said that, the not-exactness of the filters does not make invalid the theorem. What implies is that there is a band of frequencies that cannot be sampled and/or reconstructed exactly due to the progressivity of the filter ( a decaying line, instead of an abrupt cut).
What is before the filter frequency (not all frequencies, but the ones where the filter starts) is just attenuated, while what is after the frequency is considered aliasing (Which represents a mirroring of the frequencies).

Since it's not difficult to find reconstruction filters working 20.5Khz on 44Khz sampling rate, and at 22Khz on 48Khz, we are talking about a reduced imperfection.



Sorry, I'm afraid I don't understand your response here.  It appears to be addressing bandwidth limitations and aliasing, with is a separate issue from interpolation.  Also, regarding your statement that "interpolation only applies on resampling":  From what I've read, the "perfect interpolation" requirement of the Shannon theorem is indeed referring to D->A reconstruction.  From http://en.wikipedia.org/wiki/Nyquist–...mpling_theorem: (http://en.wikipedia.org/wiki/Nyquist–Shannon_sampling_theorem:)

"Methods that reconstruct a continuous function from the x(nT) sequence are called interpolation. As will be shown below, the mathematically ideal way to reconstruct x(t) involves the use of sinc functions.... Each sample in the sequence is replaced by a sinc function centered on the time axis at the original location of the sample (nT), and the amplitude of the sinc function is scaled to the sample value, x(nT). Then all the sinc functions are summed into a continuous function. A mathematically equivalent method is to convolve one sinc function with a series of Dirac delta pulses, weighted by the sample values. Neither method is numerically practical. Instead, some type of approximation of the sinc functions, finite in length, has to be used. The imperfections attributable to the approximation are known as interpolation error.
Practical digital-to-analog converters produce neither scaled and delayed sinc functions nor ideal Dirac pulses. Instead they produce a piecewise-constant sequence of scaled and delayed rectangular pulses, usually followed by a "shaping filter" to clean up spurious high-frequency content."

So I suppose the bottom line is that, even with perfect sampling and a perfectly bandwidth-limited signal, the Shannon theorem (and the attendant engineering practicalities) tells us there are unavoidable distortions introduced by both the time-varying nature of the music signal (which necessitates the implementation of time-frequency analysis) and by interpolation errors.  drewfx tells us that higher sampling rates shift the latter to higher frequencies.  So that leaves me wondering what the effects of sampling rate and bit depth would be on the former.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-06-15 07:16:09
If it is found that subjects indeed fail to reliably distinguish colors when presented sequentially that they can reliably distinguish when presented simultaneously, this will suggest that sequential ABX testing may not be a good method for assessing our perceptual limits (i.e., for determining transparency) --  and not just for vision, but possibly for hearing as well.

This is, I fear, preposterous.  One can put colors next to each other, because vision is a spatial sense.  Sound is a time-domain sense that is perceived in the frequency domain by the ear.  The proper analogy to colors next to each other is SOUNDS ADJACENT IN TIME, which is exactly what ABX testing does.

There is rather some research on discrimination with time lapse, and it is quite true that a break between A/X or B/X (or even A/B) will disrupt the subject, so time proximity WITHOUT glitches is absolutely requisite.  This is because partial loudnesses, rather than time domain waveforms, is what must be recalled, and the first level of memory for such is under 200 milliseconds. So proximate IN TIME presentation is the relevant, germane thing to do.

Since people do perform with such testing down to physical limits, I don't think there is a great deal of room left there for problems.
Quote
III.  High-resolution audio and the Shannon sampling theorem.

1) Infinite signal.  I assume we can effectively satisfy this with signal length >> 1/frequency, and that this is thus a non-issue.

Yep.
Quote
2) Perfect sampling.  Clearly, sampling need not be perfect, but simply close enough to perfect to be transparent in amplitude and time.  Timing errors lead to jitter (right?).  So (and this is an engineering question): what’s the relationship between sampling rate and how easy it is to eliminate audible jitter errors?

There's an oldish AES paper that addresses this very nicely. It's not just the amount of jitter, it's also the bandwidth of the jitter, i.e. the spectrum of the deviation from the mean sampling rate, that matters. It's much like FM modulation, only where all sidebands modulate back down into the baseband.  It's not trivial, but it's entirely possible to make sure jitter isn't an issue, and most hardware has (finally!) killed this problem dead.
Quote
3) Perfect interpolation.  Another engineering question: Naively, unless implementing near-perfect (i.e., transparent) interpolation is trivial, I would think it would be easier to achieve transparent interpolation with a higher sampling rate, because the points are more closely spaced.  Is transparent interpolation so easily achievable that the sampling rate has no practical effect?

This is strictly a question of filtering. Filter design for this problem is accomplished, and in fact most 44/16 DAC's use a very high sampling rate with a low-bit-count DAC and a lot of digital filtering for this reason.  While I have seen some very bad filters (um, in computer audio chains for instance) the "how" here is well known, it's a question of good engineering practice, and the "how" goes all the way back to Crochiere and Rabiner.
Quote
IV.  The effect of mastering.

Many have mentioned that the reason high-res  disks do indeed sound better than CDs of the same performance is that they are mastered differently – as labors of love, and without the usual commercial pressures to alter the sound.  Given this, I think it would be a pubic service if someone could produce a Blu-Ray disk corresponding to the songs tested in this study, containing both the high-res and Redbook versions of each, and make it available for sale.  That way people could easily experiment for themselves.

I've seen some recordings that are two-layer, where the high-rez layer wasn't compressed to (*&(*& and the redbook layer was. 'nuff said?
Quote
V.  General thoughts.

I think the reason for the continued controversy about digital audio performance is that we don’t completely understand the biophysics of human hearing (which is why it continues to be an active area of research).  If we did, we would know, a priori, what constitutes a complete specification set sufficient to determine transparency, and thus could engineer transparent electronic gear (I say electronic because I am excluding transducers) without listening to it.  To the best of my understanding, this is not yet the case, since the errors that our auditory system is capable of detecting can be extraordinarily subtle, and what would constitute a complete set of scalar specifications sufficient to ensure transparency thus remains an open question.


Actually we understand the sensitivities of the hearing apparatus quite well, but they are not easily mapped into simplistic things like frequency response or signal to noise ratio.

I gave a plenary talk to InfoComm this Monday on that very subject.  It's much like the Heyser Lecture I gave at last fall's AES. You can find a link to it at www.aes.org/sections/pnw if you want to know more.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: [JAZ] on 2013-06-15 16:43:45
So this would motivate the question: what is the effect of sampling rate and bit depth on the nature of the errors introduced by this time-frequency analysis?

The majority of filters work in the time domain. The fourier transform (and so, the frequency domain) is not necessarily involved. As I wrote, i knew that fourier transform expects an infinite and continuous signal (That's why windowing becomes necessary, but that's another subject).

But since you ask:
The sampling rate affects the bandwith that a specific transform will show. You can do an FFT of 1024 samples on a 11Khz signal and a 44Khz signal. Both will return 512 bands (and 512 phases).
The FFT of the former is about 100milliseconds and the highest band is half of samplerate (5.5Khz). On the latter it's about 25ms and the highest band is half of samplerate (22Khz).
And I already wrote about bit depth. You would see the effects of bit depth as the spectrum bottom (going up or down depending if bits decrease or increase).



Also, regarding your statement that "interpolation only applies on resampling":  From what I've read, the "perfect interpolation" requirement of the Shannon theorem is indeed referring to D->A reconstruction.

I sometimes make differences in words that don't really have a difference. I meant interpolation as in the process of generating a different signal out of the original one (thinking only in the digital domain).
You quoted interpolation as in the process of making a continuous signal out of a sampled signal. Sure, that's also interpolation, but that's what the DAC does, and it mostly implies the filter that i talked about. And yes, nowadays this is done beyond the threshold of audibility.

(edit:spelling)
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Porcus on 2013-06-15 18:38:51
There is a sampling/interpolation/hearing issue that can only be resolved by studying the ear's response, namely to what extent the hair cells work so much as idealized strings that the sine functions are the appropriate basis. Such studies can, for all that I know, have been carried out explicitely; otherwise, a layman's gut feeling is that the effect is at most worth a slight miscalibration or margin of conservatism. I would be grossly surprised if this could possibly increase the 20 kHz figure by those ten percent required to break through the CD limit.

Anyway, the argument is that the canonical choice of sine functions is due to the wave equation, deduced by a 'spherical cow in vacuum' theoretical ideal string, which the hair cells are not. The periodic function that, around the 20 kHz mark, is "least painful given the hearing threshold" is likely not exactly the sine, but likely so close to that it is nothing to worry about for the purpose of the "20" figure.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-06-16 07:34:20
Anyway, the argument is that the canonical choice of sine functions is due to the wave equation, deduced by a 'spherical cow in vacuum' theoretical ideal string, which the hair cells are not. The periodic function that, around the 20 kHz mark, is "least painful given the hearing threshold" is likely not exactly the sine, but likely so close to that it is nothing to worry about for the purpose of the "20" figure.



I have no idea whatsoever you're talking about here.  There is no "canonical" choice for a transform, and a Discrete Fourier Transform uses both sines and cosines, and this has nothing to do with an idea string at all.

The basis vectors for an FFT have to do with mathematics, not string vibrations.

The hair cells, which work somewhat differently, have nothing to do with "least painful given the hearing threshold", either, that I can think of.

So what are you asking?
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: phofman on 2013-06-16 19:49:13
Think about it... CD = 44Khz, DVD= 96Khz, PC computer > 1GHz.


The clock responsible for the jitter level is an independent hardware device (provided by the soundcard or provided by the motherboard for the USB controller for USB adaptive mode), irrelevant to the CPU clock. CPU cannot influence the final clock directly (except for some hardly predictable induced power supply noise due to CPU consumption fluctuations).
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-06-17 07:00:41
So this would motivate the question: what is the effect of sampling rate and bit depth on the nature of the errors introduced by this time-frequency analysis?

So that we're crystal clear, time-frequency analysis does not apply to sampling and subsequent reconstruction.

Assuming your signal is band-limited below half the sample frequency, the only errors are due to quantization, imperfect clocking and an imperfect response of the reconstruction filter.  These days this can be accomplished without any audible degredation fairly cheaply.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: knutinh on 2013-06-17 07:44:51
Scientific curiosity aside:
If it was found that some audiophile remedy (96kHz sampling, fancy cables, whatnot) produced audible differences if compared on shorter time-scales than 200ms, or when compared simultaneously via 2-ch headphones, what would the practical consequence be?

For me to enjoy expensive speakers over inexpensive speakers, there must be some long-term memory effect, or at least some change in mood, happiness, etc. If not, then my enjoyment (aside from "owners happiness" etc) would be exactly the same post purchase as pre purchase of those expensive boxes, at least 200ms after installing the new ones. What is the fun in that?

If you want to explore new (?) subjective testing areas, I would vote for long-term testing of (sub) conscious "happiness" when blinded to make etc when installing audio component A over audio component B. Because that would be the best indicator on choices such as "is it worth it?", "should I spend my money on cables or flowers for my wife" kind of questions, assuming that people don't really like paying for brand, looks, marketing etc.

Does speakers with a smooth and wide frequency response/polar pattern improve sleep, reduce depression, make people argue less with their spouse, reduce suicide rates and stuff like that? My guess is that they do, for a small selection of people, and perhaps to such a degree that we will never be able to realistically measure it?

-k
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Porcus on 2013-06-17 09:56:19
If [...] produced audible differences if compared on shorter time-scales than 200ms, or when compared simultaneously via 2-ch headphones, what would the practical consequence be?

For me to enjoy expensive speakers over inexpensive speakers, there must be some long-term memory effect, or at least some change in mood, happiness, etc. If not, then my enjoyment (aside from "owners happiness" etc) would be exactly the same post purchase as pre purchase of those expensive boxes, at least 200ms after installing the new ones. What is the fun in that?

If you want to explore new (?) subjective testing areas, I would vote for long-term testing of (sub) conscious "happiness" when blinded to make etc when installing audio component A over audio component B. Because that would be the best indicator on choices such as "is it worth it?"


Long-term, as in a six hour listening session, or as in living with it for a month?  The latter would be akin to ordinary drug testing, and ... hm, what is the disease?
Has anyone ever gotten positive results measuring happiness out of any reasonably comparable setup?
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Porcus on 2013-06-17 10:53:41
I have no idea whatsoever you're talking about here.


You seem to have the idea that I actually did write choice of 'transform' ...

A 19 kHz symmetric triangular and a 19 kHz sine are different. What does that difference mean in practice? Likely you would get one of two answers: (i) nothing, or (ii) if anything, the triangular is worse, as its useless higher order components are good for nothing and potentially bad for something.
Without choice of basis, it is absolutely no reason to claim that the latter has higher order components at all. It would not be so if we used triangular functions ( http://dx.doi.org/10.1016/S0898-1221(99)00075-9 (http://dx.doi.org/10.1016/S0898-1221(99)00075-9) - I am somewhat surprised that this had publishable news value as a research article as late as 1999 ...).  We could do that, but there is a good reason why we don't. (Here's a lecturer who has or at last has a fond hope to have, students bright enough to spot that it isn't obvious: http://www.cv.nrao.edu/course/astr534/FourierTransforms.html (http://www.cv.nrao.edu/course/astr534/FourierTransforms.html) .)
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-06-18 03:10:41
A 19 kHz symmetric triangular and a 19 kHz sine are different.

D'oh. So?
Quote
What does that difference mean in practice? Likely you would get one of two answers: (i) nothing, or (ii) if anything, the triangular is worse, as its useless higher order components are good for nothing and potentially bad for something.

If I make a transform from triangle waves, which can by done by integrating a Hadamard transform, there will be no higher order components.

The ear, on the other hand, has a very strong resonant response, as in "coupled second order sections" with some delayed feedback. So these "higher order components" that one would see in a triangle wave as analyzed by a Fourier basis will simply not be captured by the ear unless they are within the bandwidth of the ear.

So, having said that, what's your point?  I still see no point beyond a fallacious appeal to ignorance.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: knutinh on 2013-06-18 10:19:54
Long-term, as in a six hour listening session, or as in living with it for a month?  The latter would be akin to ordinary drug testing, and ... hm, what is the disease?
Has anyone ever gotten positive results measuring happiness out of any reasonably comparable setup?

As a consumer, audible differences between e.g. mp3 and CD that is only audible on a timescale of 200ms is irrelevant. What is relevant is whether those differences are perceived (or leads to a changed state of mind) on a timescale of months and years.

If I am to buy $1000 loudspeakers, I would like the experience when listening to Miles Davis this august to be "better" than if I had bought $100 loudspeakers. If it is not, then I would rather spend those money elsewhere.

-h
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Porcus on 2013-06-18 10:41:53
If I make a transform from triangle waves, which can by done by integrating a Hadamard transform, there will be no higher order components.


If you use triangulars as basis, there will by definition be no higher order component.


Now is the hearing upper limint of human hearing completely independent of waveform, or is it not? And, is this a rationalizable fact following a priori without any empirical study, or is it not?
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: knutinh on 2013-06-18 13:01:57
Now is the hearing upper limint of human hearing completely independent of waveform, or is it not?

It appears that humanity, so far, have been unable to show audible differences between the CD format and the DVD-A/SACD formant, a prominent difference between the two is that the latter allows extension far beyond the 20kHz or so of CD.

One would think that the variation of waveshape found in such studies would be sufficient, although the amount of energy beyond 20kHz tends to be low (as is the case with most live music).

-k
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Porcus on 2013-06-18 23:26:02
It appears that humanity, so far, have been unable to show audible differences between the CD format and the DVD-A/SACD formant, a prominent difference between the two is that the latter allows extension far beyond the 20kHz or so of CD.


.... indicating that the CD format is “more than good enough”, but not what would be “precisely good enough”.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: knutinh on 2013-06-19 06:24:10
It appears that humanity, so far, have been unable to show audible differences between the CD format and the DVD-A/SACD formant, a prominent difference between the two is that the latter allows extension far beyond the 20kHz or so of CD.


.... indicating that the CD format is “more than good enough”, but not what would be “precisely good enough”.

Indicating that the decades of research into human auditory perception is right, a lowpass filter at 15 kHz or 25 kHz or whatever (depending on age, gender, genes, ...) seems to be transparent. Any claims that humans can somehow take advantage of ultra-sound in certain non-linear, waveform-dependent ways are less believable as a result.

At my age, 15 kHz tends to be sufficient.

-k
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-06-20 06:26:21
Now is the hearing upper limint of human hearing completely independent of waveform, or is it not?


Uh, of course it's not "independent of waveform" since the domain the ear responds in is 4th order resonance, give or take, and that is going to map directly to a fourier basis.

So you need to use a fourier basis, not "any waveform".
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: krabapple on 2013-06-20 11:17:49
ly.  In principle, one could play a pristine recording, and that same recording to which very low levels of a very irritating form of audio distortion had been added, while monitoring a specific physiological attribute.  With a sufficiently large N, and with the right attribute, one might find significant differences in physiological response to the pristine and non-pristine versions, even if the subjects subsequently failed to distinguish them based on ABX testing.


And away we go.... 

http://en.wikipedia.org/wiki/Hypersonic_effect (http://en.wikipedia.org/wiki/Hypersonic_effect)
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: antz on 2013-08-04 13:46:01
If it is found that subjects indeed fail to reliably distinguish colors when presented sequentially that they can reliably distinguish when presented simultaneously, this will suggest that sequential ABX testing may not be a good method for assessing our perceptual limits (i.e., for determining transparency) --  and not just for vision, but possibly for hearing as well.

I'm a little late to this discussion but the partial quote above seems to contain a fundamental flaw, my highlighting.

I would suggest that "sequential" ABXing and "simultaneous" ABXing of colours would show a large discrepancy because most people have very poor colour memory. I spent a large portion of my working life in colour-related industry and only ever came across a single person whose colour-memory was uncannily good. That aside, it surely is a flawed idea that hearing must be also be so afflicted, unless there is evidence to suggest it. Is there any evidence either way?

In addition, it is patently obvious that "simultaneous" ABXing is possible with coloured tiles, whereas with audio samples it is inherently somewhere between difficult and impossible. The fact that a person's ear could discriminate between samples if they were simultaneous is irrelevant since that isn't feasible. What matters is if they can "hold" the audio memory or not.

Let me suggest a more appropriate analogy to colour. Let's say you decide to re-paint your living room walls. You pay a visit to a paint supplier and select your desired colour from a chart. A machine then dispenses the required mixture and off you go and paint your walls. Knowing the technology involved, I can almost guarantee the walls will not match the chart exactly. Will you notice this? Highly unlikely, assuming there was no grave error in the process. Would you be able to ABX the chart and your walls? Highly likely. Does this make the wall-painting process "non-transparent", in the audio-ABX sense? Not a bit.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Light-Fire on 2013-08-04 17:43:51
...sequential ABX testing is hard, because it requires accurate memory of both A and B when judging X...


A NON blind test also requires accurate memory of both A and B. So your argument is irrelevant.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-08-04 18:31:07
If

http://www.hydrogenaudio.org/forums/index....st&p=838137 (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=101500&view=findpost&p=838137)
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-08-04 21:42:20
The fact that a person's ear could discriminate between samples if they were simultaneous is irrelevant since that isn't feasible.


And what level-roving experiments show is that you want clean, clickless near-instantaneous switching, no more, no less.

As to the title, the Shannon theorem is right, and ABX testing is as sensitive as anything else when it's done right.

I've seen many miserable excuses for why that isn't, but I am frankly tired of them, it's the same old mistakes over and over again.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: antz on 2013-08-04 22:14:16
The fact that a person's ear could discriminate between samples if they were simultaneous is irrelevant since that isn't feasible.


And what level-roving experiments show is that you want clean, clickless near-instantaneous switching, no more, no less.

As to the title, the Shannon theorem is right, and ABX testing is as sensitive as anything else when it's done right.

I've seen many miserable excuses for why that isn't, but I am frankly tired of them, it's the same old mistakes over and over again.

I didn't word that well - a better wording would have been: The fact that a person's ear might be able to discriminate between samples if they were simultaneous is irrelevant since that isn't feasible.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: mzil on 2013-08-04 22:54:59
Quote
And what level-roving experiments show is that you want clean, clickless near-instantaneous switching, no more, no less

In my experience, based on my perception, lightning fast transitions are indeed critically important to being able to audibly discern subtle differences between samples [such as level or minor tonal balance variations] however I've always assumed that the reason we want to avoid transitional clicks between A, B, and X is because they potentially may act as a "tell", a giveaway as to the samples' identity, not because hearing the minor clicks themselves diminishes our sensitivity to picking up on small differences. Correct?

What I'm getting at is that consistent, minor ticks/clicks which are first verified, pre-test, to be audibly identical each time one switches between samples [so it can't act as a "tell"] is perhaps not ideal, however it is far less destructive to our sensitivity, in my mind, than another solution I've sometimes seen used to overcome minor clicks in some tests, where a quick volume ramp is introduced at each transition to obscure any such clicks. The problem being the time delay introduced by that volume up ramp, even if it is only one second in duration, reduces our sensitivity to hearing minor changes, significantly. [Also, hearing the volume ramp itself, or a beep, or pink noise, or any alternative sounds to the targets, for that matter, may simply throw off our concentration and focus to the task at hand.]
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-08-05 03:22:18
however I've always assumed that the reason we want to avoid transitional clicks between A, B, and X is because they potentially may act as a "tell", a giveaway as to the samples' identity, not because hearing the minor clicks themselves diminishes our sensitivity to picking up on small differences. Correct?


No, actually, if you'll think briefly about the loudness of a click, in addition to being a tell, it ALSO mucks up short-term loudness memory, so it reduces sensitivity AND provides a leak in the DBT protocol. It's a twofer.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-08-05 04:33:35
How can something that serves to make distinguishing two samples more difficult also serve as a tell?

It doesn't.

ABX testing, as I know it, hands the power of switching to the testee. A click will only be a tell if it occurs for one sample and not the other for both the unknown X and the corresponding known (A or B, but not both). That a click occurs as the result of switching is meaningless in alerting the testee to the switch since the testee initiated it.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: 2Bdecided on 2013-08-05 10:46:09
Two mono samples, A and B.

sequential: Listen to A then B.

simultaneous: Listen to A in one ear and B in the other.

Some differences will be far easier to hear in the simultaneous presentation merely due to the binaural masking level difference (BMLD).

I can't think of another way of doing simultaneous presentation that isn't a nonsense, and that one is slightly a nonsense because mostly what you're measuring is BMLD, not simultaneous vs sequential.

Cheers,
David.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-05 19:14:58
In light of the limits of serial audio testing raised by the OP, I'm pondering two possible conclusions:

"We've reached the limits of differences the subjects can hear"

vs.

"We have reached the limits of resolution for using human memory as the measuring instrument in an AB test".

Though the OP's point is generating some discussion, it seems to be sidestepping his basic criticism (which seems logical, particularly when considering current knowledge about perception).

Thoughts? 
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-08-05 19:23:23
I see a distinction without a difference.  Subjective interpretation requires memory.  If memory is suspect then so are the impressions born from it.

This was already addressed, was it not?
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-05 19:59:49
I see a distinction without a difference.  Subjective interpretation requires memory.  If memory is suspect then so are the impressions born from it.

This was already addressed, was it not?


I guess not in the way I was hoping or expecting from the dbt experts.  (Great forum, by the way.  I'm not trolling, just trying to learn and generate some discussion.  Hope this horse ain't dead.) 

DBT fails to address the "unreliable memory" problem in any way, as it relies on such for it's data input.  The tests indicate that gross differences are clearly audible.  As those differences shrink, more and more fail the test.  When you reach the useful resolution of a memory based AB test, most people fail.  But then which conclusion posted above should be drawn from the results?  It seems an important distinction, as the validity of the method is assumed in the first while the second one explicitly acknowledges it's (possible, likely?) limits.





Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: 2Bdecided on 2013-08-05 20:59:25
So you are exploring the possibility that the reason no difference is reported is not because we cannot hear the difference, but because we cannot remember it for a few seconds.

If we have forgotten it before we have even drawn another breath, how do we subsequently know that we ever heard it?

Cheers,
David.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-05 22:05:43
So you are exploring the possibility that the reason no difference is reported is not because we cannot hear the difference, but because we cannot remember it for a few seconds.


Second clause mainly.  I don't want you or anyone to read my posts as any sort of apologia to the audiophile crowd.  I'm more curious if the tools being used to support claims are properly calibrated, so to speak.  The serial listening criticism seems to have teeth, even if it's off in the academic weeds for all practical purposes.

If we have forgotten it before we have even drawn another breath, how do we subsequently know that we ever heard it?


Now you're getting a bit too far into the weeds.  I did not want an epistemic discussion of what we know, I simply wanted to explore what seems like a legit criticism of dbt from those in the know.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: saratoga on 2013-08-05 22:27:18
Now you're getting a bit too far into the weeds.  I did not want an epistemic discussion of what we know, I simply wanted to explore what seems like a legit criticism of dbt from those in the know.


Generally when ABXing, you can switch rapidly (even instantly) between samples, so if you're going to make this argument you would be diving into the weeds and arguing that an instant is too long.

I think mainly this is just an argument against setting up tests poorly such that subjects cannot easily switch between samples.  With proper setup, this should not be an issue.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: 2Bdecided on 2013-08-06 10:15:42
People who look for reasons to doubt DBTs generally believe they can hear an audible difference sighted but cannot hear an audible difference blind. They are trying to discover some difference in the testing methodology (other than sighted-vs-blind) to justify their belief that they really hear a real audible difference. They want this other difference in testing methodology to be the reason the believed audible difference vanishes in blind testing, rather than accepting that it vanishes simply due to the blinding.

In both kinds of tests, people need to rely on their memory. If there is any difference, it is that they need to rely on memory less so in some DBT designs (as saratoga says), not more. Hence reliance on memory is not this "other" testing methodology difference that they seek.

In short, I think you're barking up the wrong tree.

Cheers,
David.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-06 16:47:02
People who look for reasons to doubt DBTs generally believe they can hear an audible difference sighted but cannot hear an audible difference blind. They are trying to discover some difference in the testing methodology (other than sighted-vs-blind) to justify their belief that they really hear a real audible difference. They want this other difference in testing methodology to be the reason the believed audible difference vanishes in blind testing, rather than accepting that it vanishes simply due to the blinding.

In both kinds of tests, people need to rely on their memory. If there is any difference, it is that they need to rely on memory less so in some DBT designs (as saratoga says), not more. Hence reliance on memory is not this "other" testing methodology difference that they seek.

In short, I think you're barking up the wrong tree.

Cheers,
David.


I'm not one who claims to hear differences, nor do I really care much about those that do.  What I'm trying to discuss is not people suffering delusions, but the validity and reliability of a test based on human sensory impressions/conscious awareness/memory as the source of data. 

I guess I like my data to be more raw and empirical.  Brains are really amazing biological devices, but they make for piss poor lab equipment.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: mzil on 2013-08-06 17:15:14
How can something that serves to make distinguishing two samples more difficult also serve as a tell?

When the click noise has not been verified, pre-test, to be uniform in its sound quality (and/or timing) when transitioning to either A or B; then it can act as a tell.

Arnold B. Krueger mentions this problem can exist in hardware ABX testing and personally being able to reliably distinguish the sound of the QSC ABX switcher transitions, at least in the absence of music playing, here (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=697607). He also mentions it occasionally can be an issue with PCABX (at least when using certain computers) in that post.

A slightly different but equally problematic issue is when the click sound doesn't immediately identify the signal itself, however is does indicate to the listener if the transition was to the same signal vs. the alternate signal [For example, the artifact click sound switching from A to X, where X = A, vs. the sound of A to X where instead X = B].

Kees de Visser, who started that thread (http://www.hydrogenaudio.org/forums/index.php?showtopic=79566&hl=ABX+fast+switching+artifacts), mentions a difference in the click sounds (and gives some examples for us to listen to) in software ABX testing, in this post (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698641). His belief at the time, if I understood correctly, was that it was random in nature [a good thing] however his response to a question here (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698645) seemed to suggest otherwise to me.

The unrelated reason why we ideally don't want clicks at all, is because the presence of any other obtrusive sound, during the transition period, be it a blast from a bull horn, an annoying loud beep, or even just a loud click/thump can muck up our short term memory for things like loudness (or I would assume, also tonality). This I would assume is rather self evident. Sorry if I wasn't more clear about the "tell" part.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Martel on 2013-08-06 17:20:02
Except comparing two mono signals simultaneously, one fed to each ear via headphones. A niche pass-time, but possible.
This generally won't work because human body is rarely perfectly symmetrical (similar for headphones). Left and right ear may deteriorate (be damaged) at differing rates etc.
For example, when I activate the channel swap plugin in Foobar2000, it's not just side-swap for me. It's likely because I recently damaged my right ear a bit in a gym (we were dropping the weight carelessly while dead-lifting).
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: [JAZ] on 2013-08-06 17:45:20
I guess I like my data to be more raw and empirical.  Brains are really amazing biological devices, but they make for piss poor lab equipment.


We do not want raw data or data that simply demonstrates that the subject has received a stimulus from the outside world. There are things that we perceive, but that we aren't conscious about that perception (for example, when paying attention to something complex, we might not be conscious of small changes around us), and there are actions we do which we don't control them directly (like heart beating or breathing, although we can partially stop the second one).

As such, we rely on what the subject is able to tell that he has perceived. Since the question is made just after the perception has been received, the idea of forgeting it seems absurd (in a sane individual), and, at much, we can wonder how accurate the communication of what he has perceived is. That is not much different than calibrating a device. (except there are more variables).
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: 2Bdecided on 2013-08-06 18:16:06
What I'm trying to discuss is not people suffering delusions, but the validity and reliability of a test based on human sensory impressions/conscious awareness/memory as the source of data.
...but we're testing whether people using their sensory impressions and conscious awareness* can remember hearing a difference.

That's relevant to the choice of audio codec or hi-fi equipment. 

* - I included "conscious awareness" just to copy your sentence, but I don't think it's necessarily a restriction of DBTs. Many people who do ABX testing say it reveals audible differences that they were barely if at all "consciously" aware of. They thought they were guessing throughout, but they could guess correctly with statistical significance.

Quote
I guess I like my data to be more raw and empirical.  Brains are really amazing biological devices, but they make for piss poor lab equipment.
...but we listen to audio codecs or hi-fi equipment with our ears and brains, not lab equipment.

I think what you are wanting is irrelevant to the task at hand. The point of hi-fi, audio coding, Hydrogenaudio etc is to listen to audio (music, speech, whatever).

The question you are asking seems to be quite different. It might make an interesting scientific study, but I don't think you've made a convincing argument that it has any relevance to listening to music.

Cheers,
David.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-06 21:13:45
I just hope you guys in the industry don't end up with the wrong codec because you're using a rubber yardstick.  (I kid.)  I have it easy.  I've been moved to tears by a highly compressed Beethoven's 9th on a table radio.

Thanks for the interesting discussion, and for not banning me for random questions from the peanut gallery!
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Arnold B. Krueger on 2013-08-13 01:38:52
In light of the limits of serial audio testing raised by the OP, I'm pondering two possible conclusions:

"We've reached the limits of differences the subjects can hear"

vs.

"We have reached the limits of resolution for using human memory as the measuring instrument in an AB test".

Though the OP's point is generating some discussion, it seems to be sidestepping his basic criticism (which seems logical, particularly when considering current knowledge about perception).



Do you have some means for comparing two sounds that does not depend on human memory?  Please do tell!
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-15 00:31:05
* - I included "conscious awareness" just to copy your sentence, but I don't think it's necessarily a restriction of DBTs. Many people who do ABX testing say it reveals audible differences that they were barely if at all "consciously" aware of. They thought they were guessing throughout, but they could guess correctly with statistical significance.


That is an interesting observation.  Lots more sensory data is received and processed than what actually get's through to conscious awareness.  I wonder if the subjects could maintain their batting average over a larger sample size. 

Do you have some means for comparing two sounds that does not depend on human memory?  Please do tell!


Can't really think of one.  But for that matter I am not trying to develop audibly transparent compression codecs. 

What I wonder is this: can it be done simply by the numbers?  We know thresholds of audibility, why not design to that, instead of the less precise ability of humans to differentiate?  Is it because of actual loss of data in the compression schemes, a matter of finding that happy place between the fat milk and the skim that folks will still find palatable?

Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-08-15 00:45:03
We know thresholds of audibility, why not design to that

What makes you think this isn't done?

Something is used to determine where to reduce precision in order to make a signal easier to compress, right?
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-08-15 01:01:23
Do you have some means for comparing two sounds that does not depend on human memory?  Please do tell!


Can't really think of one.  But for that matter I am not trying to develop audibly transparent compression codecs. 


What ever does what Arnold said have to do with what you said?
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-15 14:39:03
What ever does what Arnold said have to do with what you said?


We're discussing serial audio testing of human subjects, which necessarily relies on memory. 

Arny asked if I had means of comparing sound without using memory, I replied in the negative (assuming human subjects again, not the use of some sort of measurement equipment).  I also pointed out that I am not developing anything that requires discrimination testing of this sort.







   

 

Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-08-15 15:08:09
The point is that ABX is useful for more than the development of lossy codecs.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Wayne Highwood on 2013-08-15 16:21:22
The point is that ABX is useful for more than the development of lossy codecs.


I get that. 

The rather confrontational, terse one-line interrogatives and comments gives the distinct impression you guys are preparing to tee off on me, which is great if that sort of behavior meets your psychosocial needs, but it would be like a college professor demeaning a freshman.  I just visited to learn, not be treated with scorn.  Have a nice day, fellas.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: saratoga on 2013-08-15 19:26:44
The point is that ABX is useful for more than the development of lossy codecs.


I get that. 


Then your reply above doesn't really make sense, or perhaps you have misunderstood what you've quoted above.

The rather confrontational, terse one-line interrogatives and comments gives the distinct impression you guys are preparing to tee off on me,


Hello and welcome to the internet where people are not always going to agree with you.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: greynol on 2013-08-17 01:10:13
I live on the West Coast so there's still time, and I definitely will.  Thanks.

My apologies for not noticing that you stopped carrying the torch for this discussion as of 8/6.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-08-17 04:57:00
What ever does what Arnold said have to do with what you said?


We're discussing serial audio testing of human subjects, which necessarily relies on memory. 

Arny asked if I had means of comparing sound without using memory, I replied in the negative (assuming human subjects again, not the use of some sort of measurement equipment).  I also pointed out that I am not developing anything that requires discrimination testing of this sort.


If you want to do an audio test, how else would you do this? By the way, sequential tests with proper windowing are documented as the best way to extract the most reliable answers from subjects.

If you're not interested in audio testing, why are we having this discussion? Seriously. I am confused.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Kees de Visser on 2013-08-22 08:59:51
Kees de Visser, who started that thread (http://www.hydrogenaudio.org/forums/index.php?showtopic=79566&hl=ABX+fast+switching+artifacts), mentions a difference in the click sounds (and gives some examples for us to listen to) in software ABX testing, in this post (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698641). His belief at the time, if I understood correctly, was that it was random in nature [a good thing] however his response to a question here (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698645) seemed to suggest otherwise to me.
Sorry for being late. The artifacts I found are indeed random. They also appear when switching between identical audio, which is strange since this is a trivial task and should be 100% lossless, e.g. with a simple linear fade. The artifacts I was/am worried about will only appear when switching between different audio streams, which is not a lossless process, not a trivial task and might be audible, depending on many variables.
Hope this helps.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Arnold B. Krueger on 2013-08-22 13:14:53
What I wonder is this: can it be done simply by the numbers?  We know thresholds of audibility, why not design to that, instead of the less precise ability of humans to differentiate?  Is it because of actual loss of data in the compression schemes, a matter of finding that happy place between the fat milk and the skim that folks will still find palatable?


In the early 1980s I sat through an AES presentation about the development of lossy encoders based on the thresholds of hearing. It was tough going for the developers. Couldn't get worthwhile amounts of data compression. Later on masking became better understood, and development of lossy encoders shifted into high gear.

The point being that the thresholds of audibility overestimated the working sensitivity of the human ear.

While the golden ears rant and rave about how wrong Fletcher and Munson were, they were actually overly optimistic.


Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2013-08-24 13:11:59
Kees de Visser, who started that thread (http://www.hydrogenaudio.org/forums/index.php?showtopic=79566&hl=ABX+fast+switching+artifacts), mentions a difference in the click sounds (and gives some examples for us to listen to) in software ABX testing, in this post (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698641). His belief at the time, if I understood correctly, was that it was random in nature [a good thing] however his response to a question here (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698645) seemed to suggest otherwise to me.
Sorry for being late. The artifacts I found are indeed random. They also appear when switching between identical audio, which is strange since this is a trivial task and should be 100% lossless, e.g. with a simple linear fade. The artifacts I was/am worried about will only appear when switching between different audio streams, which is not a lossless process, not a trivial task and might be audible, depending on many variables.
Hope this helps.


Two things come to mind:

1) bad time alignment, although with 2 identical files that would seem unlikely
2) bad crossfade window design

If the click when switching between A/X is more noticible than B/X, you can be sure subjects will latch on to that.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Arnold B. Krueger on 2013-08-25 01:21:27
Kees de Visser, who started that thread (http://www.hydrogenaudio.org/forums/index.php?showtopic=79566&hl=ABX+fast+switching+artifacts), mentions a difference in the click sounds (and gives some examples for us to listen to) in software ABX testing, in this post (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698641). His belief at the time, if I understood correctly, was that it was random in nature [a good thing] however his response to a question here (http://www.hydrogenaudio.org/forums/index.php?s=&showtopic=79566&view=findpost&p=698645) seemed to suggest otherwise to me.
Sorry for being late. The artifacts I found are indeed random. They also appear when switching between identical audio, which is strange since this is a trivial task and should be 100% lossless, e.g. with a simple linear fade. The artifacts I was/am worried about will only appear when switching between different audio streams, which is not a lossless process, not a trivial task and might be audible, depending on many variables.
Hope this helps.


Two things come to mind:

1) bad time alignment, although with 2 identical files that would seem unlikely
2) bad crossfade window design

If the click when switching between A/X is more noticeable than B/X, you can be sure subjects will latch on to that.


That's for sure!  One good test is to run an ABX test with no music playing.  If there are switching artifacts or background noises that relate to one alternative but not the other. a careful listener can detect them and do pretty well!

While none of the switchboxes that were sold by the ABX company had this problem, I did encounter a sample of a competitive product that I could score 16/16 with, with nothing attached to it at all. 

This sort of problem can be inherent in the switchbox, or it can be due to an error in the setup of the test.

It is also problem to have noises that are truly random.  They are a less severe problem but if noticeable enough they can distract the listener and reduce the probability of reliable detection when it is possible.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Arnold B. Krueger on 2014-11-28 16:50:40
As a scientist (biophysics) and an audiophile, I have my feet in both worlds, and have thus enjoyed this (http://www.hydrogenaudio.org/forums/index.php?showtopic=57406) discussion.


Here are some more details about the earliest paper I can find that describes sequential ABX testing:

http://scitation.aip.org/content/asa/journ....1121/1.1917190 (http://scitation.aip.org/content/asa/journal/jasa/22/5/10.1121/1.1917190)

"
An understanding of the over?all process of hearing depends upon proper interpretation of the results of many individual experiments. In the field of subjective experimentation the problem has been complicated by the wide variety of test procedures that characterize available data. If a common technique could be applied to the many different types of auditory tests, such as thresholds of acuity, masking tests, difference limens, etc., the organization of these data would be facilitated. The purpose of the present paper is to describe a test procedure which has shown promise in this direction and to give descriptions of equipment which have been found helpful in minimizing the variability of the test results. The procedure, which we have called the “ABX” test, is a modification of the method of paired comparisons. An observer is presented with a time sequence of three signals for each judgment he is asked to make. During the first time interval he hears signal A, during the second, signal B, and finally signal X. His task is to indicate whether the sound heard during the X interval was more like that during the A interval or more like that during the B interval. For a threshold test, the A interval is quiet, the B interval is signal, and the X interval is either quiet or signal. For a masking test, A is the masking signal, B is the masking signal plus the signal being masked, and X is either A or B repeated. The apparatus for the ABX test is mechanized so all details of the method can be duplicated for each observer, and the variability of manual operation eliminated. The entire test is coded on teletype tape to reduce the time and effort of collecting large quantities of data.
"

Just to reiterate something that many of us are way too aware of and is mentioned above, which is that the above methodology is not the same as the method described in Clark's 1972 JAES article which was designed to overcome many of the limitations of the method described in the 1950 JASA paper.

Conflating these two very different methodologies is a not infrequent  mistake that was recently repeated in a highly touted  recent AES conference paper 9174  "The audibility of typical digital audio lters in a high-fi delity playback system"
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: Woodinville on 2014-12-09 21:26:44
I don't know what you mean by "sequential" ABX testing. Any ABX test that I am aware of allows the subject to listen to any of the three samples, in any order, as many times as he/she pleases until ready to make a choice.

Also, you cannot "eliminate" false positives just as you cannot eliminate false negatives. You can only make the null hypothesis statistically improbable.


The point is that it's sequential regardless of the order, and thus relies on memory -- you can't listen to all three tracks simultaneously.    If you reread that section of my post carefully I think you'll see what I'm getting at.



Well, your original post here, having some very unoriginal opposition to ABX testing in it, and using a completely inappropriate comparison between vision and hearing (they are not the same, one can be static, one can not, for instance, making the "memory issue" for a properly subject-controlled switching system (with proper switching) completely moot), appears to have been guided by someone or something to play "god of the gaps" reasoning.

You can NEVER listen to 3 tracks simultaneously, not during the test, or ever, any time, any place. It's not how human perception works. So what you're really objecting to is how evolution designed our ability to sense atmospheric vibrations.  I'm not sure what the point of that is.
Title: Biophysics, Limitations of Shannon and Issues with ABX Testing
Post by: ajinfla on 2014-12-09 22:49:27
I fear the battle is lost on the issue of honesty controls themselves (I guess due to 30+ yrs of derisive amusement), so the new war is to attack the most "common" method, ABX.
http://www.stereophile.com/content/listening-143 (http://www.stereophile.com/content/listening-143)

Purportedly it presents a "higher cognitive load" than ABC/HR et al, or good ol' LLT...Long Term "Listening". Evidence seems scant, but on we go.

cheers,

AJ