Moulton Laboratories
the art and science of sound
24 Bits: Can You Hear ‘Em? 96 kHz.: Can You Hear It?
Dave Moulton
October 1999

How Do You Know? What Does It All Mean?

24 Bits: Can You Hear ‘Em? 96 kHz.: Can You Hear It?

I do a fair amount of subjective testing in the audio field, in addition to my audio production and studio design work. Over the past six months, I’ve gotten caught up in a food fight on the Internet about the audibility of various audio processes.

That food fight really got me to thinking about the trouble we all have communicating about these issues. There are two central parts to the problem. First, we aren’t very objective about describing our subjective impressions and, second, we mostly fail to consider what is generally meant by the terms that we use. Such foibles are perfectly normal for us audio engineer types, and because we aren’t scientists by profession we can’t be held responsible for not making the kind of careful objective claims that real science requires.

Nevertheless, it is important for us to be able to understand at least a little bit about what is really going on in regard to the concept of audibility, just to get along in our professional audio lives. At the same time, we hear a lot of talk about the need for expanded resolution in digital audio these days, which raises some important questions. Can we really hear the difference? And, how important is it?

The answers we get to these questions when we carefully read the trade journals and the reported findings about audibility are a little confusing and contradictory. People have described the difference between 16-bit and 20-bit audio as the difference between mediocre and awesome. Others have ascribed a remarkable transparency to 20-bit signals. Similarly, individuals have reported increased clarity, definition, detailing and other virtues to audio signals with sample rates significantly higher than 48 kHz. Meanwhile, others note that such observations do not get made when we use double-blind tests, nor are they supported by such blind tests, presumed to be more rigorous and “objective” than more informal studies. In fact, double blind tests often seem to show that listeners can’t reliably distinguish between 16-bit audio and 20-bit audio.

How can this be? How can people clearly hear an effect, to a point where they choose to describe it with a superlative such as “amazing,” when under controlled blind conditions other people (and sometimes they themselves) can’t distinguish it from the same signal without said “amazing” effect? One cynical answer is that they are simply making it up. This is called the “Emperor’s New Clothes” syndrome. Humans are suggestible, and it is pretty easy for us to get stampeded into a group-think state of mind where we will clearly hear anything we think will be socially acceptable, whether it’s real or not. All of us working in audio have had the experience of equalizing a channel to a point where everybody in the control room agrees we’ve made some really significant, perhaps awesome, improvements, only to discover that we were equalizing an adjacent channel that wasn’t even switched on!

But I’m not satisfied with “The Emperor’s New Stereo” answer. I’m sure that suggestibility is often an issue, but we know very well that we are susceptible to making these mistakes and we’re generally pretty careful about avoiding them. Further, too many people I know whose hearing acuity I really respect have reported hearing things like a “BIIIIG” difference between 16-bit and 20-bit audio for me to say, “Nah, that’s just group-think. They’re making it up.”

One of my goals, as a researcher, is to find out how to resolve these differences, how to reconcile apparently contradictory and mutually exclusive reports about the audible effect of some audio process. I’d like to be able to explain how it can be that these differences exist, rather than assuming and trying to prove that one of the viewpoints is simply “wrong.”

There are several things to think about.
  • First, we need to consider the actual nature of blind tests.
  • Second we need to think about what we really mean when we describe an improvement as “amazing.”
  • Third, we need to know how the term “audible” is really defined.
  • Fourth, we need to know something about the nature of human perception.
Over the next several months, we’ll look into these questions in detail. There’s actually quite a bit to be said about each of the four questions I’ve raised. But first, we need to consider why this stuff is so important.

We are involved in a business of creating illusions for our customers. We spend a great deal of time and money on these illusions to make them effective. Our viewers/listeners similarly spend a great deal of money to experience illusions they find compelling, convincing and satisfying. At the same time, we’ve bought into the notion that the illusions become better as they become “more accurate,” and so we keep trying to improve the resolution of our illusions.

Meanwhile, it’s a competitive world out there. He/she who creates the most powerful illusion for the least bucks wins. We need to not waste our time and resources on effects that don't matter. We need to know what matters, how much it matters and how much it costs. Only then can we begin to really get into enhancing the illusion we are creating in a really meaningful way.

With that in mind, we’ll dive into the weird and wacky world of blind testing next month. Thanks for listening.
Note: The following group of columns that I wrote for TV Technology are an attempt on my part to describe some of the issues surrounding our attempts to measure and evaluate the audibility of high-resolution formats. Together, I think they make an excellent short survey of these issues. I hope you find them useful.
Members
Login | Register
Mailing List

COMMENTS

     May 01, 2006 05:46 PM
This site is like finding my perfect Les Paul. Incredible wealth of knowledge. Thanks for being open to midnight mixers like myself to explain why.
Random Jam 
UK     Mar 26, 2007 04:34 PM
Further, too many people I know whose hearing acuity I really respect have reported hearing things like a "BIIIIG" difference between 16-bit and 20-bit audio for me to say, "Nah, that's just group-think. They're making it up".

How do you respect someones hearing acuity? You can't use their hearing ability, you can only use your own. So the required respect couldn't come from experiencing that persons engineering skills, because you'd be using YOUR ears, not theirs. And whether or not their hearing is good in your opinion (thus leading to respect) comes from hearing their work through your own hearing acuity.

It isn't possible to respect someones hearing acuity, because you have no way of experiencing that. I tend to feel its more a case that you respect them as people, which should not have any place in determining these kind of issues.

In other words: You fell for a different kind of emperors clothes syndrome.
A Person 
Grosse Pointe Woods, MI     Mar 28, 2007 11:17 AM
I've been trying to come up with convincing demonstrations related to high sample rates and long data words. You can find my efforts here: http://www.pcabx.com/sample_rates/index.htm
Arny Krueger 
     Apr 02, 2007 11:22 AM
I respect their hearing acuity because I've witnessed them working with their ears. I think that is valid. Nonetheless, you raise a good point - we all hear with our own ears and we have to learn to trust them, and calibrate them as best we can.

In general, what you suggest is probably true. Others' claims only reveal to us that somebody else "claimed" to hear something, and "may" have heard it. Beyond that, we don't really know anything.

Best regards,

Dave
Dave Moulton 
     Feb 08, 2009 06:53 PM
Thanks, David!
Feels like the preasure levels become more transparent:)
Best wishes.
Dainius
Daains 
edinburgh     Dec 18, 2009 08:39 PM
Dave like the site good work; a little concerned about how to perceive the application of sample rates, why are you discussing sample rate frequency by comparing it to audio frequency in terms of pitch and human acoustic perception? That is not the actual use of the additional bandwidth surely? I believed it to be (for one purpose) a useful solution that provides (for example) DAWs with room to complete the complex algebraic summing that is required of them when mixing a great many channels of audio data with additional audio effects, plugins and other source data that may add to the complexity of what is essentially an incredibly complicated waveform. If you were to try and plot the same graph using cm squared paper it would render a much less detailed graph, is that not the point of the additional sample-rate resolution?

Kind Regards

Rob
Robert Henderson 
Groton, MA     Dec 20, 2009 11:51 AM
I think you've got a good point, Rob, and a useful way of looking at the issue. Particularly, the extra bits help with arithmetic resolution and the elimination of errors. Similarly, extra bandwidth gives us many fewer errors at the boundaries of bandwidth that ARE audible. Along with that, as you note, the extra resolution makes it possible for DSP to function more effectively, and, yes, that seems to be audible. Keep in mind, though, it is at the expense of the total amount of DSP that can be applied. At the time of writing, that was still a fairly significant issue, particularly working in multitrack surround work.
Dave Moulton 

Post a Comment



rss2

rss atom