|
|
Tech Topic: What You See Is Not What You Get
Why measured response doesn’t always match what’s
heard
By
Pat Brown

|

The measured versus ideal response for the direct field
of a loudspeaker.
|
 |
Many audio field technicians are now in possession of measurement
systems that can be used to assist the listening process in equalizing
sound reinforcement systems. But, they’re often surprised to find
that the measured system response correlates poorly with subjective
impression of how the system sounds.
In other words, the system can sound good when it looks bad on the
analyzer, and it can sound bad when it looks good on the analyzer.
As a result, some users have become frustrated and distrustful of
analysis systems in general.
|
lLet's look at why the eye and ear do not always agree on what is best
regarding the response of the sound system.
First, consider the most popular methods of measuring the response of
the sound system. By “response,” I am referring to the magnitude of the
frequency response as displayed on a dB (vertical) vs. logarithmic (horizontal)
scale. The goal of technical system equalization is to produce a “flat”
horizontal line on this display.
|

The effect of increasing distance outdoors (top) versus indoors.
|
 |
WORKING IN REAL TIME
The real-time analyzer (RTA) is essentially a bank of meters, each
driven with a 1/n-octave constant percentage bandwidth filter so
that only the level of a limited range of frequencies is displayed
by each meter.
The original RTAs used analog meters, but current versions use
a vertical row of LEDs for each 1/n-octave band. One-third octave
resolution is the most popular, and correlates well with the response
of the human auditory system.
The RTA input is fed from an omnidirectional test microphone located
at a listener position. Omni’s are used because they typically have
a very flat, “benign” frequency response over most of their band
pass.
RTAs can also be software-based, utilizing the sound card on a
personal computer to provide the A/D conversion of the microphone
output voltage.
|
A mathematical algorithm (the FFT) is used to produce the previously
described dB vs. frequency display. These “digital” analyzers emulate
their analog counterparts in how the information is displayed, but differ
in that the filters and display is the product of a computer algorithm
rather than analog filters. This type of RTA is more versatile, as the
octave-fractions, colors, etc. are under software control.
|

A target curve can be used with the RTA to compensate for the low-frequency
build-up that occurs in many rooms.
|
 |
Regardless of which type is used, the standard method-of-use is
to drive the sound system with pink noise (equal energy per 1/n
octave) and adjust the system equalizer for a “flat” magnitude response
on the analyzer display.
RTAs are powerful tools when certain guidelines are followed, but
indoors they can indicate a system response with poor correlation
to what the listener is hearing. The major consideration is the
placement of the measurement microphone.
|
If the mic is placed in the near field of the loudspeaker (typically
less than 10 feet), the correlation with human hearing is pretty good.
At this position, the direct energy from the loudspeaker dominates what
is being observed on the analyzer and very little of the reflected energy
from the room is included in the displayed response. Adjustment of the
equalizer for a flat direct sound field on the analyzer produces a desirable
result.
The down side to the near-field placement is that the measured response
is very sensitive to small vertical movements of the microphone when the
loudspeaker has offset vertical components (as most do). This sensitivity
can be reduced if the microphone is moved to a greater distance from the
loudspeaker (into the far field) since the path-length difference back
to the individual components becomes more equal. But, as the microphone
is moved further away, the reflected energy from the room begins to dominate
the displayed response.
GIVING EQUAL WEIGHT
Microphones have no “perceptual” abilities. They do not localize sound
or discriminate early sound energy from late energy like humans do. A
listener at a distance remote from the loudspeaker will pay more attention
to the direct field of the loudspeaker than sound that is building-up
in the room. A microphone gives equal weight to all energy without regard
to where it is coming from.
A simple experiment to verify this is to stand at the microphone position
and listen to the loudspeaker and then route the mic through a headphone
amplifier and listen to it through headphones - not the same thing at
all.
|

A full-bandwidth transfer function measurement (with SIA SMAART
Live) using variable time windows. This measurement was made indoors
at about 50 feet from the loudspeaker.
|
 |
Low frequency sounds tend to linger in rooms longer than high frequency
sounds, because most rooms have more high frequency absorption than
low frequency absorption.
As such, the room becomes “bass heavy” when the total sound field
is considered. This extra low frequency information will dominate
what is observed on the RTA, and the knee-jerk reaction is to attempt
to “flatten” the response by boosting the high frequency bands on
the equalizer. The result is a system with excessive high frequency
output and a resultant “harsh” sound quality.
When RTAs are used in this manner, it is important to equalize
to a “target curve” rather than for a flat frequency response.
|
The popular “X” curve for theaters is flat to 2 kHz, where it starts rolling
off the high frequency response at about 3 dB per octave. It is -10 dB
at 10 kHz relative to 2 kHz. This represents 1/10th power at 10 kHz relative
to flat response. The one-third-octave analyzer and the target curve have
served sound practitioners well for years, and remains a viable approach
to system calibration.
RECENT METHODS
Technology has yielded some new methods for acquiring the system response
at a listener position. A complex comparison (both time and frequency
information) of the input and output of a system is called the transfer
function. It includes both the magnitude and phase response of the loudspeaker/room
at the microphone position. This has become a popular method of analysis,
as it allows any input stimulus to be used to test the system, since the
displayed response is just the difference between “what you put in” and
“what you got out.”
|

Placing the test microphone on a stand makes it impossible to observe
the loudspeakers response without interference.
|
 |
Transfer function analysis has the added advantage of the ability
to use a “time window” to exclude late arriving energy from consideration
in the response. This can prevent the low-frequency build-up problem
that plagues traditional real-time analysis. With proper implementation
of a time window, the system response can be adjusted without the
need for frequency weighting via a target curve.
A major difference between transfer function analysis and 1/n-octave
real-time analysis is that the former requires the removal of the
signal delay between the two signals being compared.
The stimulus (the reference signal) always has a much shorter path
back to analyzer input than the output of the measurement microphone.
Sources of delay include the travel time through the air and the
latency of digital processors.
|
Failure to properly synchronize the reference signal and the microphone’s
signal will result in an erroneous display of the system’s response. The
length of the time window must also be selected - in other words, “how
much of the room decay do I want to include in the response?”
|

The test microphone was placed on a stand for this measurement.
Note the comb filtering due to the floor bounce effect.
|
 |
Unfortunately, there is not an optimum size for the entire spectrum.
A short time window excludes much of the room decay at the expense
of low-frequency resolution. A long time window improves frequency
resolution at the expense of gathering too much of the room’s decay.
A compromise is required.
The human auditory system perceives pitch on a proportional (logarithmic)
frequency scale. This is one reason that we use constant-percentage
bandwidth filters for tuning audio systems - the bandwidth grows
with increasing frequency.
|
Frequency-dependent bandwidth suggests that the length of the windowing
function used in transfer function analysis should be varied in the same
manner - a decreasing length with increasing frequency. This produces
a somewhat “anechoic” response at high frequencies with increasing frequency
resolution as frequency decreases. The time window length is a function
of frequency, with even the longest window (highest frequency resolution)
excluding much of the late energy from the room.
Another caveat of this type of analysis is that much greater frequency
detail is possible than with the typical 1/3-octave banded display. Phase
interference effects from reflections or multiple drivers are clearly
visible on the analyzer. Such anomalies are almost always position-dependent,
so careful “corrections” at one seating position will be inappropriate
for another. Both the loudspeaker and the measurement microphone should
be carefully positioned to avoid the creation of very early high-level
reflections.
SPECIAL EFFECTS?
The “floor bounce” effect is a common example of a very early reflection
(typically within a few milliseconds of the first sound arrival) that
produces a unique acoustic response for each listener seat for all but
the lowest octaves of the spectrum. This is an example of “less is better”
when measuring the response, as a 1/3-octave display lacks the resolution
to observe the effect in detail and produces less of a temptation to “fix”
it.
|

The same measurement displayed at one-third octave resolution. The
floor bounce effect is significantly masked by the lower resolution.
|
 |
The floor bounce effect can be minimized by use of an appropriate
frequency-dependent time window or by simply laying the measurement
microphone on the floor, or on a board placed across the listener
seats. The effect usually disappears with the presence of an audience,
so we do not wish to consider it when tuning the sound system.
The use of variable-length time windows and the synchronous transfer
function allow the system to be tuned in a manner similar to the
near-field RTA method (flat response on a log frequency display),
even at remote positions in the room. It is superior to the RTA
method in that the effects of air absorption are readily apparent
and can be compensated for via equalization.
|
Near-field techniques do not include air absorption for the simple fact
that the sound has not traveled very far before it is picked up by the
microphone, so it hasn’t passed though enough air to be significantly
attenuated.
By far, the biggest problem with tuning sound systems is failure on the
part of the technician to recognize anomalies that cannot be corrected
with equalization. The equalizer is a “global” device, meaning that its
response curve will be impressed on all of the sound radiated from the
loudspeaker, regardless of the direction in which it is radiated.
Many, if not most, of the anomalies observed on the analyzer are unique
to each listener position. The technician must learn to recognize and
ignore such events. They include:
• Floor-bounce effect;
• Interference between multiple drivers;
• Reflections from objects near the mic or loudspeaker.
Events that produce a more global effect, and can therefore be addressed
with equalization include:
• Boundary-loading of loudspeakers;
• Coupling between multiple low-frequency drivers;
• The direct-field loudspeaker response.
With training and experience, the system technician can implement methods
that reveal system imperfections that are correctable, and hide those
that are not - regardless of the analysis method used. Better yet, system
designers can design systems with fewer “un-equalizable” problems.
BAD IS ALWAYS BAD
The old adage “an ounce of prevention...” could never be MORE true. System
equalization then becomes meaningful and fast, providing the “icing on
the cake” of the performance of a sound system. It makes a good loudspeaker
sound better, and brings the system to its fullest potential given the
acoustic environment into which it is placed. A bad room is a bad room,
regardless of how we process the electrical signal that drives the loudspeakers.
|

The microphone was placed on the floor for this measurement. Anomalies
inherent to the loudspeaker are now visible on the analyzer.
|
 |
When used properly, the traditional 1/n-octave real-time analyzer
is a useful tool outdoors at any distance. Indoors, the effects
of reflected sound and non-frequency-uniform room absorption produce
some problems for this method at measurement distances remote from
the loudspeaker.
One solution is to utilize a weighting curve that reduces the target
level of the high-frequency portion of the spectrum. Attempts to
achieve a flat system response at remote listener positions without
the use of a weighting curve can result in harsh-sounding systems
and even component damage.
|
Transfer function analysis addresses some of the shortcomings of the 1/n-octave
RTA, but it requires greater expertise on the part of the user. Failure
to properly compensate for the time differential between the reference
and measured signal can produce wildly erroneous results. The time window
length must also be selected by the user, and different lengths will produce
different displayed responses. A frequency-dependent time window produces
a display that correlates well with human perception.
The most important feature of either measurement method is a knowledgeable
operator - one who understands the caveats of each approach along with
the basic characteristics of the human auditory system. None of the questions
raised here have a single, correct answer. This means that experience,
good judgment, and common sense rooted in Newtonian physics are still
a part of the measurement process.
Sound is a relatively easy quantity to measure, but measurements that
correlate with human perception are much more difficult. Analyzers driven
by omni directional microphones do a poor job of emulating the human listener.
At this point one could ask, “So why measure at all? Why not just listen?”
Next issue, we’ll have a look at this provocative question.
Pat and Brenda Brown own and operate Syn-Aud-Con, conducting audio
training sessions around the world. For more info, go to www.synaudcon.com
October 2003 Live Sound International
|