class auditory perception
TRANSCRIPT
-
7/28/2019 Class Auditory Perception
1/42
Department of Electrical & Computer Engineering
Auditory Perception
Meena Ramani
04/09/2004
-
7/28/2019 Class Auditory Perception
2/42
Department of Electrical & Computer Engineering
Note
For this lecture many of the slides will be
accompanied by scanned pictures shown onthe OHP from Zwicker and Fastls
Psycho-acoustics facts and models 2nd edition
-
7/28/2019 Class Auditory Perception
3/42
Department of Electrical & Computer Engineering
Main Outline
Anatomy of the Ear and Hearing DONE
Auditory perception
Hearing aids and Cochlear implants.
Extra: Direction of Arrival Estimation
-
7/28/2019 Class Auditory Perception
4/42
Department of Electrical & Computer Engineering
Auditory perception
Shepard Tones
Masking
Ohms Acoustic Law Critical Bands
Webers law
Just Noticeable Frequency
-
7/28/2019 Class Auditory Perception
5/42
Department of Electrical & Computer Engineering
Roger Penrose M.C. Escher
Ascending andDescending
OpticalIllusion
AudioIllusion
-
7/28/2019 Class Auditory Perception
6/42
Department of Electrical & Computer Engineering
Shepard Tones Circularity in Judgments of Relative Pitch, Roger N. Shepard,
JASA 1964.
Sensitivity to descending pitch
Sensitivity to volume changes between thesepitches.
A set of eight tones all an octave apart
The tones simultaneously descend in pitch till half oftheir original pitch.
Jump back up to their original pitch and repeat the cycle.
Perceive this change?
Unique volume curve
Effect: Seamless transition in the cycle.
Its all in your head!
Omit two of the eight tones in the mid frequency range.
http://images.google.com/imgres?imgurl=http://www.clallam.net/EnvHealth/assets/images/tap_water.jpg&imgrefurl=http://www.clallam.net/EnvHealth/html/dw_availability.htm&h=192&w=169&sz=6&tbnid=Cs8vMAYzLjwJ:&tbnh=97&tbnw=86&start=20&prev=/images%3Fq%3Dtap%2Bwater%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DNhttp://images.google.com/imgres?imgurl=http://www.cbfisap.sfa.ed.gov/tutorial/lesson5/images/phone.gif&imgrefurl=http://www.cbfisap.sfa.ed.gov/tutorial/lesson5/l05_008.htm&h=250&w=198&sz=13&tbnid=VOO6YPrv3IsJ:&tbnh=104&tbnw=83&start=66&prev=/images%3Fq%3Dcell%2Bphone%2Bman%26start%3D60%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DN -
7/28/2019 Class Auditory Perception
7/42
Department of Electrical & Computer Engineering
You know I can't hear you when the water isrunning!
MASKING
http://images.google.com/imgres?imgurl=http://www.clallam.net/EnvHealth/assets/images/tap_water.jpg&imgrefurl=http://www.clallam.net/EnvHealth/html/dw_availability.htm&h=192&w=169&sz=6&tbnid=Cs8vMAYzLjwJ:&tbnh=97&tbnw=86&start=20&prev=/images%3Fq%3Dtap%2Bwater%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DNhttp://images.google.com/imgres?imgurl=http://www.cbfisap.sfa.ed.gov/tutorial/lesson5/images/phone.gif&imgrefurl=http://www.cbfisap.sfa.ed.gov/tutorial/lesson5/l05_008.htm&h=250&w=198&sz=13&tbnid=VOO6YPrv3IsJ:&tbnh=104&tbnw=83&start=66&prev=/images%3Fq%3Dcell%2Bphone%2Bman%26start%3D60%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DNhttp://images.google.com/imgres?imgurl=http://homepage.ntlworld.com/myronf/images/mask.jpg&imgrefurl=http://homepage.ntlworld.com/myronf/mask.html&h=599&w=899&sz=44&tbnid=D-YQopVtFMkJ:&tbnh=96&tbnw=144&start=42&prev=/images%3Fq%3Dmask%2B%26start%3D40%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DN -
7/28/2019 Class Auditory Perception
8/42
Department of Electrical & Computer Engineering
Masking
Low-frequency, broad banded sounds (like water running) will maskhigher frequency sounds which are softer at the listener's ear (a
conversational tone from across the room).
Example 2: Truck in street
Masking occurs because two frequencies lie within a critical band and
the higher amplitude one masks the lower amplitude signal. Masking can be because ofbroad band, narrowband noise, pure and
complex tones.
Masking threshold
Amount of dB for test tone to be just audible in presence of noise
See OHP Figure
http://images.google.com/imgres?imgurl=http://homepage.ntlworld.com/myronf/images/mask.jpg&imgrefurl=http://homepage.ntlworld.com/myronf/mask.html&h=599&w=899&sz=44&tbnid=D-YQopVtFMkJ:&tbnh=96&tbnw=144&start=42&prev=/images%3Fq%3Dmask%2B%26start%3D40%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DN -
7/28/2019 Class Auditory Perception
9/42
Department of Electrical & Computer Engineering
Masking by Broad band noise
White noise- frequency independent PSD
Masked thresholds are a function of frequency.
Low and very high frequency almost same as TOQ.
Above 500Hz, thresholds increase with increase in frequency
Increasing white noise by 10dB increases masked threshold up by10dB for frequencies >500Hz.
=>Linear behavior of masking
NOTE: TOQs frequency dependence almost completelydisappearsEars frequency selectivity and critical bands.
See OHP Figure
-
7/28/2019 Class Auditory Perception
10/42
Department of Electrical & Computer Engineering
Masking by Narrow band noise
Narrow band
-
7/28/2019 Class Auditory Perception
11/42
Department of Electrical & Computer Engineering
Masking by Narrow band noise (cont)
Noise (Varying Amplitude, Fixed Frequency)
1KHz noise
20-100dB
Slope of rise seems independent of Amplitude
But slope of fall is dependent on amplitude Non-Linear frequency dependence
Strange effect at high masker amplitudes:
At high amplitudes ear begins to listen to anything audible!!
Begin to hear difference noise (noise and testing tone)
See OHP Figure
-
7/28/2019 Class Auditory Perception
12/42
Department of Electrical & Computer Engineering
Masking by Pure and Complex tones Pure tones:
Below threshold of Quiet of test tone can hear only masking tone
Above it
-
7/28/2019 Class Auditory Perception
13/42
Department of Electrical & Computer Engineering
Temporal Aspects of Masking
Previously assume long lasting test and masking sounds
Speech has a strong temporal structure
Vowels --loudest parts
Consonants faint
Often plosive consonants are masked by preceding loud vowel
-
7/28/2019 Class Auditory Perception
14/42
Department of Electrical & Computer Engineering
Temporal Aspects of Masking (cont)
Simultaneous Masking
Pre-Stimulus/Backward/Premasking
1st
test tone 2nd
Masker Poststimulus/Forward/Postmasking
1st Masker 2nd test tone
-
7/28/2019 Class Auditory Perception
15/42
Department of Electrical & Computer Engineering
Types of MaskingSimultaneous masking
Duration less than 200ms test tone threshold increases with decrease induration.
Duration >200ms constant test tone threshold
Assume hearing system integrates over a period of 200ms
Postmasking (100ms)
Decay in effect of masker 100ms More dominant
Premasking (20ms) Takes place before masker is on!!
Each sensation is not instantaneous , requires build-up time
Quick build up for loud maskers Slower build up for softer maskers
Less dominant effect
See OHP Figure
-
7/28/2019 Class Auditory Perception
16/42
Department of Electrical & Computer Engineering
Ohms Acoustic Law
The sound quality of a complex tone depends ONLY on theamplitudes and NOT relative phases of its harmonics.
-
7/28/2019 Class Auditory Perception
17/42
Department of Electrical & Computer Engineering
Critical Bands
Proposed by Fletcher Noise which masks a test tone is the part of its spectrum which lies
near the tone
Masking is achieved when the power of the tone and the power
of the noise spectrum lying near the tone and masking it are
the same.
Bands defined this way have a BW which produces same acoustic
power in the tone and in the noise in the band when the tone is
masked.CRITICAL BANDS
See OHP Figure
-
7/28/2019 Class Auditory Perception
18/42
Department of Electrical & Computer Engineering
Critical Band (cont.)
How to measure?
Masking of a band pass noise using 2 tones
CB corresponds with1.5mm spacing on BM.
24 such band pass filters
BW of the filters increases with increasing center frequency Logarithmic relationshipWebers law example.
Bark scale
See OHP Figure
-
7/28/2019 Class Auditory Perception
19/42
Department of Electrical & Computer Engineering
Webers law
Weber's Law states that the ratio of the increment threshold tothe background intensity is a constant.
So when you are in a noisy environment you must shout to be heard
while a whisper works in a quiet room.
when you measure increment thresholds on various intensitybackgrounds, the thresholds increase in proportion to the
background.
-
7/28/2019 Class Auditory Perception
20/42
Department of Electrical & Computer Engineering
Just noticeable change in
Frequency (Pg:183)
Similar to variation in the critical band structure
This is because it depends on number of BPFs
More BPF better resolution
Till about 500Hz JND is about 3.6Hz. After 500Hz it varies as 0.007f
See OHP Figure
http://images.google.com/imgres?imgurl=http://www.hearingaidstoday.net/palm.gif&imgrefurl=http://www.hearingaidstoday.net/widex.html&h=199&w=341&sz=40&tbnid=sdMXduexLXoJ:&tbnh=67&tbnw=114&start=12&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:enhttp://images.google.com/imgres?imgurl=http://www.marshfieldclinic.org/cattails/00/janfeb/images/hearingaids.jpg&imgrefurl=http://www.marshfieldclinic.org/cattails/00/janfeb/protect.stm&h=200&w=206&sz=11&tbnid=dU3rsA7gP_kJ:&tbnh=97&tbnw=99&start=4&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:enhttp://images.google.com/imgres?imgurl=http://www.hearingtests.ca/behind_the_ear.jpg&imgrefurl=http://www.hearingtests.ca/hearing_aids.htm&h=285&w=186&sz=7&tbnid=Y0JaC5OZOUAJ:&tbnh=108&tbnw=71&start=1&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:enhttp://images.google.com/imgres?imgurl=http://www.statesmanjournal.com/salemaudiology/graphics/perseo_fprdt_250_rev.jpg&imgrefurl=http://www.statesmanjournal.com/salemaudiology/products_phonakperseo.htm&h=290&w=250&sz=12&tbnid=hYbtCDvrUaMJ:&tbnh=109&tbnw=94&start=6&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en -
7/28/2019 Class Auditory Perception
21/42
Department of Electrical & Computer Engineering
HEARINGAIDS
http://images.google.com/imgres?imgurl=http://www.hearmorehearingaidcenter.com/evofsvc.jpg&imgrefurl=http://www.hearmorehearingaidcenter.com/&h=1002&w=949&sz=40&tbnid=viVJDCcsAYoJ:&tbnh=148&tbnw=141&start=44&prev=/images%3Fq%3Dhearing%2Baids%26start%3D40%26imgsz%3Dxxlarge%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en%26sa%3DNhttp://images.google.com/imgres?imgurl=http://www.hearingaidstoday.net/palm.gif&imgrefurl=http://www.hearingaidstoday.net/widex.html&h=199&w=341&sz=40&tbnid=sdMXduexLXoJ:&tbnh=67&tbnw=114&start=12&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:enhttp://images.google.com/imgres?imgurl=http://www.marshfieldclinic.org/cattails/00/janfeb/images/hearingaids.jpg&imgrefurl=http://www.marshfieldclinic.org/cattails/00/janfeb/protect.stm&h=200&w=206&sz=11&tbnid=dU3rsA7gP_kJ:&tbnh=97&tbnw=99&start=4&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:enhttp://images.google.com/imgres?imgurl=http://www.hearingtests.ca/behind_the_ear.jpg&imgrefurl=http://www.hearingtests.ca/hearing_aids.htm&h=285&w=186&sz=7&tbnid=Y0JaC5OZOUAJ:&tbnh=108&tbnw=71&start=1&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:enhttp://images.google.com/imgres?imgurl=http://www.statesmanjournal.com/salemaudiology/graphics/perseo_fprdt_250_rev.jpg&imgrefurl=http://www.statesmanjournal.com/salemaudiology/products_phonakperseo.htm&h=290&w=250&sz=12&tbnid=hYbtCDvrUaMJ:&tbnh=109&tbnw=94&start=6&prev=/images%3Fq%3Dhearing%2Baids%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2004-10,GGLD:en -
7/28/2019 Class Auditory Perception
22/42
Department of Electrical & Computer Engineering
Outline
Facts on hearing loss
Cell phones and hearing loss
Types of Hearing aid Inside a hearing aid
Audiogram
-
7/28/2019 Class Auditory Perception
23/42
Department of Electrical & Computer Engineering
Facts on Hearing Lossin Adults
One in every ten (28 million) Americans has hearing loss.
The vast majority of Americans (95% or26 million) with hearing losscan have their hearing loss treated with hearing aids.
Only 5% of hearing loss in adults can be improved through medical orsurgical treatment
Millions of Americans with hearing loss could benefit fromhearing aids but avoid them because of the stigma.
-
7/28/2019 Class Auditory Perception
24/42
Department of Electrical & Computer Engineering
Cell phonesand Hearing aids
Cell Phones emit a type of electromagnetic energy that interfereswith the operation of hearing aids.
The Federal Communications Commission in mid-July 2003 orderedthe cell phone industry to help out the hard-of-hearing.
Within two years, cell-phone manufacturers must offer at leasttwo phones with reduced interference for each type of cellulartechnology used, or ensure that one-fourth of phones thecarriers sell produce less interference.
The FCCs final milestone is February 2008.
-
7/28/2019 Class Auditory Perception
25/42
Department of Electrical & Computer Engineering
Types ofHearing aids
Behind The earIn the Ear
In the Canal Completely in the
canal
-
7/28/2019 Class Auditory Perception
26/42
Department of Electrical & Computer Engineering
Anatomy of a HearingAid
Microphone
Tone hook
Volume control
On/off switch Battery compartment
-
7/28/2019 Class Auditory Perception
27/42
Department of Electrical & Computer Engineering
Inside aHearing aid
1: The microphone
The microphone picks up sound waves from the air and transformsthem into electrical signals.
2: The microphone suspension
The microphone suspension holds the microphone in place.
3: The loudspeaker
The loudspeaker sends the amplified sounds into your ear. The
loudspeaker is also called the receiver and sometimes the telephone.
4: The battery drawer
The battery drawer holds the battery in place.
5: The amplifier
The amplifier makes the signals that come from the microphone louder.
6: The telecoil
The telecoil makes it possible for you to hear one specific person if you
are in a place that supports the use of a telecoil. Many classrooms,
churches and cinemas have telecoil. The telecoil makes it possible for
you to hear i.e. your teacher without hearing the noise around you. It is
also possible to use the telecoil at home - with the TV or the radio.
-
7/28/2019 Class Auditory Perception
28/42
Department of Electrical & Computer Engineering
Audiograms
-
7/28/2019 Class Auditory Perception
29/42
Department of Electrical & Computer Engineering
Direction of Arrival (DOA) estimation algorithm
-
7/28/2019 Class Auditory Perception
30/42
Department of Electrical & Computer Engineering
Talk outline
Necessity for DOA
DOA algorithm Requirements Types of DOA algorithms
Delay and sum
Minimum variance
MUSIC
Coherent MUSIC
Root MUSIC ESPRIT
Comparison Measures
Computational Intensity comparison
Accuracy Comparison
Accuracy vs Computational intensity Conclusion
-
7/28/2019 Class Auditory Perception
31/42
Department of Electrical & Computer Engineering
Where does the DOA come into the picture?
Has 2 microphones
DOA Estimation
qs
& qn
Beamformer
Lets meet
at 11?!?
7 is good
for me
too!!
11 sounds
good!!
http://images.google.com/imgres?imgurl=www.jach.hawaii.edu/JAClocal/images/clipart/Sports/boxing.gif&imgrefurl=http://www.jach.hawaii.edu/JAClocal/images/clipart/Sports/index1.html&h=141&w=174&prev=/images%3Fq%3Dclipart%2Bboxing%26svnum%3D10%26hl%3Den%26lr%3D%26ie%3DUTF-8%26oe%3DUTF-8%26safe%3Dactivehttp://images.google.com/imgres?imgurl=staff.uscolo.edu/peterssl/topics/edition19/images-high-tech/cell-phone-car2b.jpg&imgrefurl=http://staff.uscolo.edu/peterssl/topics/edition19/high-tech/page.htm&h=174&w=200&prev=/images%3Fq%3Dcell%2Bphone%26svnum%3D10%26hl%3Den%26lr%3D%26ie%3DUTF-8%26oe%3DUTF-8%26safe%3Dactivehttp://images.google.com/imgres?imgurl=www.edmunds.com/edweb/editorial/tech/photos/audio.radio1.256.jpg&imgrefurl=http://www.edmunds.com/ownership/techcenter/articles/43831/article.html&h=191&w=255&prev=/images%3Fq%3Dcar%2Baudio%26svnum%3D10%26hl%3Den%26lr%3D%26ie%3DUTF-8%26oe%3DUTF-8%26safe%3Dactive -
7/28/2019 Class Auditory Perception
32/42
Department of Electrical & Computer Engineering
Direction of Arrival Estimation Algorithms
The DOA algorithm must satisfy the following conditions :
Low computational intensity(MIPS/MFLOPS)
High accuracy(RMSE)
High speed Easy implementation
Good performance at low SNRs
Works on a 2 microphone array system with 4cm
separation between them.
-
7/28/2019 Class Auditory Perception
33/42
Department of Electrical & Computer Engineering
DOA Algorithms
Spatial Correlation
methods
Subspace decomposition
methods
MUSIC
Multiple Signal Estimation
ESPRIT
Estimation of Signal parameters
using rotational invariance
Delay and Sum Minimum Variance
Coherent MUSIC
Root MUSIC
-
7/28/2019 Class Auditory Perception
34/42
Department of Electrical & Computer Engineering
DOA Method Equation for Implementation
Delay and Sum
Minimum
Variance
MUSIC
Coherent
MUSIC
Root MUSIC
ESPRIT
*
( ) ( ) ( )P a Saq q q
*
1( )
( ) ( ) ( )P
a inv s aq
q q
*
* *
( ) ( )( )
( ) ( )N N
a aP
a E E a
q qq
q q
'( )
' 'N N
a aP
a E E aq
1 0sin . ( ) /( )K c angle z d q
10
sin arg( ) /( )K K
c dq
-
7/28/2019 Class Auditory Perception
35/42
Department of Electrical & Computer Engineering
Comparison Measures
To evaluate the computational intensity
MFLOPS comparison plot
To evaluate the accuracy
Root Mean Square Error comparison plot To evaluate the effect at low SNRs
SNR vs Estimated angle plot
To evaluate overall performance
Accuracy vs computational intensity
-
7/28/2019 Class Auditory Perception
36/42
Department of Electrical & Computer Engineering
Evaluation of computational Intensity:
MFLOPS comparison chart Min Variance
0.93 Mflops
Coherent MUSIC
0.3958 Mflops
DS
0.3573 Mflops
MUSIC
0.0813 Mflops
ESPRIT
0.0086 Mflops
Root MUSIC
0.0068 Mflops
-
7/28/2019 Class Auditory Perception
37/42
Department of Electrical & Computer Engineering
Comparison of accuracy at different SNR values
0 5 10 15 20 250
20
40
60
80
100
120
140
160
180
Frame Number
EstimatedDOA
(degrees)
Comparison of Estimated DOAs: SNR=10dB,Speech= 90' ,Noise=0'
ESPRITRMUSIC
CMUSICMUSICMVDS
0 5 10 150
20
40
60
80
100
120
140
160
180Estimated DOAS for only those regions which are speech
Frames Number
Estim
atedDOA(Degrees)
-
7/28/2019 Class Auditory Perception
38/42
Department of Electrical & Computer Engineering
Comparison of Accuracy-MFLOPS
-
7/28/2019 Class Auditory Perception
39/42
Department of Electrical & Computer Engineering
Conclusion
Tradeoff between Accuracy and Computational intensity
leads to the conclusion that ESPRIT is the Direction of
arrival estimation algorithm best suited for our purpose
MFLOPS value: 0.0086
RMSE value:~3 (at 10dB)
C i f E i d DOA SNR dB S h N i
-
7/28/2019 Class Auditory Perception
40/42
Department of Electrical & Computer Engineering
0 5 10 15 20 250
20
40
60
80
100
120
140
160
180
Frame Number
Estimate
dDOA
(degrees)
Comparison of Estimated DOAs: SNR=10dB,Speech= 90' ,Noise=0'
ESPRITRMUSICCMUSIC
MUSICMVDS
-
7/28/2019 Class Auditory Perception
41/42
Department of Electrical & Computer Engineering
0 5 10 150
20
40
60
80
100
120
140
160
180Estimated DOAS for only those regions which are speech
Frames Number
EstimatedDOA
(Degrees)
-
7/28/2019 Class Auditory Perception
42/42