ece 598: the speech chain lecture 10: auditory physiology

25
ECE 598: The Speech ECE 598: The Speech Chain Chain Lecture 10: Auditory Lecture 10: Auditory Physiology Physiology

Upload: zechariah-masden

Post on 15-Dec-2015

223 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

ECE 598: The Speech ECE 598: The Speech ChainChain

Lecture 10: Auditory Lecture 10: Auditory PhysiologyPhysiology

Page 2: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

TodayToday Outer Ear: Sound LocalizationOuter Ear: Sound Localization Middle Ear: Impedance MatchingMiddle Ear: Impedance Matching Basilar Membrane: Frequency AnalysisBasilar Membrane: Frequency Analysis

Mechanical PrinciplesMechanical Principles Frequency Response of Auditory FiltersFrequency Response of Auditory Filters Nonlinearity of Basilar Membrane Nonlinearity of Basilar Membrane

ResponseResponse Mechano-Electric TransductionMechano-Electric Transduction

Inner and Outer Hair CellsInner and Outer Hair Cells Neuro-transmitter Uptake ModelsNeuro-transmitter Uptake Models Neural Activation ThresholdsNeural Activation Thresholds

Page 3: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Auditory Anatomy: Auditory Anatomy: OverviewOverview

Page 4: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Localization of Sound: Inter-Localization of Sound: Inter-Aural Time Delay (ITD)Aural Time Delay (ITD)

r

rcosr(/2-)

ITD = (r/c)(/2-+cos)

Wavefronts (lines of constant pressure)

Wave traveling direction

Diffusion of sound around the head

Page 5: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Localization of Sound: Inter-Localization of Sound: Inter-Aural Amplitude DifferenceAural Amplitude Difference

f < c/4r ~ 1kHz: head << f < c/4r ~ 1kHz: head << =c/f; sound diffuses around head=c/f; sound diffuses around head f > c/2r ~ 2kHz: head > f > c/2r ~ 2kHz: head > , so sound is blocked by the head, so sound is blocked by the head

Low frequency, Long wavelength(shown: wave “troughs”, i.e., pressure minima)

High frequency, Short wavelength(shown: wave “troughs”, i.e., pressure minima)

Sound diffuses around the head; no shadow

Head shadow: Sound unable to diffuse around large obstacle

Page 6: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Localization of Sound: Echoes Localization of Sound: Echoes from the Pinna and Shouldersfrom the Pinna and Shoulders

Direct Sound

Pinna Echo

Shoulder Echo

Page 7: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Localization of Sound, Localization of Sound, Summary: Head-Related Summary: Head-Related

Transfer FunctionTransfer Function Source: Source:

s(t) = cos(s(t) = cos(t)t) Received at near ear:Received at near ear:

xxRR(t) = A(t) = ARR(() cos() cos(t+t+RR(()))) Received at far ear: Received at far ear:

xxLL(t) = A(t) = ALL(() cos() cos(t+t+LL(())ITDITD)) Near ear frequency response:Near ear frequency response:

HHRR(() = A) = ARR(()e)ejjRR(())

Far ear frequency response:Far ear frequency response:

HHLL(() = A) = ALL(()e)ej(j(LL(()-)-ITDITD))

Page 8: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Middle Ear FunctionsMiddle Ear Functions Impedance Matching: Impedance Matching:

Sound transmission in water (inner ear) requires Sound transmission in water (inner ear) requires much higher pressure than sound transmission in air much higher pressure than sound transmission in air (outer ear).(outer ear).

Without middle ear, sound incident on oval window Without middle ear, sound incident on oval window would bounce away (would bounce away (=1) =1)

Middle ear reduces g so that not all sound is Middle ear reduces g so that not all sound is reflectedreflected

Reduce Exposure to Loud EnvironmentsReduce Exposure to Loud Environments Strap muscles loosen in loud environments, reducing Strap muscles loosen in loud environments, reducing

the amplitude of sound transmitted to inner earthe amplitude of sound transmitted to inner ear Effect is relatively slow (hundreds of milliseconds), Effect is relatively slow (hundreds of milliseconds),

so not useful for adaptation to rapid sounds so not useful for adaptation to rapid sounds (gunshots)(gunshots)

Page 9: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Impedance Mis-Match Between Water Impedance Mis-Match Between Water and Air: Without a Middle Ear, What and Air: Without a Middle Ear, What

Would You Hear?Would You Hear?

Continuity of pressure at the boundary:Continuity of pressure at the boundary:

(p(pa+a++p+pa-a-) = (p) = (pw+w++p+pw-w-)) Continuity of volume velocity at the boundary:Continuity of volume velocity at the boundary:

(A/(A/aaccaa) (p) (pa+a+ppa-a-) = (A/) = (A/wwccww) (p) (pw+w+ppw-w-)) Densities: Densities: a a = 0.001 g/cc, = 0.001 g/cc, w w = 1 g/cc= 1 g/cc Speeds of Sound: cSpeeds of Sound: caa = 354m/s, c = 354m/s, cww=1000m/s=1000m/s Suppose pSuppose pw-w-=0, meaning that the only input sound is p=0, meaning that the only input sound is pa+a+

Then…Then…

The reflected sound is: pThe reflected sound is: pa-a- = ( = (wwccwwaaccaa)/()/(wwccww++aaccaa) p) pa+a+ = 0.9994 p = 0.9994 pa+a+

The transmitted sound is: pThe transmitted sound is: pw+w+ = 2 = 2aaccaa/(/(wwccww++aaccaa) p) pa+a+ = 0.0006 p = 0.0006 pa+a+

pa+ pa- pw-

pw+

Scala Vestibuli, contents: perilymph ≈ sodium water

Auditory Canal, contents: air

Oval Window

A

Page 10: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Hammer-Anvil-Stirrup = Lever-Hammer-Anvil-Stirrup = Lever-Based Impedance Matching Based Impedance Matching

SystemSystem

Lever system reduces the effective input Lever system reduces the effective input impedance (z=p/v) of water by a factor of Limpedance (z=p/v) of water by a factor of L22

Resulting reflection coefficientResulting reflection coefficient

= (= (wwccwwLL22aaccaa)/()/(wwccww+L+L22aaccaa) ~ 0.98-0.99 < 1) ~ 0.98-0.99 < 1

L unitslength 1 unit length

Eardrum Velocity = (1/aca)(pa+pa-)

Pressure = (pa++pa-) Oval Window Velocity = (1/Laca)(pa+pa-)Pressure = L (pa++pa-)

Page 11: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Acoustic Impedance of Ear Canal Acoustic Impedance of Ear Canal Informative About Middle Ear Informative About Middle Ear

FunctionFunction

Remember how to calculate impedance?Remember how to calculate impedance?1.1. Impose a Boundary Condition at Far EndImpose a Boundary Condition at Far End

ppa-a-==ppa+a+

2.2. Calculate z=p/v at Near EndCalculate z=p/v at Near Endz = p/v = z = p/v = c (pc (pa+a+ee-jkx-jkx+p+pa-a-eejkxjkx)/(p)/(pa+a+ee-jkx-jkx-p-pa-a-eejkxjkx))

= = c (ec (e2jkL 2jkL + + )/(e)/(e-2jkL -2jkL )) So by measuring the acoustic input impedance So by measuring the acoustic input impedance

of the auditory canal very precisely, it’s possible of the auditory canal very precisely, it’s possible to deduce to deduce at different frequencies, and thus to at different frequencies, and thus to learn something about health of the middle ear learn something about health of the middle ear (product: Mimosa Acoustics)(product: Mimosa Acoustics)

pa+ pa-

Outer Ear, contents: air

Eardrum

-L 0x

Page 12: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Inner Ear AnatomyInner Ear Anatomy(image courtesy Alec Salt, Otolaryngology, Washington University)(image courtesy Alec Salt, Otolaryngology, Washington University)

Page 13: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Inner Ear Anatomy: Charged FluidsInner Ear Anatomy: Charged Fluids(image courtesy of Alec Salt, Otolaryngology, Washington University)(image courtesy of Alec Salt, Otolaryngology, Washington University)

Page 14: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Cross-Section of the Basilar Cross-Section of the Basilar MembraneMembrane

(image courtesy wikipedia)(image courtesy wikipedia)

Page 15: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Frequency Selectivity of Places on Frequency Selectivity of Places on the Basilar Membranethe Basilar Membrane

Unroll

Basilar Membrane (separates scala media & scala tympani)

Base, x=0mm:High StiffnessLow Massfc = (k/m)1/2/2 ~ 16000Hz

Apex, x~3cm:Low Stiffness

High Massfc = (k/m)1/2/2

~ 40Hz

Oval Window

In between: Each position, x, is tuned to a different mechanical resonancex(fc) ~ 30mm – (11mm) ln(1 + 46fc/(fc+14700))

Page 16: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Frequency Selectivity of Places on Frequency Selectivity of Places on the Basilar Membranethe Basilar Membrane

Wave pw+e-jx/c propagates forward at c=1000m/s until…

Wave energy is absorbed by oscillation of the basilar membrane at x(fc=/2)

Scala Vestibuli

Scala Tympani

Oval

Win

dow

Rou

nd

Win

dow

Page 17: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Frequency Selectivity of Places on Frequency Selectivity of Places on the Basilar Membranethe Basilar Membrane

Traveling waves in the Traveling waves in the cochlea.cochlea.

““Concerning the Concerning the pleasures of pleasures of observing, and the observing, and the mechanics of the mechanics of the inner ear,” inner ear,”

Nobel Lecture, 1961, Nobel Lecture, 1961, Georg von Békésy Georg von Békésy

(courtesy of Pacific (courtesy of Pacific Biosciences Research Biosciences Research Center Hawaii)Center Hawaii)

Page 18: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Bandwidth of the Auditory Filters: Bandwidth of the Auditory Filters: 100Hz at f100Hz at fcc<500Hz, 0.2f<500Hz, 0.2fcc at at

ffcc>500Hz>500Hz(image courtesy Julius Smith and Jonathan Abel, CCRMA, Stanford)(image courtesy Julius Smith and Jonathan Abel, CCRMA, Stanford)

Equivalent Equivalent RectangulRectangular ar Bandwidth Bandwidth (ERB) = (ERB) =

Bandwidth of Bandwidth of an ideal an ideal BPF that BPF that passes the passes the same total same total energy as energy as the basilar the basilar membrane membrane section at section at the same the same ffcc

Page 19: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Velocity of Basilar Membrane Velocity of Basilar Membrane Causes Inner Hair Cell Follicles to Causes Inner Hair Cell Follicles to

BendBend

Page 20: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Velocity of Basilar Membrane Velocity of Basilar Membrane Causes Inner Hair Cell Follicles to Causes Inner Hair Cell Follicles to

BendBend

Page 21: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Bending of Follicles Causes Bending of Follicles Causes Depolarization of IHCDepolarization of IHC

Scala Media (+80mV)

Organ of Corti (0mV)

Cations enter through follicle tips when follicles bend

Page 22: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Depolarization of IHC Causes Depolarization of IHC Causes Release of NeurotransmitterRelease of Neurotransmitter

Cations enter through follicle tips when follicles bend

Synapse, afferent neuron

Neurotransmitter released

Page 23: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Neurotransmitter DynamicsNeurotransmitter Dynamics(Three-Store Model: Meddis, JASA 1986)(Three-Store Model: Meddis, JASA 1986)

Result: probability of neuron firing is a smoothed (lowpass filtered) version of the IHC voltage

Neurotransmitter release (instant)

Neurotransmitter Re-uptake (several ms)

Neurotransmitter binding (several ms)

Page 24: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Signal Processing in the Inner Ear Signal Processing in the Inner Ear (Simulated)(Simulated)

Page 25: ECE 598: The Speech Chain Lecture 10: Auditory Physiology

Neural Response to a Synthetic Neural Response to a Synthetic VowelVowel

(Cariani, 2000)(Cariani, 2000)