digital audio. acknowledgement some part of this lecture note has been taken from multimedia course...
TRANSCRIPT
![Page 1: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/1.jpg)
Digital Audio
![Page 2: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/2.jpg)
Acknowledgement
Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul L. Browning's thesis report "Audio Digital Signal Processing in Real Time." , available at http://www.tcicomp.com/paul/dsp/
Additional information may be found at http://www.sonicspot.com
![Page 3: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/3.jpg)
Sound Sound (to a physicist)
is a pressure wave which travels in air at 331ms-1 at 0 degrees: at 343m/s at 20 degrees C
with a frequency between 20 and 20,000 Hz (variations/second)
and to a Psychologist... Sound is a perceptual effect caused by a pressure w
ave between 20 and 20,000Hz being detected at the ear.
![Page 4: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/4.jpg)
Sound
The vibration of matter, which in turn causes the surrounding air to vibrate, produces sound.
The resulting pattern of oscillation is known as a waveform.
A waveform may repeat its shape at regular intervals, known as the period.
![Page 5: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/5.jpg)
Physical characteristics of soundThe pressure wave has two physical characteristics: amplitude
the size of the pressure wave that is, the size of the rarefactions and compressions in the
pressure wave frequency
the number of compressions (or rarefactions) per second related to
the period of the sound = 1/frequency, and the wavelength of the sound = Cycle of sound/frequency
![Page 6: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/6.jpg)
![Page 7: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/7.jpg)
![Page 8: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/8.jpg)
The frequency of a sound is the reciprocal of its period.
Frequency represents the number of waveform periods in a second and is expressed by hertz (Hz) or cycles per second.
![Page 9: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/9.jpg)
Characteristics of real sounds Sound waveform
Frequency spectrum
![Page 10: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/10.jpg)
Devices for sound generation and transduction For input to a computer, the pressure wave is
converted to an analogue electrical signal (transduced) converted to a digital signal (digitised)
For output from a computer, the digitised signal is converted to an analogue signal converted to a pressure wave
![Page 11: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/11.jpg)
Psychological characteristics of sound (I)From the perspective of sound being what we hear,
sound has three defining characteristics: loudness
how loud (intense) the sound appears pitch
the sense of the sound having a tone timbre
the nature of the sound as befits psychological descriptions, these are
inexact All sounds have a loudness, but many are unpitched timbre is often used as a catch-all term to describe
those aspects of the sound not captured by loudness and pitch.
![Page 12: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/12.jpg)
Pitch and loudness Pitch perception is complex Complex tones (many frequency
components) often have a lower pitch than a pure tone of the same mean frequency
Apparent loudness of a sound depends on the frequency as well as the amplitude of the sound human ear responds differently to different
frequencies young people can often hear a higher frequencies
than older people.
![Page 13: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/13.jpg)
Measuring loudness Our ears have (essentially) a logarithmic response
loudness depends on power: proportional to amplitude * amplitude
doubling the power of a sound does not make it twice as loud actually, (real, perceptual) loudness is difficult to compute
Decibels the ratio of the power of two signals is measured in decibels
(dB) this is a logarithmic scale if signal1 has power P1, and signal2 has power P2, then P2 is 10 log10(P2/P1) dB louder than P1 e.g. If P2 has 100 times the power of P1, it is 20dB louder
0 dB is the threshold for a human to hear a sound of 1000Hz
20dB whisper , 90dB loud music , 100dB risking damage , 140dB aeroplane
![Page 14: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/14.jpg)
Quality and Fidelity (I) Whenever sound is transduced, digitised, or reconverted
to analogue, the original signal is altered in some way. When high quality reproduction is required, we need to
keep this alteration to a minimum. Transduction:
Microphones and loudspeakers have a limited frequency response that is, they are more sensitive to sounds with certain frequencies we would like a flat frequency response from 20 to 20KHz
They also have a limited dynamic range that is they cannot deal with sounds from the quietest up to the
loudest the range in energy of everyday sounds is huge
For some applications, we may sacrifice quality e.g. telephony: we care really only about comprehensibility
![Page 15: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/15.jpg)
Quality and Fidelity (II) Digitizing Sound
sound is digitised using an analogue to digital converter (ADC)
sound is converted back to analogue using a digital to analogue converter (DAC)
Both forms of conversion can introduce alterations in the sound but the ADC is the more problematic.
Analogue to digital conversion has two parameters: sampling rate sample size
![Page 16: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/16.jpg)
Sampling rate Sampling rate describes how frequently the
analogue signal is converted Normally measured in samples/second
conversion is done regularly, at a fixed number of samples/second
sampling rate must be at least twice the highest frequency of interest Nyquist sampling theorem otherwise aliasing can occur
![Page 17: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/17.jpg)
Signal reconstruction Quantization
Sample and hold reconstruction
![Page 18: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/18.jpg)
Aliasing Aliasing occurs if a sound is sampled too slowly
Better...
![Page 19: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/19.jpg)
Sample size (I) Sample size refers to the characteristics of the
sample value taken each sample time Samples have a fixed length
8-bit, (16-bit or 32-bit) means each sample is a (2’s complement) 8-bit (16-bit or
32-bit) integer e.g. range -128 to +127 for 8-bit; -32768 to +32767 for 16-
bit
![Page 20: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/20.jpg)
Sample size (II) Sampling may be linear or logarithmic
linear: for sample value x, actual value is (x/maximum)* K for some K
logarithmic: provides more resolution at lower levels mu-law (m-law) or A-law a form of data compression
![Page 21: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/21.jpg)
Sample size (III) Major concern for storage of a sampled
sound is the total amount of data collected Data length is proportional to sample rate *
sample size so 1 second of sound sampled at 44100 16 bit
samples/second uses 44100 * 2 = 88200 bytes/second
and that is just 1 channel: stereo takes 176400 bytes/second about 10.5Mbytes/minute
this is CD-audio quality Data can be compressed
but decompression must take place in real time
![Page 22: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/22.jpg)
Power and loudness: dynamic range Dynamic range
loudest measurable signal compared to quietest signal Measured using decibels
if signal1 has power P1, and signal2 has power P2, then P2 is 10 log10(P2/P1) dB louder than P1
How this relates to sample size consider the loudest signal possible, and the quietest signal
possible for 16 bit samples, the loudest one has 32000 times as
high a value but this value is a voltage: power is proportional to
voltage * voltage so it has 32000 * 32000 as much power that is, about 1,000,000,000 times as much that is 90dB dynamic range. Note that the accuracy for quiet sounds is low.
![Page 23: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/23.jpg)
Volume Scaling The volume of a digitized audio source may be incre
ased or decreased by multiplying each sample value by an appropriate constant.
This process is also known as applying a "gain" to the audio.
A scale factor of 1.5 will increase the waveform amplitude or volume by 50%.
A scale factor of 0.25 will reduce the waveform amplitude or volume to 25% of its original value.
The pseudo code for the volume scaling effect is given below.
For i = 1 to NumSamplesResult[i] = Samples[i] * scaleFactor
![Page 24: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/24.jpg)
Mixing Two waveforms can be "mixed" together by a
dding corresponding samples of each source waveform.
It may sometimes be necessary to clamp the result values to lie within the dynamic audio range represented by the 8 or 16 bit sample values.
For i = 1 to NumSamples// Assume both waveforms have same number of samples
Result[i] = Samples1[i] + Samples2[i]
![Page 25: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/25.jpg)
Example of mixing two audio waveforms
Wave 1 Wave 2 Wave 1 + Wave 2
![Page 26: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/26.jpg)
Echo An echo is a delayed and volume reduced copy of th
e original sound mixed with the original sound. Assume sound travels at a speed of 340 meters per
second. To simulate the effect of sound bouncing off a wall 2
0 meters away, the sound travels 20 + 20 meters over a duration of 0.117 seconds (40 m / 340 m per second).
If audio is processed at 11,000 samples per second, then the delay factor in number of samples is 0.117*11,000 = 1,287.
By convention, the first n samples remain unchanged since no preceding samples are available.
The resulting audio data will be n samples longer due to the echo.
![Page 27: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/27.jpg)
For i = 1 to NumSamples+n
if (i < n)
Result[i] = Samples[i]
else if (i >= n) and (i <= NumSamples)
Result[i] = Samples[i] + scaleFactor*Samples[i – n]
else
Result[i] = scaleFactor*Samples[i-n]
![Page 28: Digital Audio. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul](https://reader035.vdocuments.us/reader035/viewer/2022062519/5697bfb81a28abf838c9f0d8/html5/thumbnails/28.jpg)
Reverberation Reverberation is the combination of multiple echo ef
fects, from different distances and with different attenuation factors.
Reverb simulates the effect of multiple echoes caused by sound bouncing back and forth within an enclosed space.
After each subsequent bounce, the echo produced will have a further reduction in volume and additional delay.
Generally, reverberation in a small room decays much faster than reverberation in a large room, because in a small room the sound waves collide with walls much more frequently, and thus are absorbed more quickly, than in a large room.