Download - Audio Coding
![Page 1: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/1.jpg)
Audio Coding
Team Member:ChungMing Yan, Chun Tong
![Page 2: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/2.jpg)
Overview Mp3, AAC, Ogg Vorbis
• Technical specifications Test Results
• Sample clips Conclusion
![Page 3: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/3.jpg)
Mp3 MPEG1 Layer 3 Audio Coding A research project in EUREKA Digital
Audio Broadcasting (DAB) in 1987 A power of data reduction algorithm Standardized as ISO-MPEG Audio-Layer 3
![Page 4: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/4.jpg)
Mp3 (Continued)Pros: Fast Decoding Excellent hardware
support ISO standard
Cons: Quality varies widely
between encoders Even at highest
quality, quality still suffers
![Page 5: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/5.jpg)
Mp3 (Continue) Bit Rate: Average 128 kbps or 192 kbps Sampling Frequency:
16-24KHz (MPEG2 Layer 3)32-48KHz (MPEG1 Layer 3)
Parameters:Birate:
1. CBR (Constant Bitrate)2. VBR (Variable Bitrate)3. ABR (Average Bitrate)
![Page 6: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/6.jpg)
![Page 7: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/7.jpg)
Mp3 (Continue) Compression Techniques
• Huffman coding• Non-linear quantization• M/S Matrixing (Mid/side matrixing)• Intensity stereo• MDCT
![Page 8: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/8.jpg)
Mp3 (Continue) Encoders: Lame, Audio Catalysis Decoders: Winamp, Window Media
Player, etc.
![Page 9: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/9.jpg)
AAC MPEG2/MPEG4 Advanced Audio
Coding Developed by MPEG group (Dolby,
Frauhofer, AT&T, Sony…etc) More over of mp3
![Page 10: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/10.jpg)
AAC (Continued)Pros: Competitive at low
and mid bitrates against other formats
Decoders/Encoders work on all platforms
Cons: All high-quality
implementations of AAC encoding are non-free and closed source.
Relatively CPU intensive
![Page 11: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/11.jpg)
AAC (Continue) Bit Rate: 96 kbps, 128kbps, 196kbps Sampling Frequency:
48 full-bandwidth (up to 96 KHz)Low Frequency Enhancement (LFE to 120 KHz)
![Page 12: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/12.jpg)
![Page 13: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/13.jpg)
AAC (Continue) Profiles
• LC (Low Complexity)• Main• Main LTP
![Page 14: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/14.jpg)
AAC (Continue) Compression techniques
• Huffman coding• Non-linear quantization and scaling• Vector quantization• M/S matrixing (middle/side channels) for high bitrates• Intensity stereo for low bitrates • TNS (temporal noise shaping)• LTP(MPEG4 profile 2, reduce redundancy in successive
frames)• MDCT• PNS (perceptual noise shaping)
![Page 15: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/15.jpg)
AAC (Continue) Encoders: Psytel AacEnc, Nero Decoders: Winamp (with an AAC plug-
in), QuickTime 6
![Page 16: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/16.jpg)
Ogg Vorbis Open source project Free, open, unpatented from other audio
coding format
![Page 17: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/17.jpg)
Ogg Vorbis (Continue)Pros: Open source and
patent free No loyalties even in
commercial products
Cons: No commercial
hardware players High bitrates not fully
tuned
![Page 18: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/18.jpg)
Ogg Vorbis (Continue) Bit rate: ~64kbps Sampling Frequency:
From 8 KHz (telephony) to 192 KHz (Digital Masters)
![Page 19: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/19.jpg)
Ogg Vorbis (Continued) Compression techniques
• Huffman coding• MDCT (Cosine + Sine)• Wavelet in Vorbis II to improve quality
![Page 20: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/20.jpg)
Ogg Vorbis (Continue) Encoders: Besweet, OggDrop Decoders: Winamp (with an Ogg Vorbis
plug-in)
![Page 21: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/21.jpg)
Test Result Three music clip used
• Orchestra• Music with voice• Voice only
Different bitrate setting (switches)• High bitrate• Medium variable bitrate• Low bitrate• Additional switches (voice, pns)
![Page 22: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/22.jpg)
Sample clips Wav Mp3 Ogg AAC
![Page 23: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/23.jpg)
filesizes mp3: track 1 track 2 track 3 original 2470KB 2587KB 318KB cbr 256 (160 voice) 451 472 291 r3mix (96) 283 381 174 abr 48 83 91 90 abr 32 54 60 60/60
aac: track 1 track 2 track 3 original 2470KB 2587KB 318KB cbr 256 446 467 232 abr 96 217 231 145 abr 48 (tape, 40-59) 104 102 90 cbr 32 57 60.4 59.2 cbr 32 resampled 56.9 59.8 53.6
ogg: track 1 track 2 track 3 original 2470KB 2587KB 318KB cbr 256 /8 389 427 304 abr 96 /2 140 167 131 abr 48 /-1 63.9 98.7 80.7 abr 32 -the encoder cannot encode lower bitrate
![Page 24: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/24.jpg)
Switches used Mp3
• GUI automatically writes the proper command line• CBR 256 - "c:\EE3414\lame\lame.exe" -m s -b 256 -k "C:\EE3414\input.wav" "C:\
EE3414\output.mp3“• r3mix - "c:\EE3414\lame\lame.exe" --nspsytune --vbr-mtrh -V1 -mj -h -b96 --lowpass
19.5 --athtype 3 --ns-sfb21 2 -Z --scale 0.98 -X0 "C:\EE3414\input.wav" "C:\EE3414\output.mp3"
• abr 48 - "c:\EE3414\lame\lame.exe" --abr 48 -b 32 -B 320/160 "C:\EE3414\input.wav“ "C:\EE3414\output.mp3“
• abr 32 - "c:\EE3414\lame\lame.exe" --abr 32 -b 32 -B 320/160 "C:\EE3414\input.wav" "C:\EE3414\output.mp3"
• abr 32 voice - "c:\EE3414\lame\lame.exe" --voice --abr 32 -b 32 -B 160 "C:\EE3414\voice.wav" "C:\EE3414\voice(abr32-voice).mp3"
![Page 25: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/25.jpg)
Switches used (Continued) AAC cbr 256 -production -low_ath -profile 0 -br 256 abr 96 -production -profile 0 -br 96 –vbrhi abr 48 -tape abr 32 -br 32 abr 32 -br 32 -resample 22050
![Page 26: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/26.jpg)
Switches used (continued) Ogg Vorbis Lacking control parameters besides
“quality”• cbr 256 GUI quality set to 8• abr 96 GUI quality set to 2• abr 48 GUI quality set to -1• abr 32 such bitrate is not possible with
given tools even with manual bitrate tweaking
![Page 27: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/27.jpg)
Conclusion ACC = Ogg > MP3
• There are very little differences, very hard to tell
Depends on application Alternative Audio Coding
• Lossless encoding• Monkey audio
• Speech specific• Speez
![Page 28: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/28.jpg)
Future research or improvements As technology improves, there will be
newer coding schemes to be examined More extensive research of the
parameters and encoding procedures Matlab waveform analysis (object
analysis) Alternative Implementation
![Page 29: Audio Coding](https://reader035.vdocuments.us/reader035/viewer/2022081507/5681638d550346895dd481db/html5/thumbnails/29.jpg)
Resources http://www.audiocoding.com http://lame.sourceforge.net http://www.vorbis.com
Team website• http://chii.servehttp.com:10240/ee3414