lab_week_8

Upload: ganadim9795

Post on 14-Apr-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/27/2019 lab_week_8

    1/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 1

    ENEE408G: Capstone Design Project:Multimedia Signal Processing

    Design Project 4:Digital Audio Processing

  • 7/27/2019 lab_week_8

    2/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 2

    Outline of Design Project 4

    Part I : Perceptual Coding and MP3

    Part II : Digital Audio Watermarking

    Part III: Synthetic Audio (I): MIDI

    Part IV: Essays: IP rights on Digital Media

    Part V: Mobile Computing and Pocket PC

    Programming Bonus part I: Synthetic Audio (II): MPEG4-SA

  • 7/27/2019 lab_week_8

    3/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 3

    Part I. Perceptual Coding & MP3 (1)

    Anatomy of Human Ear

  • 7/27/2019 lab_week_8

    4/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 4

    Part I. Perceptual Coding & MP3 (2)

    Absolute Threshold of Hearing

    102

    103

    104

    -10

    -5

    0

    5

    10

    15

    20

    25

    30

    35

    40Stage 2: Measure Absolute Threshold of Hearing

    Frequency(Hz)

    Related

    Sou

    ndPressureLevelto4KH

    z,rSPL(dB)

    Left Click mouse to adjust the volume. Right click to Exit

  • 7/27/2019 lab_week_8

    5/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 5

    Part I. Perceptual Coding & MP3 (3)

    2900 3000 3100 3200 3300 3400 3500 3600 3700 3800 3900-10

    0

    10

    20

    30

    40

    50

    60Stage 2: Measure simultaneous masking

    Frequency(Hz)

    Related

    Sou

    ndPressure

    Levelto

    4KH

    z,rSPL(dB)

    Left Click mouse to adjust the volume. Right click to Exit

    Simultaneous Masking

  • 7/27/2019 lab_week_8

    6/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 6

    Part I. Perceptual Coding & MP3 (4)

    Audio Extraction by GoldWave

    Compare spectrum for original audio andreconstructed MP3 audio

    difference

  • 7/27/2019 lab_week_8

    7/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 7

    Pat II. Digital Audio Watermarking (1)

    Watermark Embedding, Detection, and Attack

    Use AudioMark to embed, detect, and attack

    Digital Audio Watermarking Design Digital Watermarking in Time Domain

    Digital Watermarking in Frequency Domain

    Pseudo-Noise Sequence (Optional)

  • 7/27/2019 lab_week_8

    8/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 8

    Part II. Digital Audio Watermarking (2)

    Pseudo-Noise Sequence

    1 2 3 p

    Logic

    Flip-Flop

    ...... output

    sequence

    clock

    For more reference: : Simon Haykin: Communication Systems, 4thedition, Wiley, 2000

  • 7/27/2019 lab_week_8

    9/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 9

    Part II. Digital Audio Watermarking (3)

    Time Domain

    Convert message into bits

    Put the bits into LSBS of the audio samples

    Repeatedly embed each message bit in a numberof audio samples for a more reliable detection

  • 7/27/2019 lab_week_8

    10/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 10

    Part II. Digital Audio Watermarking (4)

    DCT Domain

    DCTAudio file in

    question

    DCTOriginal

    audio file

    Extractmid-frequency

    part

    +

    _

    iDCTwatermark

    detector

    Detectionresult

    Ownershipwatermark

    Noise-like seq.

    Watermark

    Key/Seed

    Segmentation

    (frame size L)Audio File iDCT

    Watermarked

    audio fileDCT

    added in mid-

    frequency

    region

    V(j) = V(j) + a(j) W(j)

    V(j) V(j)

    W(j) e.g., a(j) = 0.05 |V(j)|

    Noise-like seq.

    Watermark

    Key/Seed

    Segmentation

    (frame size L)Audio File iDCT

    Watermarked

    audio fileDCT

    added in mid-

    frequency

    region

    V(j) = V(j) + a(j) W(j)

    V(j) V(j)

    W(j) e.g., a(j) = 0.05 |V(j)|

  • 7/27/2019 lab_week_8

    11/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 11

    Part III. Synthetic Audio (I)

    Musical Instrument Digital Interface (MIDI)

  • 7/27/2019 lab_week_8

    12/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 12

    Part IV. Essay and Ethics Assignment

    IP Rights in Digital Media in EngineeringEthics

    Intellectual Property and copyright play important rolesin contemporary society. Discuss the issues of IP rights

    protection for digital multimedia among your team, andinterview your friends/families to see what they thinkabout this. Write an essay to report and summarize your

    friends and opinions Ethics essay: follow Part IV instruction (due in two weeks)

    Ethics discussion and assignment is on Wednesday 04/06(due date will be announced later)

  • 7/27/2019 lab_week_8

    13/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 13

    Part V. Pocket PC Programming

    From the above three parts, we have learned the

    fundamentals of digital audio processing. Now

    design a simple Pocket PC application related to

    digital audio processing using the Microsoft

    eMbedded Tools.

    Ideas:

    Demo Project 4(1): Piano Demo Project 4(2): TCP/Playsound

    Demo Project 4(3): MP3/MIDI player

  • 7/27/2019 lab_week_8

    14/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 14

    Bonus Part I. Synthetic Audio (II)

    Structured Audio: One component in the MPEG audio standard.

    MPEG 4

    audio systemvideo

    SA

    Natural coding Synthetic coding

    AAC T/F CELP Parametric TTS

    ISO/IEC 14496-3 sec5

    From John

    Wawrzynek's talk

  • 7/27/2019 lab_week_8

    15/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 15

    Bonus Part I. Synthetic Audio (II)

    Idea: Musical performance have temporal structurethat changes over several timescales

    MPEG-4 Structured-Audio Structured Audio Orchestra Language (SAOL)

    Structured Audio Score Language (SASL)

    Sample-by-sample

    10s of usec

    Amplitude & timbre envelopes: 10s of msec

    Note-by-note: 100s of msec

    From John

    Wawrzynek's talk

    For more reference: John Wawrzynek's talk http://bmrc.berkeley.edu/bibs/instance?prog=1&group=13&inst=35

    http://bmrc.berkeley.edu/bibs/instance?prog=1&group=13&inst=35http://bmrc.berkeley.edu/bibs/instance?prog=1&group=13&inst=35
  • 7/27/2019 lab_week_8

    16/18

  • 7/27/2019 lab_week_8

    17/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 17

    Announcement

    All files related to this project can be found intheAudioZip.zipand the bonus part can also

    be found in Bonus.zip. The grade for this project weights 12% of the

    final grade: 8% for technical tasks and 4%for the ethics essay and assignment.

    Matlab tasks: 2 (Part II. 2), Pocket PC task: 1

  • 7/27/2019 lab_week_8

    18/18

    03/18/2005 ENEE408G Spring 2005 Multimedia Signal Processing 18

    Announcement

    Project reports and Matlab files need to beimproved in their structures and descriptions.

    TA evaluation.