speech recognition ppt

23

Upload: bharath-vaishnov

Post on 21-Sep-2014

121 views

Category:

Documents


2 download

DESCRIPTION

concepts of speech recognition made easy

TRANSCRIPT

Page 1: Speech Recognition Ppt
Page 2: Speech Recognition Ppt

BIOMETRICS

• USES PHYSICAL OR BEHAVIOURAL CHARACTERISTICS

• DIFFERENT TYPES OF BIOMETRICS

• BIOMETRIC SYSTEM

Page 3: Speech Recognition Ppt

VOICE RECOGNITION

• RECOGNIZES PEOPLE FROM THEIR VOICES

• VOICE IS A UNIQUE CHARACTER TRAIT

• VOICE DEPENDS ON MANY FACTORS

• TYPES OF VOICE RECOGNITION

Page 4: Speech Recognition Ppt

SPEECH AND SPEAKER RECOGNTION

• SPEECH->WHAT?

• SPEAKER->WHO?

Page 5: Speech Recognition Ppt

SPEAKER RECOGNITION

Page 6: Speech Recognition Ppt

Speaker Recognition

Classification –Styles of input

Speaker Identification

Speaker Verification

A technology that verifies a speaker’s identity based on the speaker’s voice.

Technology of determining an unknown speaker's identity.

Page 7: Speech Recognition Ppt

Classification – Styles of input

Speaker Recognition

Text Independent Text Prompted

Text Dependent

Fixed Phrase Fixed Phrase

Page 8: Speech Recognition Ppt
Page 9: Speech Recognition Ppt

Speaker #1 Speaker #n

Speaker #2

...

Feature Extractor

Hypotheses space

Speaker Modeling

Under preparation

Page 10: Speech Recognition Ppt

Hypotheses space

Hypothesis representation

Desired hypothesis

define

SearchTraining

examples

Best fit?

Page 11: Speech Recognition Ppt
Page 12: Speech Recognition Ppt

Markov Models

• Model to capture extracted voice features

• Probablistic process

• state diagrams- states , transitions

• applied in weather forecasts,dna modelling x — hidden states

y — observable outputsa — transition probabilitiesb — output probabilities

Page 13: Speech Recognition Ppt

• Coin toss method

• 2 coins A,B: head or tail of any one may appear which is a probability

• feature vectors correspond to head /tail

• state corresponds to the coins A/B

Page 14: Speech Recognition Ppt

Hidden markov models• Observations are probablistic functions

of states: urn and ball model

• main elements of hmm

• {1,2…N}-individual states ,the initial state time being qt

• {v1,v1…vm}-observation symbols

• state -transition probability

Page 15: Speech Recognition Ppt

• Observation symbol probability distribution

• B = {bik = P(ok | qi)}

• initial state distribution

• Π = {pi = P(qi at t=1)}.

• System is given by:

• F= (A, B, Π).

Page 16: Speech Recognition Ppt
Page 17: Speech Recognition Ppt

Baye’s rule

P(A).P(B|A) = P(B).P(A|B)

P(Fj|OT) = P(OT|Fj).P(Fj) P(OT)

Fj speaker model

OT feature vector of test utterance

Page 18: Speech Recognition Ppt

SPEAKER IDENTIFICATION

The model with maximum probability of P(Fj|OT) is identified as speaker.

Speaker X

Speaker Y

Speaker Z

Speaker with max probability

Identified

Feature vectorsO1,O2,….OT

Page 19: Speech Recognition Ppt

SPEAKER VERIFICATION

Target model FA

Background models FB

P(FA|OT) P(OT|FA).P(FA)/P(OT)

P(FB|OT) P(OT|FB).P(FB)/P(OT)

taking log

X = log[P(OT|FA)] – log[P(OT|FB)]

Page 20: Speech Recognition Ppt

Speaker Y

Imposter 1

Imposter 2

Imposter 3

Feature vectorsO1,O2,..

OT

X

X>=0, acceptX<0, reject

Speaker verification

Page 21: Speech Recognition Ppt

APPLICATIONS OF SPEAKER RECOGNITION

• USED TO SECURE OUR COMPUTER PASSWORDS

• USED IN ATM’S

• POTENTIAL APLLICATIONS

Page 22: Speech Recognition Ppt

ADVANTAGES OF SPEAKER RECOGNITION

• IN THE WORLD OF COMPUTER AND INTERNET

• IN THE FIELD OF CREDIT CARD,DEBIT CARDS AND ATM

• POTENTIAL COSTUMERS

Page 23: Speech Recognition Ppt

DISADVANTAGES OF SPEAKER RECOGNITION

• LANGUAGE PROBLEMS

• CAN BE EASILY OPENED BY MIMICRYING

• IN CASE OF TEMPERORY USE WE CANT HAVE THIS SYSTEM

• TECHNICAL AND HARDWARE PROBLEMS