Enhancement of Electrolaryngeal Speech by Reducing Leakage Noise Using Spectral Subtraction

Prem C. Pandey, EE Dept, IIT Bombay

Electro Info Com’2007 / St Francis Inst. of Technology, Mumbai / 4-6 Jan’07

Outline: 1 Introduction, 2 Spectral subtraction, 3 QBNE, 4 Results, 5 Conclusion & future work

Page 1

Enhancement of Electrolaryngeal Speech

by Reducing Leakage Noise Using Spectral Subtraction

by

Prem C. Pandey

< [email protected] >

EE Dept, IIT Bombay

Electro Info Com’2007 / St Francis Inst. of Technology, Mumbai / 4-6 Jan’07

Page 2

Abstract

A transcervical electrolarynx is a vibrator held against the neck tissue to provide excitation to the vocal tract, as a substitute for the excitation provided by a natural larynx. It is of great help in verbal communication for a large number of laryngectomees. The intelligibility of the speech it produces suffers from background noise caused by leakage of acoustic energy from the vibrator. Pitch-synchronous application of the spectral subtraction method, normally used for enhancement of speech corrupted by uncorrelated random noise, can be used to reduce this self-leakage noise and so enhance electrolaryngeal speech. The average magnitude spectrum of the leakage noise, obtained with the lips closed, is subtracted from the magnitude spectrum of the noisy speech, and the signal is reconstructed using the original phase spectrum. However, the spectrum of the leakage noise varies with the application pressure and the movement of the throat tissue. A quantile-based dynamic estimation of the noise magnitude spectrum, which does not need silence/voice detection, was found to be effective in noise reduction.

Page 3

Overview

● Introduction

● Spectral subtraction for enhancement of electrolaryngeal speech

● Quantile-based noise estimation

● Results, summary, & ongoing work

Page 4

Natural speech production

Glottal excitation to vocal tract

Introduction 1/5

Page 5

External electronic larynx (Barney et al 1959)

Excitation to vocal tract from external vibrator

Introduction 2/5

Page 6

Problems with artificial larynx

● Difficulty in coordinating controls

● Spectrally deficient

● Unvoiced segments substituted by voiced segments

● Background noise due to leakage of acoustic energy

Introduction 3/5

Page 7

Model of noise generation

Causes of noise generation:

● Leakage of vibrations produced by vibrator membrane

● Improper coupling of vibrator to neck tissue

Introduction 4/5

Page 8

Methods of noise reduction

Vibrator design

● Acoustic shielding of vibrator (Espy-Wilson et al 1996)

● Piezoelectric vibrators (Katsutoshi et al 1999)

Signal processing

● 2-input noise cancellation based on LMS algorithm (Espy-Wilson et al 1996)

● Single-input noise cancellation (Pandey et al 2002) based on spectral subtraction algorithm (Boll 1979 & Berouti et al 1979)

Introduction 5/5

Page 9

Spectral subtraction for enhancement of electrolaryngeal speech (Pandey et al 2000)

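$$s(n) = e(n) * h_v(n), \qquad l(n) = e(n) * h_l(n)$$

$$x(n) = s(n) + l(n)$$

$$X_n(e^{j\omega}) = E_n(e^{j\omega})\,[H_{vn}(e^{j\omega}) + H_{ln}(e^{j\omega})]$$

Assumption: $h_v(n)$ and $h_l(n)$ uncorrelated

$$|X_n(e^{j\omega})|^2 = |E_n(e^{j\omega})|^2\,[\,|H_{vn}(e^{j\omega})|^2 + |H_{ln}(e^{j\omega})|^2\,]$$

Noise estimation mode: $s(n) = 0$

$$|X_n(e^{j\omega})|^2 = |L_n(e^{j\omega})|^2 = |E_n(e^{j\omega})|^2\,|H_{ln}(e^{j\omega})|^2$$

$|L(e^{j\omega})|^2$: averaged over many segments

Speech enhancement mode:

$$|Y_n(e^{j\omega})|^2 = |X_n(e^{j\omega})|^2 - |L(e^{j\omega})|^2$$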
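The step from the spectrum of the noisy signal to its squared magnitude can be made explicit. The following is a short derivation sketch, not taken from the slides; the averaging notation $\langle\cdot\rangle$ over segments is introduced here for illustration, and the vanishing of the cross term is the usual reading of the "uncorrelated" assumption.

```latex
\begin{align*}
|X_n(e^{j\omega})|^2
  &= |E_n(e^{j\omega})|^2 \,\bigl|H_{vn}(e^{j\omega}) + H_{ln}(e^{j\omega})\bigr|^2 \\
  &= |E_n(e^{j\omega})|^2 \,\Bigl[\,|H_{vn}(e^{j\omega})|^2 + |H_{ln}(e^{j\omega})|^2
     + 2\,\mathrm{Re}\bigl\{H_{vn}(e^{j\omega})\,H_{ln}^{*}(e^{j\omega})\bigr\}\Bigr] \\
\bigl\langle \mathrm{Re}\{H_{vn} H_{ln}^{*}\} \bigr\rangle \approx 0
  \;&\Rightarrow\;
  |X_n(e^{j\omega})|^2 \approx |E_n(e^{j\omega})|^2 \,\bigl[\,|H_{vn}(e^{j\omega})|^2 + |H_{ln}(e^{j\omega})|^2\,\bigr]
\end{align*}
```

With $s(n) = 0$ the vocal-tract term drops out, which is why the spectrum measured in noise estimation mode can stand in for the leakage term during enhancement.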

contd…

Spect. subtrn. 1/4

Page 10

Implementation using DFT

$$|Y_n(k)|^2 = |X_n(k)|^2 - |L(k)|^2$$

$$y_n(m) = \mathrm{IDFT}\,[\,|Y_n(k)|\,e^{j\angle X_n(k)}\,]$$
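The basic subtraction and reconstruction steps can be sketched in Python as follows. This is a minimal illustration, not the author's implementation: the Hann window, the clipping of negative differences to zero, and the function names are assumptions made here.

```python
import numpy as np

def average_noise_power(noise_frames):
    """Average |L(k)|^2 over noise-only frames (one frame per row)."""
    window = np.hanning(noise_frames.shape[1])  # assumed window
    spectra = np.fft.rfft(noise_frames * window, axis=1)
    return np.mean(np.abs(spectra) ** 2, axis=0)

def subtract_frame(frame, noise_power):
    """Power-domain subtraction, then inverse DFT with the noisy phase."""
    window = np.hanning(len(frame))
    X = np.fft.rfft(frame * window)
    # |Y(k)|^2 = |X(k)|^2 - |L(k)|^2; negative values are clipped to zero
    # here (an assumption) so that the square root stays real.
    Y_power = np.maximum(np.abs(X) ** 2 - noise_power, 0.0)
    Y = np.sqrt(Y_power) * np.exp(1j * np.angle(X))
    return np.fft.irfft(Y, n=len(frame))

# Hypothetical usage on synthetic data: 20 noise-only frames of 128 samples.
rng = np.random.default_rng(0)
noise_frames = 0.1 * rng.standard_normal((20, 128))
noise_power = average_noise_power(noise_frames)
enhanced = subtract_frame(noise_frames[0], noise_power)
```

A complete system would also overlap-add the processed frames; that step is left out to keep the sketch short.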

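Modified spectral subtraction (Berouti et al 1979)

$$|Y_n(k)|^{\gamma} = |X_n(k)|^{\gamma} - \alpha\,|L(k)|^{\gamma}$$

$$|Y'_n(k)|^{\gamma} = \begin{cases} |Y_n(k)|^{\gamma} & \text{if } |Y_n(k)|^{\gamma} > \beta\,|L(k)|^{\gamma} \\ \beta\,|L(k)|^{\gamma} & \text{otherwise} \end{cases}$$

($\alpha$: subtraction, $\beta$: spectral floor, $\gamma$: exp. factors)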
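The modified subtraction rule with its three factors can be sketched as below. The function name is hypothetical, and the defaults are the values quoted on the results slides (α = 2, β = 0.001, γ = 1); the output normalization factor is not included.

```python
import numpy as np

def modified_subtraction(X_mag, L_mag, alpha=2.0, beta=0.001, gamma=1.0):
    """Return the enhanced magnitude spectrum for one frame.

    X_mag: magnitude spectrum of the noisy frame, |X_n(k)|
    L_mag: estimated noise magnitude spectrum, |L(k)|
    """
    Xg = X_mag ** gamma
    Lg = L_mag ** gamma
    Yg = Xg - alpha * Lg                  # over-subtraction by factor alpha
    floor = beta * Lg                     # spectral floor
    Yg = np.where(Yg > floor, Yg, floor)  # keep result at or above the floor
    return Yg ** (1.0 / gamma)

# Hypothetical two-bin example: the second bin falls below the floor.
Y_mag = modified_subtraction(np.array([10.0, 1.0]), np.array([2.0, 2.0]))
```

With α = 1, β = 0 and γ = 2 the same function reduces to the basic power-domain subtraction.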

Output normalization factor for $\gamma < 1$ (Berouti et al 1979)

$G = \{(|X_n(k)|^2 - |L(k)|^2)\,/\,|Y_n(k)|^2\}/\ldots$

Spect. subtrn. 2/4

Page 11

Spectral subtraction method with ABNE (Pandey et al 2002)

Spect. subtrn. 3/4

Page 12

Drawback of averaged noise estimation during silence

● Two modes: noise estimation & speech enhancement

● Estimated noise considered stationary over entire speech enhancement mode

● Some musical & broadband noise in the output

Investigations for continuous noise estimation & signal enhancement

● System with voice activity detector (Berouti et al 1979)

● Without involving speech vs non-speech detection (Stahl et al 2000, Evans et al 2002, Houwu et al 2002)

Spect. subtrn. 4/4

Page 13

Quantile-based noise estimation

Basis for the technique

● During speech segments, frequency bins tend not to be permanently occupied by speech

● Speech / non-speech boundaries detected implicitly on a per-frequency basis

● Noise estimates updated throughout non-speech and speech periods
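The idea behind these points can be illustrated with a small sketch: for each frequency bin, a low-to-middle quantile of the recent magnitude values tracks the noise level even when some frames contain speech. The function name and the choice q = 0.5 are assumptions for illustration only.

```python
import numpy as np

def qbne_estimate(mag_frames, q=0.5):
    """Per-bin q-th quantile over time of a (frames, bins) magnitude array."""
    return np.quantile(mag_frames, q, axis=0)

# Synthetic example: noise level 1.0 everywhere; bin 0 carries extra
# "speech" energy in 4 of the 10 frames.
mags = np.ones((10, 3))
mags[:4, 0] = 5.0
noise_est = qbne_estimate(mags)   # quantile-based estimate per bin
mean_est = mags.mean(axis=0)      # plain average, biased upward by speech
```

In this toy case the median stays at the noise level in bin 0 while the plain average is pulled up by the speech frames, which is the property the technique relies on.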

QBNE 1/6

Page 14

Implementation of QBNE

● DFT of windowed speech segments

● A FIFO array of past spectral values is formed for each frequency sample

● An efficient indexing algorithm is used to sort the arrays and obtain a particular quantile value:

– A sorted-value buffer and an index buffer are kept for each frequency sample

– New data is placed at the location of the oldest data in the sorted buffer by referring to the index buffer

– In each sorted buffer, only one value needs to be placed at its correct position
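The incremental update described above can be sketched for a single frequency sample as follows. This is not the index-buffer scheme of the slides: it swaps in a binary search (Python's `bisect`) to locate the oldest value in the sorted buffer, which keeps the same property that only one value is removed and one inserted per frame. The class name and the quantile indexing rule are assumptions.

```python
from bisect import bisect_left, insort
from collections import deque

class SlidingQuantile:
    """Running quantile over the last `size` values of one frequency sample."""

    def __init__(self, size, q):
        self.size = size
        self.q = q
        self.fifo = deque()     # values in arrival order
        self.sorted_vals = []   # the same values in ascending order

    def update(self, value):
        if len(self.fifo) == self.size:
            oldest = self.fifo.popleft()
            # Remove the oldest value from the sorted buffer.
            del self.sorted_vals[bisect_left(self.sorted_vals, oldest)]
        self.fifo.append(value)
        insort(self.sorted_vals, value)  # place the new value in order
        idx = int(self.q * (len(self.sorted_vals) - 1))
        return self.sorted_vals[idx]

# Hypothetical usage: median of the last three magnitudes.
tracker = SlidingQuantile(size=3, q=0.5)
medians = [tracker.update(v) for v in [5, 1, 3, 10, 2]]
```

One such tracker would be kept per frequency sample, and the returned value used as the noise magnitude estimate for that sample.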

QBNE 2/6

Page 15

Spectral subtraction with QBNE

QBNE 3/6

Page 16

Investigations with QBNE

● Single quantile value: the quantile that gives the best visual match between the quantile-derived spectrum and the average spectrum of the noise is selected

● Two quantile values: two quantiles, one for each of two frequency bands, that give noise estimates close to the average spectrum of the noise were selected

● Frequency-dependent quantile values: the spectrum estimated from noisy speech will closely match the average spectrum of the noise
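One way to pick frequency-dependent quantiles is to search, per frequency sample, for the candidate quantile whose value is closest to the averaged noise spectrum. The sketch below uses minimum absolute difference as the matching criterion, which is an assumption (the slides mention visual matching for the single-quantile case); the function name and candidate grid are also hypothetical.

```python
import numpy as np

def matched_quantiles(speech_mags, noise_avg, candidates=None):
    """For each bin, pick the candidate quantile closest to the noise average.

    speech_mags: (frames, bins) magnitudes of noisy speech
    noise_avg:   (bins,) averaged noise magnitude spectrum
    """
    if candidates is None:
        candidates = np.linspace(0.05, 0.95, 19)  # assumed search grid
    candidates = np.asarray(candidates)
    # Quantile spectra for all candidates: shape (len(candidates), bins).
    q_spec = np.quantile(speech_mags, candidates, axis=0)
    best = np.argmin(np.abs(q_spec - noise_avg), axis=0)
    return candidates[best]

# Synthetic check: bin 0 holds 0..10, bin 1 holds 0..20 in steps of 2.
mags = np.column_stack([np.arange(11.0), 2.0 * np.arange(11.0)])
q_k = matched_quantiles(mags, np.array([2.0, 10.0]), [0.1, 0.2, 0.5, 0.9])
```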

QBNE 4/6

Page 17

Investigations with QBNE (contd.)

● Smoothed quantile values: matched quantiles were averaged over 9 frequency values

● SNR-based dynamic quantiles: dynamic selection of quantiles depending on signal strength

$$q(k) = \left[(q_1(k) - q_0(k))\,\mathrm{SNR}(k) / \mathrm{SNR}_1(k)\right] + q_0(k)$$

$$q(k) = q_0(k) \ \text{if}\ q(k) < 0, \qquad q(k) = q_1(k) \ \text{if}\ q(k) > q_1(k)$$

QBNE 5/6
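The smoothing and the SNR-based selection can be sketched as below. The slides do not define q0(k), q1(k) and SNR1(k); the code simply applies the formula and its two limits as written, and the function names and the edge padding used for smoothing are assumptions.

```python
import numpy as np

def smooth_quantiles(q, width=9):
    """Moving average of quantile values over `width` frequency samples."""
    kernel = np.ones(width) / width
    padded = np.pad(q, width // 2, mode="edge")  # assumed edge handling
    return np.convolve(padded, kernel, mode="valid")

def dynamic_quantiles(q0, q1, snr, snr1):
    """q(k) = (q1 - q0) * SNR / SNR1 + q0, limited as stated on the slide."""
    q = (q1 - q0) * snr / snr1 + q0
    q = np.where(q < 0, q0, q)    # q0(k) if q(k) < 0
    q = np.where(q > q1, q1, q)   # q1(k) if q(k) > q1(k)
    return q

# Hypothetical values: q0 = 0.2, q1 = 0.6, SNR1 = 10 for five samples.
snr = np.array([0.0, 5.0, 10.0, 20.0, -10.0])
q_dyn = dynamic_quantiles(0.2, 0.6, snr, 10.0)
q_smooth = smooth_quantiles(np.full(20, 0.3))
```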

Page 18

Plot of SNR and frequency-dependent quantiles for three different applications of the vibrator (horizontal axis: frequency sample)

QBNE 6/6

Page 19

Enhancement results

Recorded and enhanced speech (α = 2, β = 0.001, γ = 1, N = 16 ms); speaker: SP; material: /a/, /i/, and /u/, using electrolarynx Servox

Figure panels: unprocessed and processed; segments labelled noise segment, /a/, /u/, /i/

Results 1/3

Page 20

Enhancement results

Recorded and enhanced speech (α = 2, β = 0.001, γ = 1, window length = 16 ms); speaker: SP; material: question-answer pair in English, “What is your name? My name is Santosh”, using electrolarynx Servox

Results 2/3

Page 21

Enhancement results

Recorded and enhanced speech (α = 2, β = 0.001, γ = 1); speaker: SP; material: question-answer pair in English, “What is your name? My name is Santosh”, using electrolarynx models NP-1, Servox, and Solatone

Results 3/3

Page 22

Conclusion

● QBNE technique implemented for continuous updating of the noise spectrum, & different methods for selection of quantile values for noise estimation investigated

● Results with QBNE during non-speech segments are comparable with results using ABNE

● Smoothed quantiles and SNR-based quantiles resulted in better-quality speech

● QBNE is effective for longer durations

● QBNE using SNR-based dynamic quantiles is effective during long pauses

Conclusion 1/2

Page 23

Ongoing work

● Evaluation of intelligibility and quality improvement

● Selection of optimum quantile values for different models of electrolarynx and users

● Phase resynthesis from magnitude spectrum using cepstral method

● Real-time implementation of noise reduction, using ADSP-BF533 board

● Analysis-synthesis for introducing small amount of jitter to improve naturalness

Conclusion 2/2
