team sound assistimages.shoutwiki.com/mindworks/a/ae/sound_assist... · design overview sound...

Post on 24-Aug-2020

4 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Team Sound Assist

Adam Seubert, Alex Inskeep

Client: Dr. James FrenzelMentor: Dr. Feng Li

Members: Adam and AlexClient: James Frenzel

Mentor: Feng Li

Problem Statement

● Speech in noisy media can be difficult for the hearing impaired

● Action sounds can make it difficult to understand characters

2https://coub.com/view/f166n16

Project goals

● To create an assistive listening device to increase the intelligibility of human speech in common

media, such as movies and television

● The device must increase the intelligibility of human speech

● The device must easily integrate with a home theater system

3

Design Overview

Sound Assist DeviceAnalog Audio Stream Vocal Boosted Analog Audio Stream

4

Technical Approach

DUET

● Blind Source Separation Algorithm

● Isolate and amplify the human voice

Filtering

● Separately Increase the volume of the human voice frequency range

● Remix with scalar values to achieve greater intelligibility

5

Hardware Design Overview

ADCInput Signal

RaspberryPI

DAC Output to Headphones

6

RCA Jacks

RCA Jacks

Sound Card

7

Device Overview

https://en.wikipedia.org/wiki/Raspberry_Pihttp://www.audioinjector.net/rpi-hat

8

Use Case

Software Design Overview

9

PyAudio PyAudioRCA Sampling

DUET

RCA Out

Filtering

PyAudio

● Python package to help facilitate audio recording and playing

● Using a callback method to reduce sound lag

● Samples small amount of audio, processes, and plays back

10

Filtering

● Apply a 3rd order bandpass filter to a copy of the

original sample

● Cut off frequencies are 1kHz and 5kHz

● Mix filtered signal and original with linear scalars

to increase vocal intelligibility

11S. Makino et al. (eds.), Blind Speech Separation, 217–241. © 2007 Springer

Duet

● Using only 2 inputs, it can separate an arbitrary

number of sources so long as the phase-frequency

representations of each source do not overlap

● Creates a 3D weighted histogram of the

phase-frequency data and the peaks of the histogram

are the separable sources

12S. Makino et al. (eds.), Blind Speech Separation, 217–241. © 2007 Springer

End Product

● Plug and Play experience

○ Scripts start all programs on powerup

○ No additional connections or UI needed

● Compact and powerful solution with future options

14

Future Work options

● Portability

● Wireless (Bluetooth, wifi)

● Real time DUET

● Customizable user profile

● User controlled knob for isolated vocal gain

15

Got Questions?

16

top related