Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming

Posted on:2002-10-05

Degree:Ph.D

Type:Dissertation

University:Northwestern University

Candidate:Drake, Laura Ann

Full Text:PDF

GTID:1468390011491116

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

In this work, techniques are developed and studied for the extraction of single-source acoustic signals out of multi-source mixtures. Such extracted signals can be used in a variety of applications including: automatic speech recognition, digital hearing aids, teleconferencing, and robot auditory systems. Most previous approaches fall into two categories: computational auditory scene analysis (CASA) and array signal processing.; The approach taken here is to combine these complementary techniques into an integrated one: CASA-enhanced beamforming. This integrated approach has the advantage of combining the array processing location attribute (direction of propagation through a sound-field) with the monaural CASA source attributes (fundamental frequency, on/offset, etc.). The motivation for the CASA-enhanced beamforming approach is the recognition that, by combining the statistically independent location and source attributes, more mixtures can be separated. A mixture that could not be separated by the location attribute alone (for example, if the single-source signals in the mixture have the same location attribute value) may be separated using source attributes, and vice versa.; An alternative to beamforming is binaural CASA. Beamforming is chosen for our integrated approach because it has the following advantages: (1) Since binaural CASA evolved to operate under the constraints of the human auditory system (with only two ears and spectral shaping due to the shape of the human body), it is not clear that it is an ideal method for a computer implementation. Beamforming is more flexible. It allows for any array geometry (number and arrangement of sensors). (2) The beamforming approach is mathematically derived based on a physical model of the acoustic wavefield. So, its processing effect is well-understood. (3) Beamforming operates via an analytic expression. So, its performance can be quantified (as a function of array geometry and the frequency content of the signals in the wavefield).; Experimental results show that CASA-enhanced beamforming extracts wideband signal estimates with higher signal-to-interference ratios (SIR) than monaural CASA, or beamforming alone. That is, it generates wideband signal estimates with the most interference rejection. Regarding intellibility, beamforming produces the lowest spectral distortion. However, CASA-enhanced beamforming's spectral distortion is shown to be comparable to monaural CASA's, and better than binaural CASA's.

Keywords/Search Tags:

CASA, Beamforming, Source, Auditory, Signals

PDF Full Text Request

Related items

1	Single-channel Speech Separation Based On Computational Auditory Scene Analysis
2	Research On Sound Source Locating Based On Humanoid Robot Auditory System
3	The Research Of Speech Segregation Based On Computational Auditory Scene Analysis And Microphone Array
4	Parallel Implementation Of Huwang Model Algorithm For Casa
5	The Blind Separation Of Monaural Speech Based On Computational Auditory Scene Analysis
6	Sound Source Localization Based On Binaural Auditory Time Delay Estimation
7	Study On DOA Estimation In The Presence Of Strong And Weak Signals
8	Separation Of Overlapping Speech Signals Based On Auditory Scene Analysis
9	Cochannel Speech Separation Based On Computional Auditory Scene Analysis
10	Study On Sound Source Localization Algorithm Based On Binaural Auditory And Naive Bayes Theory