At a cocktail party, listeners can selectively attend to a single voice and filter out other acoustic interference. Simulating this perceptual ability remains a major challenge. This paper presents a new approach to segregating overlapping speech based on sound-localization cues. We first divide the speech stream into time-frequency regions and calculate the interaural time difference (ITD) and interaural intensity difference (IID) of each region. We then introduce the notion of a time-frequency binary mask, which selects a region when the target is stronger than the interference within it. Finally, we regroup the selected time-frequency regions to resynthesize the target speech. The results obtained indicate that the approach described here is effective.
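To make the pipeline concrete, the following Python sketch implements one plausible reading of the three steps: a time-frequency decomposition, per-unit ITD/IID estimation, and a binary mask that keeps units whose cues match the target better than the interference. The STFT front end, the function name `binary_mask_segregation`, the assumption that the two sources' ITD/IID values are known in advance, and the cue-distance weighting are all assumptions of this sketch, not details taken from the paper (which may well use an auditory filterbank such as a gammatone front end and estimate the cues per region).

```python
import numpy as np
from scipy.signal import stft, istft

def binary_mask_segregation(left, right, fs, target_itd, interf_itd,
                            target_iid=0.0, interf_iid=0.0, nperseg=512):
    """Segregate the target from a two-source stereo mixture using a
    binary time-frequency mask driven by ITD/IID localization cues.

    target_itd / interf_itd: known (or pre-estimated) interaural time
    differences of the two sources, in seconds -- an assumption of this
    sketch. target_iid / interf_iid are the corresponding IIDs in dB.
    """
    f, t, L = stft(left, fs=fs, nperseg=nperseg)
    _, _, R = stft(right, fs=fs, nperseg=nperseg)

    # Interaural phase difference -> ITD estimate per T-F unit
    # (guard against division by zero at DC).
    ipd = np.angle(L * np.conj(R))
    freqs = np.where(f > 0, f, np.inf)[:, None]
    itd = ipd / (2 * np.pi * freqs)

    # Interaural intensity difference (dB) per T-F unit.
    eps = 1e-12
    iid = 20 * np.log10((np.abs(L) + eps) / (np.abs(R) + eps))

    # Assign each unit to whichever source's cues it matches better;
    # the binary mask keeps units where the target dominates.
    # The 0.01 weight on IID is an arbitrary choice for this sketch.
    d_target = (itd - target_itd) ** 2 + 0.01 * (iid - target_iid) ** 2
    d_interf = (itd - interf_itd) ** 2 + 0.01 * (iid - interf_iid) ** 2
    mask = (d_target < d_interf).astype(float)

    # Resynthesize by regrouping the selected time-frequency regions.
    _, target = istft(mask * L, fs=fs, nperseg=nperseg)
    return target
```

One design note on the cue combination: the phase-derived ITD becomes ambiguous at higher frequencies, where a phase difference can correspond to several candidate delays, which is one reason systems of this kind also weigh in the IID cue.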