Research On Broadcast News Audio Structure Analysis

Posted on:2010-06-26

Degree:Master

Type:Thesis

Country:China

Candidate:R J Zhang

Full Text:PDF

GTID:2198330332978440

Subject:Signal and Information Processing

Abstract/Summary:

Broadcast news audio structure means segmenting broadcast news audio into independent and stable audio units according to their content and analyzing audio data in them to obtain their corresponding audio scenes. Broadcast news audio structure is not only helpful for in-depth broadcast news audio information processing and analysis but also helpful to assist content-based broadcast news video analysis and retrieval. This paper mainly focuses on technologies related to broadcast news audio structure analysis, including audio feature extraction, audio segmentation and audio classificatioin. We have got the following three aspects of contributions:Firstly, in audio feature extraction aspect, UBM(Universal Background Model)-based MFCC segmental feature and two chroma-based segmental features: Intra-Frame standard variance mean and Inter-Frame standard variance mean are proposed, experimental results validates the effectiveness of our new features. Besides, Orthogonal experiment design method is introduced to choose specific features for different program recognition tasks.Secondly, in audio segmentation aspect, Believable Degree based audio segmentation algorithm is presented. Audio stream is searched using a sequential fixed-size analysis window in order to avoid accumulative errors. In the analysis window, to avoid detection errors due to threshold setting and hard threshold judging, the Believable Degree value for every hypothesized change point is computed and the true change point is detected according to the change trend of Believable Degree value. Experimental results demonstrate that our algorithm is superior to KL2-based algorithm, HMM-based algorithm, BIC-based algorithm and Entropy-based algorithm.Thirdly, in audio classification aspect, the paper puts forward SVM-GMM which combines advantages of both SVM and GMM. Considering characteristics of broadcast news audio, the"coarse to refine"strategy is employed which first coarsely classifies broadcast news audio into speech, music, speech over music and background using SVM, then recognizes the concrete program types for each audio class using GMM. Experimental results demonstrate that our algorithm outperforms cascade algorithm, NFL(Nearest Feature Line) algorithm, HMM algorithm and SVM algorithm.

Keywords/Search Tags:

Audio Segmentation and Classification, Universal Background Model, Chroma, Orthogonal Experiment Design, A Sequential Fixed-Size Window, Believable Degree, SVM-GMM, Coarse to Refine

Related items

1	Audio Architecture Technology Research
2	Audio Classification And Segmentation Method Based On Beats And Key Background Models
3	Research Of Audio Classification And Segmentation
4	Researching Audio Segmentation And Classification Methods Of The Broadcasting Repertoire
5	Calibration Of Spaceborne Microwave Radiometer With Fixed-beam Pointing
6	Based On The Pro Tools Of Digital Audio Workstation Video Website Design And Implementation
7	Research On Acoustic Feature Analysis In Audio Retrieval
8	Detection Of Audio Events With Scene Dependence
9	Research On The Segmentation And Classification For Broadcast Audio
10	Research On Universal Background Model And Preliminary Study On Deep Learning In Speaker Recognition