The Fusion Algorithm Research Of Video And Audio Information

Posted on:2012-10-09

Degree:Master

Type:Thesis

Country:China

Candidate:J Wang

Full Text:PDF

GTID:2178330332990709

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

In recent years, with the development of the computer's informationization, more and more video equipments as well as the technology applied in people's daily life. Video conference, video search engine technology as well as video data inquiry technology and so on.which were including the massive non-text datum from the source of television, the meet in notes, the scientific literature and so on. To the one, the popularity of personal photographic equipment, and the improvement of Internet technology, so that ordinary people publish personal video recording has become extremely simple, and have consequently a large amount of video data. It is a severe test for the existing video processing technology that how to deal with so many multi-media information, how to organize data and make the index for it to retrieve.Early multimedia information retrieval algorithm has deviated from the original purpose of easy operations, future retrieval algorithms need to integrate low-level feature which contains more representative feature of the vision, hearing, semantic.The multi-modal property of video information provide a basis for information fusion. Video The most existing analysis tools for the single modal of video features, but the video is the special data which has multi-modal nature and there's a relevance about the multi-modal of the video features when it describes the same topics. Therefore need a kind of effective method to fusion video features for more accurately classify and retrieval. The major work at the process of video feature and the fusion process of video features are as follows: The current model of video data processing limited the application of news, advertising and other specific areas, and the technology used in processing is single and old. This article uses a video processing technology which is relative efficiency to define a relatively complete model of video retrieval pretreatment. In this model, used the multi-modal nature of the video low-level feature to extract the video time structure, to feature extraction for the content, and constructed the subset of video data from the original video. Based on this process to extract the key frames of video and extract audio features form video's audio streams. For simplify the calculations, to reduce dimensions for the low-level feature. We used the latest research of dimensionality reduction algorithm named the marginal fisher analysis algorithm in this paper. This method is superior to the commonly used PCA, LDA and other dimension reduction algorithms. According to obtained a variety of feature vectors, using robust classifier-support vector machine (SVM) to classify.To fusion the result of classification, we propose a fusion algorithm to improve the MGR fusion algorithm. Based on the output matrix of the sample number processed by feature vector which input classification and the integration framework of Melnik and others design, and to optimize preference and confidence, designed a fusion score function to improve MGR algorithm. The improved algorithm than MGR algorithm to reduce the computational cost, reduces the number of parameters and in the recognition rate also has some improvements.

Keywords/Search Tags:

media information, fusion analysis, cross-modal

PDF Full Text Request

Related items

1	Analysis Of Multi-Modal Social Media Based On Graph Model
2	Research On The Relevance Of Multi-modal Data In Cross-media Retrieval
3	The Research Of Cross-modal Sentiment Analysis For Images On Social Media
4	Cross-modal Multimedia Information Retrieval
5	Research On Information Retrieval Based On Cross-modal Association Analysis
6	Research Of The Rotor’s Movement Form Based On The Modal Analysis And The Information Fusion Technology
7	Research On Algorithm Of Deep Convolution Network And Feature Fusion For Cross Modal Commodity Retrieval
8	Research On DNN-based Cross-Modality Media Analysis
9	Cross-modal Multimedia Information Retrieval With CCA And Adaboost
10	Research On Text And Image Information Fusion In Cross-media Retrieval