Content Based Video Retrieval

Posted on:2013-03-31

Degree:Master

Type:Thesis

Country:China

Candidate:Y H Mao

Full Text:PDF

GTID:2248330371966581

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

With the development of Internet and computer technology, a large number of video data is produced everyday. So how to get the interested video clips from the massive video dataset quickly and efficiently has become an urgent problem in the field of information retrieval (IR).In this paper, Content-based Video Retrieval (CBVR) is studied in four levels:low-level visual feature extraction, Semantic Indexing (SIN), Instance Search (INS), and Known-item Search (KIS). The main contents are illustrated as follows:For the problem of low-level visual feature extraction, some existing low-level feature extraction algorithms are analyzed. The Bag-of-Visual-Words algorithm is proposed to quantify the high-dimensional SIFT features. To evaluate the performance of the feature, an experiment is designed on TRECVID IACC training dataset. The result shows that this feature is effective.For the problem of Semantic Indexing (SIN), a novel framework for video semantic concept detection is proposed. A total of eight kinds of low-level visual features about color, shape, texture, and key points are extracted from each video key frame. A support vector machine (SVM) classifier is trained for each feature. The semantic detection result is obtained by decision level weighted fusion of SVM classifiers. In Semantic Indexing task of TRECVID 2010, the evaluation result of our system is higher than the average performance of all the participants.For the problem of Instance Search (INS), we propose a novel framework with three kinds of video retrieval methods which based on face recognition, color information of clothes, and the global image. In TRECVID 2010 Instance Search task evaluation, the performance of our framework ranked the second place, and is much better than the average of all participantsâ€™ results.For the problem of automatic Known-item Search (KIS), the textual query is pre-processed and then searched in video metadata and video data respectively. Text-based Retrieval (TBR) method is used in video metadata and voice script data transformed from voice in video by Automatic Speech Recognition (ASR) technology. Content-based Retrieval (CBR) method is used in visual and auditory information of video. Re-ranking the original result list of Text-based Retrieval method by the result of Content-based Retrieval method, the final result of our automatic KIS system is acquired. In Know-item Search evaluation of TRECVID 2010, the result of our framework ranked the second place among all participantsâ€™results, which shows our algorithm is effective.

Keywords/Search Tags:

low-level visual feature, Semantic Indexing, Instance Search, Known-item Search, TRECVID

PDF Full Text Request

Related items

1	Video Retrieval And Semantic Information Extraction
2	Semantic And Instance Based Video Retrieval
3	Towards Instance Level Feature Representation For Instance Search
4	Semantic Search On Large Scale RDF Data
5	A Study On Models And Methods Of Visual Search Engine
6	High-Level Video Semantic Concept Detection Based On Multiple Features
7	Instance Search Based On Visual Content
8	Research On Distributed Semantic Search For Resources In Distributed Network
9	Key Technology Of Instance Search In Image And Video
10	Research On Search Results Clustering And Label Extraction