Large-scale Content-based Multimedia Analysis and Applications Using Bag-of-words Model

Posted on:2013-11-22

Degree:Ph.D

Type:Dissertation

University:Ryerson University (Canada)

Candidate:Zhang, Ning

GTID:1458390008986932

Subject:Engineering

Abstract/Summary:

This dissertation focuses on the analysis of large-scale image and video collections, with applications to multimedia indexing and retrieval. The bag-of-words (BoW) model is adopted and improved to meet the efficiency and effectiveness requirements of analyzing large-scale multimedia data. The BoW method originated in the text retrieval domain and has been successfully applied in computer vision, for example in image scene and object categorization. Specifically, we use the BoW model for image classification and retrieval, and tackle the challenges of two large-scale multimedia applications: video analysis, and mobile-based social activity recommendation using visual intents.

Combining the BoW model with advanced retrieval algorithms, we propose a mobile-based visual search and social activity recommendation system. The strength of the BoW model in large-scale image retrieval is integrated with the flexible user interface provided by the mobile platform. Instead of text or voice input, the system takes images captured by the built-in camera and attempts to understand users’ intents through interaction. These intents are then recognized through a retrieval mechanism based on the BoW model. Finally, the visual results are mapped onto contextually relevant information and entities (i.e., local businesses) for social task suggestions. The system thus lets users search for information and make decisions on the go.

Incorporating the BoW model with unsupervised classification, we propose a scalable and generic approach to video analysis. The method aims to analyze unlabeled video systematically, through genre identification, frame classification, and event detection. Unlike conventional approaches that depend on domain knowledge, the BoW model is domain-knowledge independent. Moreover, the system is mainly unsupervised and requires minimal human input. Our method is therefore capable of processing massive quantities of video generically. For evaluation, sports video is used as the testing ground.
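
As a rough illustration of the general BoW retrieval idea described above, the sketch below quantizes local descriptors against a visual vocabulary, builds a normalized word histogram per image, and ranks database images by similarity to a query. It is a minimal sketch and not the dissertation's implementation: the abstract does not state the feature type, vocabulary size, or similarity measure, so the 2-D toy descriptors, the fixed three-word `CODEBOOK`, the cosine similarity, and all function names are assumptions made for illustration.

```python
import math
from collections import Counter


def nearest_word(descriptor, codebook):
    """Return the index of the codeword closest (squared Euclidean) to a descriptor."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((d - c) ** 2 for d, c in zip(descriptor, codebook[i])),
    )


def bow_histogram(descriptors, codebook):
    """Quantize descriptors to visual words and return a normalized histogram."""
    counts = Counter(nearest_word(d, codebook) for d in descriptors)
    total = len(descriptors)
    return [counts[i] / total if total else 0.0 for i in range(len(codebook))]


def cosine(u, v):
    """Cosine similarity between two histograms (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank(query_hist, database):
    """Return database image names sorted from most to least similar to the query."""
    return sorted(
        database,
        key=lambda name: cosine(query_hist, database[name]),
        reverse=True,
    )


# Hypothetical toy vocabulary; in practice it would be learned from many
# descriptors (for example by clustering) and would be far larger.
CODEBOOK = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]

# Hypothetical database of two images, each given as a list of 2-D descriptors.
database = {
    "img_a": bow_histogram([(0.1, 0.0), (0.9, 1.1), (1.0, 0.8)], CODEBOOK),
    "img_b": bow_histogram([(5.1, 4.9), (4.8, 5.2), (0.0, 0.1)], CODEBOOK),
}

query = bow_histogram([(1.1, 1.0), (0.2, 0.1), (0.9, 0.9)], CODEBOOK)
ranking = rank(query, database)
print(ranking)
```

Because each image is reduced to a fixed-length histogram regardless of how many descriptors it contains, comparing a query with a database entry costs the same for every image, which is one reason the representation suits large collections.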

Keywords/Search Tags:

Large-scale, Multimedia, Model, Video, Applications, Retrieval, Using, Image

Related items

1	Research On Fast Retrieval Of Large Scale Web Video Contents
2	Concept-based large-scale video database browsing and retrieval via visualization
3	Large Scale Video Retrieval And Feedback With Multi-level Content Representation
4	Research On Technology Of Content-Based Large-Scale Image Retrieval
5	Study On Overlay-based QoS Mechanism Of Internet Large-Scale Multimedia Applications
6	An information-theoretic framework towards large-scale video structuring, threading, and retrieval
7	Neural Networks Learning For Large Scale Image Retrieval And Classification Problems And Its Applications
8	Research On Key Techniques Of Content-Based Large-Scale Image Retrieval
9	Efficient Query Processing Over Large-Scale Multimedia Databases
10	The Research And Implementation Of Very Large Scale VOD Architecture And Service Strategy