Multi-video Summarization Based On Graph Model

Posted on:2019-11-21

Degree:Master

Type:Thesis

Country:China

Candidate:Y Y Zhang

Full Text:PDF

GTID:2428330623962530

Subject:Electronics and Communications Engineering

Abstract/Summary:

With the tremendous growth of web videos,it is more complicated for users to capture accurate information quickly and manage the video content of their interest,so that the intensity of work is getting bigger.Video Summarization is a good solution to solve this problem and it can perceive important content of videos quickly.Traditional video summarization methods based graph usually consider video frames as vertices to construct simple graph models.Although it has better performance,it can not represent the complex relationship of video frames.Since the content of multi-video data sets are diversity,redundancy and have large noise,complex graph models are needed to better characterize video content.To solve this problem,we apply hypergraph to describe complex relation-ships between video frames.In addition,multi-modal graph is also used to model the relationship between video frames with the help of tag information of video and complementary information of web images.These two graph models are applied to improve Multi-Video Summarization(MVS)algorithm.To well address the challenges in MVS,we firstly propose a novel framework with Hypergraph Dominant Set(HDS),termed MVS-HDS.Specifically,we first formulate the MVS task into a problem of finding dominant sets in a hypergraph to select the candidate frames.Then,with the help of web images searched by the same query,a Query Dependent Maximum Marginal Relevance(QD-MMR)method is developed to refine these candidate frames by balancing the criteria of redundancy and query adaptation.In order to make the summarization user-friendly and easy to understand,we also propose a Graph-based Topical Closeness(GTC)method to make it more comprehensible.In addition,we propose a Multi-modal Weighted Archetypal Analysis(MVS-MWAA)method to extract a concise summarization which is both representative and informative.In order to overcome the redundancy,diversity and other issues of multi-video data sets,we establish a multi-modal graph to guide the generation of the weight in WAA,which we call query-dependency weight.Specifically,the multi-modal graph fuses the complementary property of video frames,tags,and query-dependency web images.The candidate frames obtained by shot detection are divided into different archetype sets and the summarization is selected according to the importance scores of the archetype sets and the video frames in the archetype sets.Experiments for both approaches are carried out on two multi-video data sets of MVS1 K and TVSum.The experiments results demonstrate that our proposed methods clearly outperform the state-of-art approaches,which verify the effectiveness and superiority of both proposed algorithms.

Keywords/Search Tags:

Multi-Video Summarization, Hypergraph model, Dominant set, Archetypal analysis, Multi-modal, Key frame extraction

Related items

1	Research On Video Summarization Technology Based On Content Analysis
2	Video Summarization Based On Hypergraph Ranking
3	Research And Application Of Classification Algorithm Based On Multi-modal Hypergraph
4	Research On Simultaneous Text Summarization And Keyword Extraction Based On Hypergraph
5	Analysis Of Multi-Modal Social Media Based On Graph Model
6	Research On Static Video Summarization Technology Based On Key Frame Extraction
7	Research On Compressed Domain Video Summarization Technology Based On HEVC
8	Hypergraph Neural Networks And Applications In 3D Object Recognition
9	Research On Video Summarization Based On Frame Score
10	Research On Multi-scale Multi-modal Source Code Summarization Technology Based On Program Feature Enhancement