Research And Implementation Of Multimodal Information Fusion Annotation System For Image-Text Mixed Data

Posted on:2021-03-09

Degree:Master

Type:Thesis

Country:China

Candidate:N Li

Full Text:PDF

GTID:2518306308467154

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

In the era of digital intelligent network,a large number of cultural digital resources are gathered,and new means and methods are urgently needed to organize and manage cultural resources effectively and rationally.At present,a large number of professional image-text mixed data have been accumulated in the field of culture,which is characterized by the mutual association of image and text,which is of great significance to the automatic annotation of image.This paper focuses on the humanities and arts books in the field of culture,and uses decorative images as the research carrier to implement the digital image-text processing and multimodal image annotation of image-text mixed data.The main contents of this paper include:(1)Aiming at the demand of multi-modal data for cultural big data,an adaptive image-text separation algorithm based on layout analysis is proposed.Using the professional image-text mixed books in the cultural field as the data source,the image-text information pairs consisting of images,titles and descriptive text are structured to form a multimodal data specimen database.(2)A new word discovery algorithm for domain-oriented lexicon construction is proposed.The algorithm improves the new word discovery algorithm based on information entropy and mutual information by combining a variety of statistical features with the text of professional books in the field of culture,completes the expansion and update of the domain lexicon,improves the low segmentation accuracy of the general word segmentation tool for professional words in the field of culture,and lays the foundation for the image annotation algorithm.(3)A multi-mode information fusion algorithm for image annotation is proposed based on the image-text separation algorithm,new word discovery algorithm and domain lexicon.The algorithm adopts PageRank-based multi-mode information decision fusion to merge the annotation information of both image and text modes,which ensures the richness and reliability of image annotation words to some extent.(4)Develop a annotation system of humanities and art books,and integrates the algorithm of image-text separation,new word discovery and multimodal information fusion image annotation proposed in this paper,which has certain practical value.This paper proposes a set of digital image-text processing methods for organizing cultural resources reasonably around the multimodal image-text datasets,and verifies the effectiveness of the proposed multimodal information fusion annotation method for image-text mixed data.

Keywords/Search Tags:

new word discovery, multimodal image annotation, digital image-text processing, information Fusion

PDF Full Text Request

Related items

1	Research On Web Image Retrieval Based On Web Information And Image Feature
2	Research On Essay-level Image-text Question Answering
3	Research On Key Issues Of Image Classification And Annotation By Fusing Text Information
4	MITK Based Multimodal Molecular Imaging Fusion Software Design And Implementation
5	The Research On Image Annotation Applied In Image Retrieval
6	Study On Large-Scale Dataset And Multimodal Image Fusion Methods In Face Recognition
7	BoVW Model Based Research On Image Annotation
8	Research On Web Image Annotation Based On Context Information
9	Simultaneous image classification and annotation via fusing multimodal heterogeneous image features
10	Study On Image Annotation Based On Web Training Data