Multimodal sentiment analysis has become a popular research direction in deep learning in recent years, playing a crucial role in applications such as smart healthcare, smart education, intelligent customer service, and social media analysis. Compared with unimodal sentiment analysis, which relies on a single modality to predict sentiment, multimodal sentiment analysis combines features from different modalities and exploits their complementarity to improve prediction accuracy. Some existing multimodal sentiment analysis methods consider only how to integrate modal features and do not explore the interactions between modalities, which greatly reduces model performance. Moreover, the video, audio, and text modalities each contain rich information on their own, yet most existing methods neglect this unimodal information, leading to unsatisfactory predictions. To address these issues, we design a novel multimodal sentiment analysis framework that learns intra- and inter-modal dynamics, using attention mechanisms for modality interaction and improved fusion strategies. Specifically, (1) we introduce a hierarchical cross-modal attention module to model inter-modal dynamics, with a bi-modal interaction layer and a tri-modal interaction layer that fuse multimodal features; (2) we design a modality reconstruction module with three modality reconstruction submodules to model intra-modal dynamics; and (3) to achieve more reliable predictions, we propose a decision-level fusion subnetwork that fuses the inference results produced independently by the two modules above. Comprehensive experiments and comparisons with existing state-of-the-art methods on the public CMU-MOSI, CMU-MOSEI, and CH-SIMS datasets demonstrate the effectiveness of our model. The proposed model significantly improves the classification performance of multimodal sentiment analysis, achieving an accuracy of 87.61% on CMU-MOSEI in particular. These results indicate that the model has significant potential for further exploration and development in multimodal sentiment analysis.
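To make the hierarchical cross-modal attention design concrete, the sketch below shows one plausible reading of the bi-modal interaction layer as symmetric cross-attention between two modality sequences in PyTorch. All names and hyperparameters here (`BiModalInteraction`, `d_model`, the mean-pooling fusion) are illustrative assumptions, not the paper's actual implementation, whose exact layer definitions the abstract does not specify.

```python
# Illustrative sketch only: a plausible bi-modal cross-attention
# interaction layer of the kind the abstract describes. Names and
# hyperparameters are assumptions, not the paper's implementation.
import torch
import torch.nn as nn

class BiModalInteraction(nn.Module):
    """Fuses two modality sequences with symmetric cross-attention."""
    def __init__(self, d_model: int = 128, n_heads: int = 4):
        super().__init__()
        # Queries come from one modality; keys/values from the other.
        self.attn_a2b = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.attn_b2a = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        # a: (batch, len_a, d_model), b: (batch, len_b, d_model)
        a_enriched, _ = self.attn_a2b(a, b, b)  # modality a attends to b
        b_enriched, _ = self.attn_b2a(b, a, a)  # modality b attends to a
        # Pool over time and combine the two enriched views.
        fused = a_enriched.mean(dim=1) + b_enriched.mean(dim=1)
        return self.norm(fused)

# Usage: fuse text-audio, text-video, and audio-video pairs, then pass
# the three bi-modal vectors to a tri-modal interaction layer.
if __name__ == "__main__":
    text = torch.randn(8, 50, 128)    # (batch, seq_len, d_model)
    audio = torch.randn(8, 200, 128)  # sequence lengths may differ
    fused_ta = BiModalInteraction()(text, audio)
    print(fused_ta.shape)  # torch.Size([8, 128])
```

Under this reading, the tri-modal interaction layer would combine the three pairwise outputs into a single representation, while the modality reconstruction submodules and decision-level fusion subnetwork operate on the unimodal features and module-level predictions, respectively.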