Font Size: a A A

Research On Graph Model-based Short Text Classification Algorithm

Posted on:2022-08-03Degree:MasterType:Thesis
Country:ChinaCandidate:J J TanFull Text:PDF
GTID:2518306524493804Subject:Master of Engineering
Abstract/Summary:PDF Full Text Request
In real life,a large number of short text data are constantly generated,such as news text,comments on some aspects published by users on Meituan,Dianping and other websites.The generation of text data is inevitably accompanied by the classification.How to improve the classification efficiency and reduce the labor cost is the research direction of text classification.In addition,mining users' emotions from these massive data helps to accurately depict users,thus assisting the platform to provide targeted services.However,most of the current methods ignore the relationship between words or between aspect words and context,which leads to poor classification performance.This thesis mainly studies the text classification algorithm based on graph model,including whole-text classification and aspect-level sentiment analysis task.In order to improve the performance of the model,graph model combined with attention mechanism is used to mine the relationship between words,aspect words and text context.The main points of this thesis are summarized as follows:1)An algorithm for whole text classification is proposed.This method constructs a graph for each text with words as nodes and the relationship between words as edges.At the same time a hyper-node connected to all word nodes is established to represent the overall information of the text.Then a graph convolutional neural network with an attention mechanism is used to learn the vector representations of hypernode and word nodes,and finally the information of the two vectors is merged to improve the accuracy of text classification.2)An aspect-level sentiment analysis algorithm is proposed.First,this method converts the text into a graph,and then constructs a hyper-node that connects all the words in the aspect words.Learning word vector representation and hyper-node vector representation through graph convolutional neural network with attention mechanism and gating mechanism.Finally,the hyper-node vector is used as the aspect words vector to realize the classification task with the text word vectors.In addition,the BERT pre-training model is combined to further improve the classification performanceIn general,the methods proposed in this thesis are based on graph model,using graph models to mine the relationship between text words and learn better word vector represen-tations.In addition,the attention mechanism and gating mechanism are used to control the transfer process of node information in the graph model.Under the verification of a large number of data sets,the method proposed in this thesis shows good performance and can be used for short text data.
Keywords/Search Tags:Text Classification, Graph Model, Vector Representation, Aspect Words, Attention Mechanism
PDF Full Text Request
Related items