Research On Short Text Semantic Similarity Computation

Posted on:2017-11-03

Degree:Master

Type:Thesis

Country:China

Candidate:K Li

GTID:2348330518470924

Subject:Engineering

Abstract/Summary:

Semantic similarity in text mining has received close attention in both academia and industry. It has been widely studied in information retrieval, automatic question answering, text classification, natural language processing, machine learning, and related fields. Short text semantic similarity calculation computes the semantic similarity between two short texts. Researchers have proposed a variety of similarity measures for this problem, mainly measures based on word co-occurrence, measures based on grammatical structure, and feature measures based on semantics. Methods based on word co-occurrence do not work well on short texts because the length of a short text is limited. Methods based on syntactic structure assign weights to different sentence constituents through syntactic analysis and then extract grammatical information from the text. Semantic feature-based methods use background knowledge to learn the semantic information of words and are well suited to computing similarity between synonyms; however, they lack a consistent representation framework for non-synonymous words and for words that serve as different sentence constituents. To address these issues, and based on the properties of short text, this thesis constructs a multi-level structure and proposes a multi-level feature fusion method that obtains more complete information from the text and thereby improves the accuracy of short text semantic similarity calculation. First, the model combines 6 different kinds of text similarity features: lexical features, corpus-based features, grammatical features, syntactic features, diversified combination features, and other features. Second, dimensionality reduction is applied to these features to reduce redundancy and noise in the text. Third, an ensemble learning model, the boosting algorithm, is studied and used to train a multi-class classification model and improve its generalization. Finally, the proposed multi-level feature fusion method and its short text semantic similarity results are validated by comparison with existing methods. The experimental results show that the proposed multi-level feature fusion method effectively improves the accuracy of semantic similarity calculation for short texts.
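
The sketch below is a minimal illustration, not the thesis's implementation. It shows why a word co-occurrence measure alone can fail on short texts, and how several simple measures might be collected into one feature vector before any later reduction or classification step. The function names (`tokens`, `jaccard`, `char_ngrams`, `feature_vector`) and the three chosen features are assumptions made for this example; the abstract does not specify the actual features used.

```python
import re


def tokens(text):
    """Lowercase alphanumeric word tokens (a deliberately simple tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def jaccard(a, b):
    """Jaccard overlap of two collections, treated as sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def char_ngrams(text, n=3):
    """Character n-grams over the normalized, space-joined tokens."""
    s = " ".join(tokens(text))
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def feature_vector(s1, s2):
    """Hypothetical fused features: word overlap, character overlap, length ratio."""
    t1, t2 = tokens(s1), tokens(s2)
    word_overlap = jaccard(t1, t2)
    char_overlap = jaccard(char_ngrams(s1), char_ngrams(s2))
    length_ratio = min(len(t1), len(t2)) / max(len(t1), len(t2), 1)
    return [word_overlap, char_overlap, length_ratio]


# Two paraphrases that share no words: the co-occurrence feature is 0,
# while the other features still carry some signal.
print(feature_vector("How old are you?", "What is your age?"))
```

In this example the two questions mean the same thing yet have no word in common, so a measure based only on co-occurrence scores them as unrelated. A vector of this kind could then be passed to a dimensionality reduction step and a boosting classifier, as the abstract outlines.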

Keywords/Search Tags:

short text, semantic, similarity, feature fusion, ensemble learning

Related items

1	Research On Calculation Of Semantic Similarity Of Short Text Based On Feature Fusion
2	The Study Of Measures And Applications Of Short Text Semantic Similarity
3	Research And Application Of Short Text Semantic Similarity Model Based On Deep Learning
4	Analysis And Design Of Short Text Similarity Based On Deep Learning
5	The Research And Application Of Unsupervised And Supervised Short Text Similarity Measure
6	Research On The Method Of Semantic Similarity Calculation Of Short Texts Based On HowNet
7	Research On Short Text Classification Based On Ensemble Learning
8	Short Text Classification Method Based On Ensemble Learning
9	Research On Semantic Similarity Between Words And Between Short Texts Based On WordNet
10	A Short Texts Matching Method Using Multi-level Features