Reordering Of Question Answering System Based On Deep Learning And Gradient Lifting Tree Algorithm

Posted on:2021-01-02

Degree:Master

Type:Thesis

Country:China

Candidate:C X Zhao

Full Text:PDF

GTID:2428330614970990

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

In today's life and work,the Internet,with its high availability and convenient accessibility,as well as its huge amount of data sources,has taken human to an era of information explosion.Getting the necessary information accurately and quickly from huge information pool becomes more and more important.Q&A system is created based on this demand.Q&A system is widely used in practical fields such as intelligent customer service,shopping recommendation,knowledge quiz,and seeking medical advice etc.When the user enters his or her own question or a keyword,the system will return a piece of knowledge or a sequence of urls or texts.However,many Q&A systems has a problem in results sorting,leading to unsatisfactory sorting of answers,which further causes mismatch with the needs of users,and reducing the user experience and affecting the efficiency of acquiring knowledge from the Internet eventually.The reason why the sorting efficiency is not high enough may be that the feature is not accurate enough,or the feature is not comprehensive enough or the feature weight setting is not reasonable enough.In order to optimize the sorting of Q&A system,in this paper,the characteristics of Q&A system are diversified and studied combing traditional feature engineering and deep learning semantic similarity calculation,aiming to find the intrinsic relationship between the Question and Answer by combining statistics and semantics.After removing the stop words from the questions and answers,then the features are extracted,the traditional feature engineering is established,the features are normalized,the features to eliminate the magnitude effects between the features are filtered and the redundancy features and noise are reduced.Then the text is mapped to a vector,and the deep learning method is used to calculate the semantic similarity between the question and the answer.Then the author combines the two parts as input,and calculate the gain of the feature through the gradient lifting tree learning method to achieve the final optimized sorting result.In this paper the efforts are made to use a variety of short text semantic calculation methods,including traditional methods like word frequency statistics,editing distance,and language models such as word2 vec and bert,in order to improve the accuracy of similarity calculation using a hybrid multi-feature hybrid strategy.Taking into account the recall time of the streamlined Q&A system,the question was identified by an entity and the answers containing the question entity were weighted.The refined sorting results on the public dataset are verified and final sorting effect is proved to be improved.

Keywords/Search Tags:

Q&A system, learn to rank, feature processing, text similarity, deep learning, gradient boosting tree

PDF Full Text Request

Related items

1	Optimization And Parallelization Of The GBRT Algorithm For Learning To Rank
2	Research On Question Answering System Based On Learning To Rank
3	Research On Recommendation Algorithm Based On Gradient Boosting Decision Tree And Deep Belief Network
4	Intrusion Detection Model Optimization Scheme For Gradient Boosting Trees In Edge Computing
5	Research On High Risk Information Processing Module Of Internet Public Opinion Based On Natural Language Processing
6	The Research Of Algorithm Of Face Detection And Recognition Based On Video
7	Research On Context-aware Recommendation Method Based On Gradient Boosting Decision Tree
8	Research On Code Plagiarism Detection Model Based On Random Forest And Gradient Boosting Decision Tree
9	Analysis And Design Of Short Text Similarity Based On Deep Learning
10	Forestnet: A Learning Architecture Combining Deep Networks And Decision Forest