Font Size: a A A

Research On Chinese Spelling Correction In Question And Answer System

Posted on:2013-05-19Degree:MasterType:Thesis
Country:ChinaCandidate:Y QinFull Text:PDF
GTID:2248330374983750Subject:Control Science and Engineering
Abstract/Summary:PDF Full Text Request
The Question and Answer system for the Internet has played a more and more important role as in the development of Web. For the number of search engine users are growing quickly, and together with the requirements for Question and Answer system are higher and higher, its functions continuously improved. The spelling checking and correcting function is a very important additional technology.In the Chinese search engine, the definition of spelling auto-checking and correcting function is that when a user input a question which is not in the QA database, it will return a similar question as the suggest.Finally the user will see the suggest question in the QA system page.In terms of the characteristics of Chinese language, an N-gram statistical language model has been set up. The model is also analyzed detailed to determine the necessary parameters. On this basis, the language model will be optimized and closer to real language. A pinyin correction and mapping table combine is introduced to the field of Chinese spelling correction in QA system. The language model decoding algorithm to optimize the correction results is firstly introduced.All the theoretical models proposed in this paper were all verified based on platform. On the statistical language model basis, it verified by three experiment. One is only has Pinyin correction, second is Pinyin and mapping table combine, last is on the second basis, use language model decoding algorithm to optimize the results.Finaly,it verified that the models can achieve good results by the last experiment,and the more contextual information,the higher error correction recall and precision.Finally, conclusions are given with recommendation for future work.
Keywords/Search Tags:Question and answer system, statistical language model, Spelling correction, Decoding Algorithm
PDF Full Text Request
Related items