Research And Implementation Of Shopping Website System Based On Extended Chinese Word Segmentation Method

Posted on: 2019-08-12
Degree: Master
Type: Thesis
Country: China
Candidate: T Zhao
Full Text: PDF
GTID: 2428330545964760
Subject: Computer technology
Abstract/Summary:
With the advent of the Internet era, online shopping has gradually become part of everyday life. Among the many B2C shopping sites, Tmall and Jingdong account for the majority of online sales, and, driven by profit, more shopping websites continue to emerge. The search system is the main module of a shopping website and is indispensable to online shopping: users can complete the shopping process quickly by searching for goods. Chinese word segmentation is the foundation of the search system. Although current Chinese word segmentation technology is increasingly mature, two difficulties remain unresolved: ambiguity recognition and new word detection. To address them, this thesis studies a Chinese word segmentation method based on an extended lexicon and proposes a method for creating that lexicon. The method relies on the fact that the Solr Chinese word segmentation system allows users to add an extended lexicon. It improves the feature selection of the CRF algorithm, then corrects the CRF results using bidirectional maximum matching combined with a temporary lexicon of high-frequency words, and finally creates the user's extended lexicon. Experiments measure segmentation accuracy for the IK Analyzer Chinese word segmenter with and without the extended lexicon. The results show that the method alleviates, to some extent, both the ambiguity problem and the new word discovery problem, and improves the accuracy of Chinese word segmentation. Applied in the search module of a shopping website, it gives users a better searching and shopping experience.

To build a shopping website with practical value, this thesis studies the composition of shopping websites and carries out functional and non-functional requirement analysis for the shopping website system. The system comprises the back-end management module, the front-end interface module, the search module, the order module, the single sign-on module, and the member module. In the back-end interface, the system manages commodity information, web page information, and similar content. In the front-end interface, users can browse, register, log in, search, and complete orders. The system uses three Linux servers: an image processing server, a Redis server, and a Solr server. Goods and user data are stored in a MySQL database. The extended lexicon data created by the segmentation method is stored in the mydic.dic file on the Linux system. Finally, the system was tested comprehensively. The tests show that the system implements all the functions identified in the requirement analysis and runs stably and effectively.
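The bidirectional maximum matching step mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the function names, the sample lexicons, and the tie-breaking rule (prefer fewer tokens, then fewer single-character tokens, then the backward result) are assumptions based on a commonly used heuristic. The CRF stage and the high-frequency temporary lexicon are not modelled here.

```python
def forward_max_match(text, lexicon, max_len):
    """Greedy left-to-right segmentation, longest lexicon entry first."""
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback token.
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


def backward_max_match(text, lexicon, max_len):
    """Greedy right-to-left segmentation, longest lexicon entry first."""
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()
    return tokens


def bidirectional_max_match(text, lexicon):
    """Run both directions and keep the result the heuristic prefers."""
    if not text:
        return []
    max_len = max((len(word) for word in lexicon), default=1)
    fwd = forward_max_match(text, lexicon, max_len)
    bwd = backward_max_match(text, lexicon, max_len)
    # Assumed heuristic: fewer tokens wins.
    if len(fwd) != len(bwd):
        return fwd if len(fwd) < len(bwd) else bwd

    def single_chars(tokens):
        return sum(1 for token in tokens if len(token) == 1)

    # Then fewer single-character tokens wins; ties go to backward.
    return fwd if single_chars(fwd) < single_chars(bwd) else bwd


# Hypothetical lexicons for illustration only.
base_lexicon = {"研究", "研究生", "生命", "的", "起源", "耳机"}
extended_lexicon = {"蓝牙"}  # stands in for user entries such as those in mydic.dic

ambiguous = bidirectional_max_match("研究生命的起源", base_lexicon)
without_ext = bidirectional_max_match("蓝牙耳机", base_lexicon)
with_ext = bidirectional_max_match("蓝牙耳机", base_lexicon | extended_lexicon)
```

In this sketch, forward matching on the first sentence greedily takes "研究生" and leaves "命" stranded, while backward matching yields "研究 / 生命 / 的 / 起源", which the heuristic selects. The second example shows the role of an extended lexicon: without the entry "蓝牙" the word is split into single characters, and with it the segmenter returns "蓝牙 / 耳机".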
Keywords/Search Tags:Shopping Website, Chinese Word Segmentation, Extended Thesaurus, CRF Algorithm, SOLR