Font Size: a A A

Research And Implementation Of Chinese Herbal Literature Service System Based On Probabilistic Topic Model

Posted on:2015-02-04Degree:MasterType:Thesis
Country:ChinaCandidate:C LingFull Text:PDF
GTID:2268330425486460Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Today’s Internet is in the knowledge era of big data explosion, large amounts of all kinds of data resources that produced everyday need to be effectively use and deep mining. Literature data is also an important resource for big data era. Literature data is important for researchers to generate new research results in academic research. Although there are many literature systems, but how to improve literature searching effiency using advanced technology, provide more intelligence and more knowledgeable literature analysis service system is also the topic of developing literature service system.As research background is actual project and target is meet the needs of Chinese herbal literature system in the research area of Chinese herbal medicine, the main task of this paper is research and develop a Chinese herbal literature system. The main work of this paper includes:1) investigate and research related technologies and methods, focuses on the technology of literature data crawling, compression and update method of index data and related algorithms of topic model;2) analysis the overall demand on the system, presents the overall system architecture and the bottom storage architecture;3) mainly describe detailed design scheme of key module, such as crawling module, pre-processing module, search module, similar literature calculation module, similar field scholars recommend module,trend analysis module, as well as the core algorithm used;4) complete the implementation and optimization of the entire system performance.The system has been run on the Internet, validate the effiency and feasibility of system.
Keywords/Search Tags:VSM, Topic Model, LDA, MCMC, GibbsSampling
PDF Full Text Request
Related items