Design And Implementation Of Classification And Aggregation System Of Scientific And Technological Information

Posted on: 2019-03-27
Degree: Master
Type: Thesis
Country: China
Candidate: D K Chen
GTID: 2348330545458408
Subject: Computer technology
Abstract/Summary:
With the rapid development of the Internet, the volume of online information is expanding quickly, and the number of patents, papers, and periodicals keeps growing. Faced with such a large body of knowledge, researchers often find it difficult to obtain accurate scientific and technological information. How to design a system that organizes information effectively, classifies large amounts of it, and aggregates it under different conditions, so as to save users time and effort, has become an active topic in both research and application.

This thesis designs and studies a classification and aggregation system for scientific and technological information that provides users with convenient information services. Through the system, users can easily acquire the information they need and classify and aggregate it according to their requirements. The system combines text classification, text aggregation, and web crawling: given a keyword entered by the user, it automatically collects relevant scientific and technological information from the Internet, classifies that information with a classifier, aggregates it according to user-specified conditions, and finally presents the results to the user as web pages. Text classification is based mainly on the multinomial Naive Bayes model, which is used to implement both the trainer and the classifier. Text aggregation is implemented through the various filters of the Elasticsearch search engine, which provide aggregation under different conditions. Web crawling is implemented mainly with Python's requests and Beautiful Soup libraries.

The main work of this thesis includes: 1. Designing the functional requirements and module structure of the classification and aggregation system for scientific and technological information; 2. Implementing the functions of each module of the system; 3. Testing and evaluating the system.
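
The trainer and classifier built on the multinomial Naive Bayes model can be illustrated with a minimal sketch. This is not the thesis's implementation: the class name `MultinomialNaiveBayes`, the Laplace smoothing parameter `alpha`, the whitespace tokenization, and the toy training data are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


class MultinomialNaiveBayes:
    """Hypothetical minimal multinomial Naive Bayes text classifier."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # Laplace smoothing (assumed)
        self.class_doc_counts = Counter()        # documents seen per class
        self.word_counts = defaultdict(Counter)  # token counts per class
        self.vocabulary = set()

    def train(self, documents):
        """Trainer: accumulate counts from (tokens, label) pairs."""
        for tokens, label in documents:
            self.class_doc_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)

    def classify(self, tokens):
        """Classifier: return the label with the highest log-posterior."""
        total_docs = sum(self.class_doc_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_doc_counts.items():
            score = math.log(doc_count / total_docs)  # log prior
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                if token not in self.vocabulary:
                    continue  # ignore tokens never seen in training
                count = self.word_counts[label][token]
                score += math.log(
                    (count + self.alpha)
                    / (total_words + self.alpha * vocab_size)
                )
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training set (invented for this example).
training = [
    ("neural network image recognition".split(), "computer science"),
    ("database index query optimization".split(), "computer science"),
    ("protein gene expression cell".split(), "biology"),
    ("cell membrane enzyme reaction".split(), "biology"),
]

model = MultinomialNaiveBayes()
model.train(training)
print(model.classify("gene expression in cell".split()))
```

Working in log space avoids numeric underflow when many token probabilities are multiplied, and the smoothing term keeps a token that is absent from one class from driving that class's probability to zero.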
Keywords/Search Tags: text classification, Elasticsearch, web spider