Font Size: a A A

Research And Implementation Of Chinese And English Classification Academic Search Engine Based On Cloud Platform

Posted on:2020-02-09Degree:MasterType:Thesis
Country:ChinaCandidate:B M ZhaoFull Text:PDF
GTID:2428330590964167Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the development and advancement of Internet information,search engines have become an indispensable part of people's lives.Researchers are highly demanding on the accuracy of information,and they are accustomed to applying professional database searches.However,the current domestic and international academic search engines have some shortcomings.For example,foreign academic search engines have a high dead chain rate and unstable services,articles that are not ordered,only part of the content is displayed;Some domestic academic search engines require user registration;Some of the search results are not highly relevant to the user,and the user needs to search in depth one by one;Most academic search engines do not have access to academic texts for free.These situations reduce the user experience.The paper is aimed at the current imperfections of academic search engines,in order to meet the user's need for the acquisition of academic texts,a Chinese-English academic search engine based on cloud judgment classification was developed.The paper obtains data sources by data capture of academic journals in Chinese core journals,Clustering and extracting webpage data content information;using Chinese word segmentation technology to achieve word segmentation;The crawled webpage is based on the webpage format and webpage content,and the spatial vector model algorithm(VSM)and Kmeans algorithm are compared.Finally,the cloud-based webpage academic judgment is implemented based on the improved VSM judgment algorithm;The academic webpage is based on the Chinese map method and the academic classification lexicon.By comparing the decision tree classification algorithm and the naive Bayesian classification algorithm,the middle graph method combined with the improved naive Bayesian algorithm is used to realize the cloud-based academic webpage science classification.Finally,through test and analysis,it is concluded that the Chinese and English academic search engines based on cloud judgment classification realize the academic judgment of web content;Academic search engines can efficiently achieve scientific classification of academic web pages;Academic search engines can satisfy the user to obtain the original text and preview the full text free of charge,and the user can add the publication source;User search interface displays various fields of academic disciplines to realize user classification query;Through the test results,it is meaningful and valuable to demonstrate the Chinese-English academic search engine based on cloud judgment and classification for the purpose of serving users.
Keywords/Search Tags:Academic search engine, Space vector model algorithm, Middle figure method, Naive Bayesian algorithm, cloud platform
PDF Full Text Request
Related items