Research And Implementation Of Data Mining System For Scientific Literature

Posted on: 2018-10-29
Degree: Master
Type: Thesis
Country: China
Candidate: L Z Lin
GTID: 2348330518493387
Subject: Computer Science and Technology
Abstract/Summary:
Scientific and technical literature is the carrier of science and technology, and efficient analysis of this literature plays an important role in promoting scientific and technological development. Accurate, automatic extraction of the information contained in scientific and technical literature can improve both the efficiency and the accuracy of its analysis. Traditional document analysis methods struggle to understand the information in a text in depth, and the subjectivity and inefficiency of manual analysis are uncontrollable factors that affect the results. In recent years, data mining has gradually become a popular subject, and natural language processing technology has also been widely used in the analysis and processing of scientific and technical literature. The combination of the two has broad application prospects in the field of scientific literature analysis.
This thesis aims to provide a data mining platform for English scientific and technical documents that combines natural language processing technology with data acquisition, semantic mining, document clustering, and visual display. Based on web crawler theory, the thesis implements a crawling subsystem for scientific and technical literature. Building on a study of the basic theory and process of data mining, it implements a clustering subsystem for scientific and technical literature. Through an in-depth analysis of open domain information extraction (OpenIE), it applies OpenIE technology to the extraction of key phrases from scientific literature.
In addition, the system provides full-text search and statistical analysis capabilities. Finally, it presents the literature mining results and a visual display of key phrases to users, helping science and technology practitioners better grasp the semantics of scientific and technical documents and the relationships between them.
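The abstract does not state which clustering algorithm or text representation the clustering subsystem uses. As an illustration only, the following minimal sketch assumes TF-IDF vectors, cosine similarity, and a greedy single-pass grouping; the function names, the similarity threshold, and the sample abstracts are all hypothetical and are not taken from the thesis.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))  # count each term once per document
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        # Smoothed IDF keeps every weight strictly positive.
        vectors.append({
            term: (count / total)
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_documents(docs, threshold=0.2):
    """Greedy single-pass clustering (an assumption, not the thesis's method).

    Each document joins the existing cluster whose first member is most
    similar to it, provided the similarity reaches `threshold`; otherwise
    it starts a new cluster. Returns lists of document indices.
    """
    vectors = tfidf_vectors(docs)
    clusters = []
    for i, vec in enumerate(vectors):
        best, best_sim = None, threshold
        for cluster in clusters:
            sim = cosine(vec, vectors[cluster[0]])
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            clusters.append([i])
        else:
            best.append(i)
    return clusters


# Hypothetical sample abstracts, used only to exercise the sketch.
abstracts = [
    "neural network training with gradient descent",
    "gradient descent for deep neural network",
    "protein folding in yeast cells",
    "yeast protein expression in cells",
]
print(cluster_documents(abstracts))  # [[0, 1], [2, 3]]
```

A production system would more likely use an established library and a tuned algorithm; the greedy pass here is chosen only because it is short and needs nothing beyond the standard library.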
Keywords/Search Tags: scientific literature, data mining, open domain information extraction, key phrases