Design and Implementation of a Real-Time News Word Cloud System Based on Elasticsearch

Posted on: 2017-11-17
Degree: Master
Type: Thesis
Country: China
Candidate: Z M Liu
GTID: 2428330569985066
Subject: Software engineering
Abstract/Summary:
With the rapid development of information technology, tens of millions of news items are produced around the world every day. Traditional search methods are no longer adequate for searching this data quickly and accurately or for finding the hot news within it. For enterprises, news data holds great hidden value, and how to make rational use of it has become an active research topic. Based on an analysis of a large volume of news data and its characteristics, the system selects a suitable keyword extraction algorithm to extract hot topics from the news, uses the popular Elasticsearch full-text search engine to provide millisecond-level search, and adopts a high-throughput, high-availability, high-performance framework to handle massive data. The result is the design and implementation of a real-time news word cloud system.

The system extracts keywords from news data, generates easily readable hot words and word clouds, and provides users with news search. Its implementation is divided into three parts: data collection and analysis, data storage and management, and word cloud display and search. The data collection and analysis part gathers news data and extracts keywords from it. The data storage and management part uses Elasticsearch as its storage core, providing news data storage and efficient full-text search. The word cloud display and search part presents a word cloud of hot news to users and lets them search the news.

The system uses Ansj to extract keywords from the news data, which gives the data it provides high accuracy. With the Elasticsearch search engine as the core storage medium, the system achieves high stability and performance. The system analyzes news within the company and provides users with an intuitive word cloud page together with high-performance news search, so that users can quickly understand current news hot spots and find related news. The significance of this study lies in combining a variety of new technologies into a highly available framework that solves practical problems for enterprises.
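
The keyword-to-word-cloud step described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: a simple term-frequency count stands in for Ansj-based keyword extraction, the tokenizer only handles English text, and the stop-word list, the function names, and the `content` field name are all assumptions made for illustration. The last function only builds a standard Elasticsearch `match` query body and does not contact a cluster.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real system would use a much larger one.
STOP_WORDS = {"the", "a", "an", "of", "in", "to", "and", "for", "on", "is", "as"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stop words.

    This is a stand-in for a real segmenter such as Ansj.
    """
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def hot_words(articles, top_n=5):
    """Return up to top_n (word, weight) pairs for a word cloud.

    The weight is the word's count divided by the count of the most
    frequent word, so the hottest word always has weight 1.0.
    """
    counts = Counter()
    for text in articles:
        counts.update(tokenize(text))
    if not counts:
        return []
    ranked = counts.most_common(top_n)
    max_count = ranked[0][1]
    return [(word, count / max_count) for word, count in ranked]


def build_search_query(keyword, size=10):
    """Build a full-text query body for an assumed 'content' field."""
    return {"query": {"match": {"content": keyword}}, "size": size}
```

In a setup like the one the abstract describes, the weights from `hot_words` would drive the font size of each word in the cloud, and clicking a word would send the body from `build_search_query` to the search engine.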
Keywords/Search Tags: News, Word cloud, Search, Analysis