Research On Key Business Data Extraction And Display Technology Under Big Data Background

Posted on: 2017-01-13 | Degree: Master | Type: Thesis
Country: China | Candidate: X S Ren
GTID: 2278330488465711 | Subject: Electronic and communication engineering
Abstract/Summary:
With the rapid development of network hardware, data of all kinds generated around the world at every moment can now be preserved more completely. How to display collected data visually across space and time, so that its meaning can be read more easily, has become an active research direction in the big data era, and it also provides a useful reference for handling even larger volumes of data in the future.

Hadoop stands out in massive data computation. It is a distributed computing platform, and it has important implications for traditional databases. In almost all large applications today, both the basic data and the specific information reach the PB level; making this data better serve people is where Hadoop comes in.

This work is carried out against the background of big data. The thesis first introduces the research background and significance of massive data and gives an overview of the research status at home and abroad. For data extraction, problems encountered with the original pools table are addressed by introducing the Hadoop Hive data warehouse and an improved Canopy algorithm; taking recruitment data as an example, the performance advantages of the improved Canopy algorithm are then analyzed. On this basis, the thesis introduces the technical architecture of the data display platform, including the basic working principles and structure of AngularJs, D3, and related tools. It describes in detail the process from determining the dimensions to developing the front end and back end, then uses columnar storage for the data, and finally builds an automatic update tool. The thesis also describes Hadoop's pseudo-distributed mode in detail, and the final platform is built successfully.

Finally, the modules of the data display platform are verified with use-case tests, and the platform's performance is stress-tested with LoadRunner; the test results are in line with expectations.
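
The abstract does not describe how the improved Canopy algorithm works, so the following is only a minimal sketch of the standard (unimproved) Canopy clustering procedure, given for orientation. The function name `canopy`, the thresholds `t1` (loose) and `t2` (tight), the use of Euclidean distance, and the sample points are all assumptions for illustration and are not taken from the thesis.

```python
import math


def canopy(points, t1, t2):
    """Standard Canopy clustering (illustrative, not the thesis's improved variant).

    t1 is the loose threshold and t2 the tight threshold, with t1 > t2.
    Returns a list of (center, members) pairs; canopies may overlap.
    """
    if t1 <= t2:
        raise ValueError("t1 must be greater than t2")

    remaining = list(points)
    canopies = []
    while remaining:
        # Take the next unassigned point as a new canopy center.
        center = remaining.pop(0)
        members = [center]
        still_remaining = []
        for p in remaining:
            d = math.dist(center, p)  # Euclidean distance (assumed metric)
            if d < t1:
                # Within the loose threshold: p belongs to this canopy.
                members.append(p)
            if d >= t2:
                # Outside the tight threshold: p may still seed or join
                # another canopy later.
                still_remaining.append(p)
        remaining = still_remaining
        canopies.append((center, members))
    return canopies


# Hypothetical sample data, chosen only to show overlapping canopies.
sample = [(0, 0), (1, 0), (5, 0), (10, 0), (11, 0)]
result = canopy(sample, t1=6, t2=2)
centers = [c for c, _ in result]
```

Canopy centers found this way are commonly used as a cheap pre-clustering step, for example to choose the number and positions of initial centers for a more expensive algorithm; whether the thesis uses them this way is not stated in the abstract.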
Keywords/Search Tags: Hadoop, Hive, AngularJs, Canopy, Data Presentation