
Research On HBase Data Storage Based On Hadoop Platform In Express Industry

Posted on: 2018-09-12 | Degree: Master | Type: Thesis
Country: China | Candidate: L Wen | Full Text: PDF
GTID: 2348330536484804 | Subject: Logistics Engineering and Management
Abstract/Summary:
In the big data environment, the volume of information generated by the express industry has surged, and that information is concentrated in time. Traditional databases can no longer meet the industry's storage requirements. Hadoop, a newer big data processing platform, is well suited to this need: it can store all kinds of data at large scale and gives enterprises control over their data, and mining that data for patterns reveals its potential value. The HBase database, built on the Hadoop platform, can store large quantities of small-scale data, which meets the data mining needs of enterprises, especially small and medium-sized ones. Studying HBase in theory and applying it to store and analyze the information the express industry generates lets enterprises turn troublesome express data from waste into treasure. To address the "data waste" problem in express information storage, this paper studies the construction of a Hadoop distributed storage and computing platform, and examines how applicable the MapReduce data processing model and its task scheduling are to the express industry. On this basis, it studies the working principles of HDFS (Hadoop Distributed File System) and of the HBase database built on it. Based on the characteristics of the HBase data model, it clarifies the data storage and reading mechanisms of HDFS and establishes an HBase data storage model. It works out the relationship between the physical view and the logical view of an HBase database, examines HBase's data indexing and error recovery mechanisms, and assesses how well the HBase data model and its distributed nature fit the express industry. Finally, using express data provided by Henan Xinyang Company as experimental data and HBase as the experimental tool, storage and query experiments were carried out on the HBase database, and the stored data were analyzed and processed with MapReduce to evaluate the applicability of HBase on the Hadoop platform to the express industry.
Keywords/Search Tags: Hadoop platform, HBase database, Express data, Data storage and analysis