Font Size: a A A

Research Of A Mass Transaction Record Query System Based On Hadoop

Posted on:2014-01-25Degree:MasterType:Thesis
Country:ChinaCandidate:J B WeiFull Text:PDF
GTID:2248330395484010Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the rapid development of the Internet, people’s consumption habits are changing and moreand more people prefer shopping online, because shopping online is convenient. With more andmore businesses are stationed in the e-commerce site, the type and quantity of goods have increasedrapidly, along with a huge user visits and the huge volume of transactions, e-commerce sites willgenerate vast amounts of transaction records, thus the processing capabilities of traditionalrelational database is facing severe challenges.Cloud computing can provide almost unlimited computing and storage capacity by connecting alarge number of inexpensive computers. Cloud computing is a new solution for massive datastorage and processing. Hadoop is a framework for distributed processing of big data and it allowsusers to develop distributed procedures for large data processing without understanding theunderlying details of the distributed system. As an open source system, Hadoop becomes theresearch focus of enterprises and research institutions.This thesis implemented a mass transaction records query system based on Hadoop by in-depthanalysis and research on Hadoop. Firstly, this thesis researched the Hadoop and related technologies,mainly introduced the Hadoop Distributed File System and the HBase, including the characteristicsof HDFS, HDFS architecture, HDFS Data Replication strategy, HBase architecture and data model.Secondly, this thesis analyzed the HBase storage characteristics, including the HBase data storagemethod, HBase Region locate method and the writing data process, and then put forward the systemdesign optimization and improvement suggestions. Then, this thesis introduced the design andimplementation of the Hadoop-based mass transaction record query system in detail, including thedata access module, storage module, query module and the TSS subsystem. Finally, the system istested, including functional testing and performance testing,and the feasibility and correctness ofthe system are verified.
Keywords/Search Tags:Cloud Computing, Hadoop, HBase, Big Data
PDF Full Text Request
Related items