Research Of Massive Data Distributed Storage And Processing System In Internet Of Vehicles

Posted on:2018-08-28

Degree:Master

Type:Thesis

Country:China

Candidate:S Y Chen

Full Text:PDF

GTID:2382330542976927

Subject:Electronic and communication engineering

Abstract/Summary:

PDF Full Text Request

With the explosion of vehicles,the concept of Internet of Vehicles(IoV)has drawn increasing concern among academics.The IoV refers to extracting and processing information from massive data which are gathering from vehicles and their surrounds.With the development of the Internet of Vehicle,the massive data which collected from some sensors for the equipment monitoring requires higher performance of data storage and inquiry.So an integrated IoV system which is reliable on data storage and efficient on data processing is indispensable.The paper designs a distributed storage and processing system depending on Hadoop,HBase and Spark to store and process IoV data.The system can not only overcome the single point of failure problem,guarantee the reliable data storage and meet the efficient computing requirements,but also can realize the clustering algorithm of parallel calculation.The contributions of this paper are as follows:Firstly,the paper studies the characteristics of the IoV data,the study finds that IoV data has the some characteristics of the big data,such as large amount of data and fast processing.So the paper combines the big data technologies,proposing a distributed storage and parallel processing system depending on Hadoop,HBase and Spark to store and process the IoV data.And the construction of the system environment and deployment are completed.The study on the system is focus on the data storage layer and the data processing layer.The HDFS and the HBase are used for the data storage layer that resolves the problem of single point of failure.The Spark parallel computing framework is deployed as the data processing layer that benefits the efficient analysis of massive IoV data.A variety of big data technologies are combined together,constituting a complete,reliable and stable IoV data distributed storage system.Secondly,according to the user requirements,the paper combines the Spark parallel programming model with the clustering algorithms,realizing the parallel clustering algorithm.The anomalous data is filtered out before the data processing and analysis via Spark computing.Then the parallelized K-means algorithm and DBSCAN algorithm are realized on Spark.The experimental data comes from practical taxi data in Fuzhou and the clustering results are shown in Fuzhou road network map.The experimental results can facilitate the public to take a taxi,make it possible for the taxi driver to shift or have a rest.Finally,the thorough tests are done to verify the performance of stability,distributed storage and parallel computing on this system,which use the taxi data in Fuzhou.The results show that the system can satisfy the requirements of the reliable data storage,the load balance and stable data processing.

Keywords/Search Tags:

Internet of Vehicles, Hadoop, Spark, HBase, Clustering Algorithm

PDF Full Text Request

Related items

1	Research And Implementation Of Parallel Processing Technology For Remote Sensing Image On Cloud Architecture Storage And Classification Algorithm
2	Design And Implementation Of Taxi Service Application System Based On Hadoop Platform And Traiectory Mining
3	Design And Implementation Of The Storage And Mining Subsystem For Urban Rail Transit Log Data
4	The Detection Of Traffic Congestion Based On Data Mining In Internet Of Vehicles
5	Research On Construction And Application Of Pavement Image Database Based On HBase
6	Distributed Storage And Analysis Of Massive Urban Traffic Flow Data Based On Hadoop
7	Research And Implementation Of Clustering Optimization Algorithm For VANET Based On Mobility
8	Real Time Data Analysis System Of ISCS Based On Hadoop
9	Research On Traffic Big Data Query Optimization Based On HBase
10	Design And Implementation Of Traffic Information Paraller Processing Cloud Platform Based On Hadoop