Research On Analytics Of Distributed Big Temporal Data

Posted on:2020-11-10

Degree:Master

Type:Thesis

Country:China

Candidate:W Zhang

Full Text:PDF

GTID:2428330620459983

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The temporal data is ubiquitous,and massive amount of temporal data is generated nowadays.Management of big temporal data is important yet challenging.It is a desired choice to handle massive temporal data with a distributed system.However,existing distributed solutions either cannot natively support temporal queries,or are disk-based with I/O bottlenecks,which could not well satisfy the requirements of high efficiency and scalability.This paper proposes an In-memory based Two-level Index Solution in Spark to process big temporal data.With global-local index structure,it can effecively filter candidate partitions with global index while using local index to boost in-partition query,which remarkably improves the query performance of various temporal operations such as time travel,temporal aggregation and temporal join,etc.Furthermore,we designed partition method for temporal data to optimize the partition filtering process with global index.With comprehensive experiments,the results show that our proposed solution can provide big temporal data with low latency and high throughput analysis.

Keywords/Search Tags:

Big temporal data, Temporal queries, Distributed in-memory analytics, Two-level index, Partition method, Spark framework

PDF Full Text Request

Related items

1	Temporal Query Analysis And Temporal Index Optimization Based On Apache Spark
2	Research On Consistency And Index Based On B Tree Of Temporal XML
3	Index structures for temporal and multimedia databases
4	Research And Implementation Of A Segmentation Hybrid Temporal Index Structure In Database
5	The Research Of Temporal Index Technique And Algorithm
6	Research Of Query And Analysis Technology For Spatio-temporal Big Data Based On Spark
7	Design And Implementation Of Temporal RDF Storage System Based On TimeDB
8	Research On Index And Query Technology Of Spatio-temporal Data Based On Hadoop
9	Tqindex:An Effective Index Structure For Processing Temporal Queries On Very Large Scale Of Data
10	Research And Application Of Distributed Spatio-temporal Data Index Mechanism