Font Size: a A A

Parallelization of Push-based System for Molecular Simulation Data Analysis with GPU

Posted on:2017-06-24Degree:M.S.C.SType:Thesis
University:University of South FloridaCandidate:Akhmedov, IliiazbekFull Text:PDF
GTID:2478390017953808Subject:Computer Science
Abstract/Summary:
Modern simulation systems generate big amount of data, which consequently has to be analyzed in a timely fashion. Traditional database management systems follow principle of pulling the needed data, processing it, and then returning the results. This approach is then optimized by means of caching, storing in different structures, or doing some sacrifices on precision of the results to make it faster. When it comes to the point of doing various queries that require analysis of the whole data, this design has the following disadvantages: considerable overhead on traditional disk random I/O framework while reading from the simulation output files and low throughput of the data that consequently results in long latency, and, if there was any indexing to optimize selections, overhead of storing those becomes too big, too. Beside it, indexing will also cause delay during write operations and since most of the queries work with the entire data sets, indexing loses its point.
Keywords/Search Tags:Data, Simulation
Related items