Study On Data Preprocessing Techniques In Rfid Complex Applications

Posted on:2009-01-28

Degree:Master

Type:Thesis

Country:China

Candidate:X J Li

Full Text:PDF

GTID:2198360308979229

Subject:Computer software and theory

Abstract/Summary:

As a new technology integrated with signal processing, wireless communication, embedded calculation and data management, RFID technology is being widely used in more and more areas, such as supply chain management, object tracking, quick disbursement and so on. However, RFID technology adopts wireless radio frequency signal to communicate, which is easily interfered with environment, so there are many missed readings, erroneous readings, duplicates and data out of order in time when collecting data in RFID applications, which influences the accuracy of query results for event detection badly and limits the development of RFID applications. Therefore, the preprocessing over RFID data is the prerequisite of assuring high quality of query results.For the goal of solving the issues proposed above, this paper focuses the preprocessing strategy over "dirty" data generated in RFID applications.Firstly, on the basis of triple tuple over RFID data, the paper proposes a data abstraction algorithm which transforms RFID data from data level to logic area level. This algorithm is used to compress data where lots of redundant data are deleted and some missed readings are considered. After that, a tuple may be considered as a simple event. Experimental results show that through abstraction the amount of data is extremely cut down. In this way, system resource is greatly saved for further data cleaning.Secondly, in order to solve the missed reading problem-the main type of "dirty" data in RFID applications, this thesis proposes three interpolating algorithms based on data abstraction, namely rapacity algorithm, mink-similar algorithm and allk-similar algorithm. Above all, a dynamic probabilistic event model is established by statistically studying arriving events and computing the missing rate of each logic area. Then, on the basis of this model, missed events are interpolated by searching their most similar events using different searching strategies. These three algorithms increase data accuracy largely, and eliminate the influence of erroneous data to query quality. Theoretical analysis and abundant experiments prove the effectiveness and efficiency of proposed data interpolating algorithms.Lastly, this thesis improves the above interpolating algorithms by adding the factor of time. It mainly develops probabilistic event model by introducing temporal model, and thus two improved strategies of original interpolating algorithms are proposed, namelyÎ²* improved algorithm andÎ²+ improved algorithm.Î²* improved algorithm adopts histogram graph distribution to estimate time, andÎ²+ improved algorithm adopts Euclidean distance to estimate time. In different cases, these two algorithms behave well separately. Experiments show that improved data interpolating algorithms have the superiority on accuracy of processing results.

Keywords/Search Tags:

RFID application, data preprocessing technique, data interpolating strategy, probabilistic event model, missed reading

Related items

1	Study On Data Preprocessing Techniques In RFID Complex Applications
2	Key Technology Research On RFID Data Cleaning
3	The Research Of Algorithm For Uncertain RFID Data Cleansing
4	Research On The Data Cleaning Techniques For The Railway Container Yard Management System Based On RFID
5	An Improved Probabilistic Database Model And Its Probabilisticn Earest Neighbors Query Research
6	Uncertain Rfid Data On Complex Event Processing Technology
7	Real-time Query Processing And Optimization For Basic Events From RFID Data Streams
8	Diagnostic Prediction Method And Application Based On Multimodal Sensing Data
9	Research And Application On Data Preprocessing Algorithms
10	Research And Application On Data Preprocessing System Of Mobile Internet Data