Data Mining In Sensor Networks

Posted on:2006-10-10

Degree:Master

Type:Thesis

Country:China

Candidate:J Ding

Full Text:PDF

GTID:2208360155961441

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

In pace with rapid development and increasingly maturation in communication, embedded computing and sensing technology, sensor capable of sensing, communication and computation appears in the world. Networks of such devices typically consist of tens or hundreds of small, power constrained nodes deployed in remote locations which they are expected to monitor for months or years at a time. How to process the continuous large data streams in sensor networks efficiently and how to find interesting knowledge in these streams become new challenge. Ubiquitous data mining(UDM) is concerned with this problem.This paper mainly studys the data mining technologies in sensor networks, including classification, association rule, clustering. This paper presents an algorithm designed to efficient construct a decision tree over distributed data streams in sensor networks. A numerical interval pruning approach is used for efficiently processing numerical attributes, and a probabilistic bound on the accuracy is guaranteed. Rainforest algorithm framework is also used in this algorithm which can significantly reduce the size of the dataset to be processed. In this paper, an algorithm of finding frequent items in distributed data streams of sensor networks is developed. At first, the algorithm finds local frequent itemset in each sensor, and transmits them to their parents to merge. Repeat this, until they reach a central node. At last, the central node combines them into a global frequent itemset, and produces association rules based on them. This paper describes a techniquefor clustering homogeneously distributed data in sensor networks. The proposed technique is based on the principles of the k-means algorithm. At first, the central node generates k centroids and broadcast in the network. Thereafter, each sensor node assigns each point in their local dataset to the nearest centroid, and transmits their local k clusters information to their parents node to merge. This process is repeated until the central node receives all the information of its child nodes. For each cluster, the central node recomputes the centroid as the average of data points assigned to it. If it doesn't meet with the stop condition, the algorithm will iterates the process from the start. In the end, based on the above algorithms, the paper realizes a distributed data mining prototype system in sensor networks. The theory and the technology have the general signification and it is easy to extend to some application fields.

Keywords/Search Tags:

Sensor Networks, Distributed Data Stream Mining, Classification, Association Rules, Clustering

PDF Full Text Request

Related items

1	Research On Association Rules Mining In Data Streams And Its Application
2	Research On Key Algorithms For Mining Frequent Patterns In Data Streams And Their Application In Simulation System
3	Storm-based Distributed Stream Data Association Rule Mining
4	Research And Implementation Of Data Mining Algorithms Based On Distributed Computing
5	Research Of Mining Browsing Patterns Based On Association Rules Of Stream Data
6	Data Stream Mining Technology Research
7	The Method Research Of Mining Association Rules In Distributed Environments
8	The Research Of Application On Medical Data Process And Mining Algorithm Of Association Rules
9	Classification Association Rule Induction Algorithm And Applied Research
10	Implementation And Application Of The Algorithm Of Mining Association Rules For Stream Data Based On Storm