Font Size: a A A

Research And Implementation On Burst Patterns Mining In Database

Posted on:2007-04-27Degree:MasterType:Thesis
Country:ChinaCandidate:D S CengFull Text:PDF
GTID:2178360212473182Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
In recent decades, with the development of technology, the ability of producing and collecting data is improved dramatically. Thus abundant data are accumulated. It is difficult to analyse these very large databases by only using existing methods. An embarrassing phenomenon is that"drowning in data but starving for knowledge". People wish to generate some new techniques and tools to analyse these data automatically and intelligently. Facing to this challenge, Data Mining emerged.Data Mining is a process of nontrivial extraction of implicit, previous, unknown and potentially useful knowledge from a large amount, incomplete of noisy, fuzzy and random data. Data mining is a hot topic in database, artificial intelligence, statistics etc. It attracts a great deal of attention from experts, researchers and information companies.Data mining has many categories, such as association analysis, clustering analysis and exception analysis. Exception analysis is also called exceptional pattern mining, as one of the data mining research topics. Data in a database do not always satisfy the model, which is generated from classification or clustering analysis. Those data objects, which do not satisfy the model, are called Outliers or Exceptions. Some existing algorithms in machine learning and data mining have considered exceptions, but as noises, and excluded them out of analysis. Indeed, from the point of knowledge discovery, rare events are often more interesting and valuable than others. Examples of its applications include the detection of credit card fraud and the monitoring of criminal activities in electronic commerce. Therefore, exceptional pattern mining is an important research work.In this paper, we proposed a new exceptional pattern—Burst pattern, which only appears in a single or a few time spans (or database) but has very high support compared with other patterns during the same time (or same database). High supports mean that the Burst patterns are very frequent in the corresponding time span (or database); while there are only several time spans (databases) that support them. This means that they are special and they could provide the decision-makers of a company or Enterprise with valuable supports and aids. Companies could utilize these patterns to make specific decisions, quicken the development of the companies and increase the profits. Therefore, how to discover the Burst patterns effectively from databases is an important and significative research work. In Section 1, we introduce briefly the...
Keywords/Search Tags:Data mining, Multiple databases mining, Burst patterns mining, Local pattern bases partition, Association rules
PDF Full Text Request
Related items