Font Size: a A A

Web Data Mining Based On XML And Improvement Of Apriori Algorithm

Posted on:2009-04-11Degree:MasterType:Thesis
Country:ChinaCandidate:X B ZhangFull Text:PDF
GTID:2178360245995490Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Web mining is a complex technology, which refer to the process of information or resource discovery from millions of sources across the World Wide Web. The documents and user information user browsing one or more web localities are the target of web data mining. Web mining can divided into data mining,structure mining and log mining according to different mining target. As xml can combine structural data easily, it is possible to mining multiple database. In this paper we study and discuss the use of XML as data switch pattern in web data mining and web log mining area.Data mining in association rule is an important research topic and apriori is the core algorithm in mining association rule. We propose a method that enhances the efficient of algorithm by evaluating the probability of candidate frequent itemsets. It shortens the runtime of algorithm by reducing the times of scanning database. A formula is provided in this paper.In this paper we want to discuss the use of XML for the web mining area and accomplished the following tasks:1. Study the method that applies the XML technology in the web mining a system of web mining based on XML. In this paper we study the Internet data switch technology of xml recent years. We advance a new method of data mining based xml. and we designed the function of data mining system based XML.2. Implemente the algorithms from converting XML documents into Relational database. in this paper we advance a group of methods of switch xml into Relational database pattern.3. Based on association analysis, an improved algorithm of Apriori is presented in the paper.
Keywords/Search Tags:Web Mining, Association, XML, Text mining, Apriori algorithm
PDF Full Text Request
Related items