Font Size: a A A

Study Of Chinese TExt Mining Based On Web

Posted on:2005-10-04Degree:MasterType:Thesis
Country:ChinaCandidate:H XiaoFull Text:PDF
GTID:2168360122975344Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Information mining is an important research issues in the domains of artificial intelligence and computer application. Chinese text mining based on web is an important aspect of information mining. Internet has become a great information source. It is an important issues for us to confront that how to make the Internet information serve people better. It is a real challenge for us to make Internet easier to use. The information in Internet is in short of organization, and full of a mass of pages, and on the other side, people want to obtain the information quickly and accurately. With the flood of information on the web, web mining is a new research issue which draws great interest from many communities. Now the research of web mining is in the development stage, and more and more study should get on in theory, implement methods and technique.Aim at the concrete problems of Chinese text mining based on Web, this paper mainly researched the methods and implement technique. This article discussed the Chinese word slice, character extraction, character expression and character matching methods, and established the Chinese text classification and clustering algorithms based on neural network. In the design of Chinese text mining based on web, the paper analyzed and researched the expression of web page information, structure feature, web page control symbol and HTML control symbol, and built the extraction flow of web page information, then gave two concrete application of Chinese text mining based on Web through combining with practical problems.
Keywords/Search Tags:data mining, web mining, text mining, character extraction, character express, character matching, clustering/classification
PDF Full Text Request
Related items