Font Size: a A A

Web Data Extraction Technology Research Based On XML

Posted on:2010-07-27Degree:MasterType:Thesis
Country:ChinaCandidate:L L MiaoFull Text:PDF
GTID:2178360275499551Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Along with the rapid development of Internet, the mass of infomation have increased in exponential model.How to extract data from Web application has been the hotspot of nowadays research.So, this paper researchs the extraction of Web data.Firstly,this paper brings forward a conversion method of HTML TO XML, effectively conversing the pattern of HTML to pattern of XML.It simplifys the work of data extraction .and makes the matting for the latter work.Secondly, it has analyzed the data reflection,researchs using the XSL document to reflect thr XML data.As a result the research indicate the superiority of this method,which reflects the XML data source to the needed XML data file.Finially,to analyze the XML dataset, it brings up the storage of XML data,especially the storage of database.Meanwhile it also raises the mode of Web query based on XML, to make a good support for the later stage of whole extraction work and integration. At last,base on the all reseach,combine the data extraction,XML technique and .NET technology to design a rapid,common using Web data extraction system based on XML.
Keywords/Search Tags:Data Extraction, Binary Tree, XSL, XPath, Prototype System
PDF Full Text Request
Related items