Font Size: a A A

Unstructured Document Management Method Based On Semantic Web Technology

Posted on:2013-01-02Degree:MasterType:Thesis
Country:ChinaCandidate:X L ShiFull Text:PDF
GTID:2248330362471167Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
At present, as database technology, data mining, data warehousing technologies are mature andwidely used, structured document management issues have been basicly solved; the extensive application of information technology brings surge unstructured documents, but its has no efficient application, and increasing global competition require effective managemen of the organization’s unstructureddigital information resources.Unstructured document management is increasingly becoming a key issue in information resource management of the Internet age.This thesis apply Semantic Web technologies and standards to unstructured documents analysis and application process, so as to establish an effective method of unstructured document management..First, this thesis introduces the Semantic Web theory, primarily on three key Semantic Web technologies: XML, RDF and ontologies. It talks about the whole unstructured document management life cycle: acquisition, Mark, organization/storage and application, and analysis the probelems in every aspect of the process. The application ofSemantic Web technologies mainly sovles problems as follows: First, the markuping language including special appearance and content appearance, the tagging processand markup tools, and eventually form a complete structure of the Mark describes to facilitate understanding of computers and automatic processing.Next, as for organizational method,use ontology-based organization instead of the traditional linear method to meet the distributed network information needs of the organization, and establish storage monitoring mechanism to ensure the source document collaboration and the synchronous relationship between the marked-up document.Finally, the specific application such as the information retrieval, automatic classification, intelligent reasoning: information retrieval using ontology_based query expansion, and sort according to semantic similarity; automatic classification places as the domain ontology classification tree to form a unified classification criteria; intelligent reasoning ontology and description logic is used for computer to understand, standardized descriptions and features the use of description logic reasoning.Finally, after completing the previous analysis and problem-solving, it establish a completeframework for unstructured documents management, then describes the whole life cycle ofunstructured documentmanagement in detail, and construction of specific experimental scenario tovalidate the feasibility and correctness of analysising and solving.
Keywords/Search Tags:unstructured document, Semantic Web, XML, RDF, Ontology
PDF Full Text Request
Related items