
The Study Of Telephone Content Text Classification Based On Ontology

Posted on: 2009-01-09
Degree: Master
Type: Thesis
Country: China
Candidate: D Zheng
Full Text: PDF
GTID: 2178360245954057
Subject: Computer application technology
Abstract/Summary:
The Internet is developing rapidly, and the ways of accessing it are multiplying. Users are no longer satisfied with browsing only through a computer browser such as Internet Explorer; they also want to reach web pages by telephone or mobile phone instead of a computer screen. People prefer to communicate in natural language rather than in digits and letters, so the friendlier voice interface is becoming more popular. VoiceXML is an XML-based standard for exchanging audio data, and it provides a voice-based way of accessing the Internet. It can connect and exchange data seamlessly with databases and other XML-based documents, so it can link the Internet closely with the telephone network.
The voice gateway based on VoiceXML submits users' documents to the server. As the number of documents grows rapidly, the server comes under heavy load, so the information needs to be classified automatically and each class of documents then handled separately. Information used to be searched or classified by keywords, but this does not work well because the computer cannot understand the implied semantic meaning of the keywords. Ontology is introduced to solve this semantic problem: it can describe and analyze the semantic meaning of keywords, and the implied semantic information can be expressed by ontology models. Classic search algorithms match words by syntax and lack the ability to express, handle, and comprehend knowledge. The main way to solve these problems is to match words by semantics instead of syntax.
Ontology is a tool for describing knowledge and a form of knowledge representation. It can also serve as the basis for rule-based logical reasoning. Reasoning over an ontology means extracting implied knowledge from explicit definitions or statements.
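The contrast between matching by syntax and matching by semantics can be illustrated with a minimal sketch. The toy hierarchy, the terms in it, and the function names below are all hypothetical and are not taken from the thesis; the sketch only assumes that an ontology lets a term imply its parent concepts.

```python
# Hypothetical toy ontology: each term maps to its direct parent concept.
PARENT = {
    "tuition": "fee",
    "fee": "finance",
    "scholarship": "finance",
    "exam": "teaching",
}


def ancestors(term):
    """Return every concept implied by `term`, found by following parent links."""
    found = []
    while term in PARENT:
        term = PARENT[term]
        found.append(term)
    return found


def keyword_match(query, text):
    """Syntactic matching: the query word must literally appear in the text."""
    return query in text.lower().split()


def semantic_match(query, text):
    """Semantic matching: a word also matches if the query is one of its ancestors."""
    words = text.lower().split()
    return any(w == query or query in ancestors(w) for w in words)
```

With this sketch, the query "finance" does not match the text "When is tuition due" by keyword, but it does match semantically, because "tuition" implies "fee", which in turn implies "finance". That implied link is the kind of knowledge that reasoning extracts from explicit statements.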
An ontology is an explicit specification of a conceptualization, that is, a kind of description of knowledge. If an ontology is to be used for semantic analysis, rules must be introduced, and these rules are used for reasoning. Predicate logic is an important form of knowledge representation. A rule system for analyzing the semantic information of text can be constructed on an ontology-based knowledge repository.
Here, OWL is used to describe the domain knowledge, and the rule system expresses the reasoning mechanism. There are many tools for editing and developing ontologies. Protégé 3.2.1, developed by Stanford University, is the platform used to construct the ontology here. Protégé is an open, Java-based, extensible ontology editor that provides many plug-ins and APIs. As a simulation, we build an ontology of college administration, and on top of it a rule system that is used to process the text information.
We propose classifying telephone text content by means of ontology to solve the problems discussed above. Ontology is a modeling tool for expressing semantic meaning and knowledge. It is used in taxonomy (classification) to increase precision and processing speed.
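As a rough illustration of a rule system built on an ontology, the sketch below classifies a short text by mapping its words to concept classes and then applying one rule per class. The concept names, the rules, and the category labels are invented for illustration only; the thesis itself uses OWL and Protégé, and this plain-Python stand-in does not reproduce its actual ontology or rules.

```python
from collections import Counter

# Hypothetical fragment of a college-administration ontology:
# each term is declared a member of one concept class.
SUBCLASS_OF = {
    "tuition": "FinanceMatter",
    "refund": "FinanceMatter",
    "transcript": "RegistryMatter",
    "enrollment": "RegistryMatter",
    "dormitory": "HousingMatter",
}

# Hypothetical rules: a text mentioning a concept of this class
# is routed to the corresponding category.
RULES = {
    "FinanceMatter": "finance_office",
    "RegistryMatter": "registrar",
    "HousingMatter": "housing_office",
}


def classify(text, default="unclassified"):
    """Return the category that receives the most rule firings for `text`."""
    votes = Counter()
    for word in text.lower().split():
        word = word.strip(".,?!")          # drop trailing punctuation
        concept = SUBCLASS_OF.get(word)    # None if the word is not in the ontology
        if concept in RULES:
            votes[RULES[concept]] += 1
    if not votes:
        return default
    return votes.most_common(1)[0][0]
```

For example, `classify("I need a refund of my tuition.")` yields `"finance_office"`, because two words in the text belong to the hypothetical `FinanceMatter` class. A text with no known concepts falls back to the default label.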
Keywords/Search Tags: Taxonomy, Ontology, VoiceXML