A Restricted Natural Language Query System for an Archaeological Digital Museum

Posted on: 2006-01-01  Degree: Master  Type: Thesis
Country: China  Candidate: X N Ma  Full Text: PDF
GTID: 2208360182977019  Subject: Computer application technology
Abstract/Summary:
A Chinese natural language query system (CNLQS) draws on several fields, including natural language understanding, database technology, artificial intelligence, and human-computer interfaces. With a CNLQS, users pose questions in natural Chinese and receive answers directly from the database system, which makes human-computer interaction easier. As an important part of human-computer interaction, CNLQS has recently attracted growing attention and is regarded as a research field of both theoretical and practical value.

This thesis implements a restricted Chinese natural language query system for an archaeological digital museum application. Users enter a restricted Chinese query sentence through the user interface; the system then translates the input into standard SQL and retrieves the result from a cultural relics database. Against this background, the following work was carried out:

1. We propose a restricted grammar and rule set that is consistent with Chinese grammar and meets the needs of querying. A questionnaire survey first identified four types of Chinese query sentences: imperative sentences, questions, elliptical sentences, and compound sentences. The respondents were 24 students of the Computer Department of Shandong Architecture Institute. The analysis shows that imperative sentences are used most frequently, at about 70.2 percent. On this basis, we propose a restricted grammar and rules that match Chinese language habits and satisfy the query requirements of the archaeological digital museum.

2. We design a system dictionary for the archaeological digital museum. Building a simple and appropriate system dictionary is both a foundation and a difficulty of this work. Based on an analysis of the museum's data, the thesis introduces three kinds of dictionary, each with a different function: a general dictionary, a special dictionary, and an associated dictionary. The general dictionary is used for Chinese word segmentation and yields the part-of-speech sequence; from that sequence and the restricted grammar and rules, the target and the condition of the query sentence can be determined. The special dictionary produces the standardized form of the Chinese query sentence and completes the preprocessing. The associated dictionary resolves complex relationships among tables by means of associated paths. The dictionaries are stored in a SQL Server database, which unifies the form of the dictionary definitions and the data, speeds up the analysis of query sentences, and thereby improves system performance.

3. We propose a new word segmentation algorithm, WSDS (Word Segmentation on Database Semantic). Drawing on domain knowledge of the archaeological digital museum, on information theory and operations research, and on the database semantics recorded in the system dictionary, WSDS effectively resolves segmentation ambiguity in Chinese and obtains the correct segmentation of the query sentence. Based on WSDS, we obtain the sentence array and the sentence-type string, which support semantic analysis and the conversion of the natural language query into SQL. The results obtained are satisfactory.

4. Building on the word segmentation, we propose an algorithm for constructing SQL-like statements, COS (Condition-Object Segmentation). It interprets the restricted query sentence and builds the corresponding SQL-like statement.

5. Finally, the SQL-like statement is converted into standard SQL using the associated dictionary and an associated-path search.

The main innovations of this thesis are as follows:

1. A restricted grammar and rule set that is consistent with Chinese grammar and meets the needs of querying.

2. The study and design of a system dictionary for the archaeological digital museum.

3. A new word segmentation algorithm, WSDS.

4. An algorithm for constructing SQL-like statements, COS.

This thesis presents only a prototype that handles Chinese natural language queries entered by users. Much remains to be done to refine the system and make it practical in the archaeological digital museum field, for example improving the system's portability and further resolving segmentation ambiguity in Chinese.
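
To make the dictionary-driven pipeline more concrete, here is a minimal sketch of how a dictionary lookup can segment a restricted query and separate its target from its conditions. It is not the WSDS or COS algorithm, whose details the abstract does not give: plain forward maximum matching stands in for the segmentation step, and the dictionary entries, role names, field names, and sample query are all assumptions made up for illustration.

```python
# Hypothetical general dictionary: word -> (role, database meaning).
# None of these entries come from the thesis; they only illustrate the idea.
DICTIONARY = {
    "查询": ("command", None),      # "query"
    "的": ("particle", None),       # possessive particle
    "唐代": ("value", "dynasty"),   # "Tang dynasty", a value of field dynasty
    "陶器": ("value", "category"),  # "pottery", a value of field category
    "名称": ("field", "name"),      # "name", a queryable field
}
MAX_WORD_LEN = max(len(word) for word in DICTIONARY)


def segment(sentence):
    """Forward maximum matching: at each position take the longest known word."""
    tokens = []
    i = 0
    while i < len(sentence):
        for length in range(min(MAX_WORD_LEN, len(sentence) - i), 0, -1):
            word = sentence[i:i + length]
            if word in DICTIONARY:
                tokens.append((word, *DICTIONARY[word]))
                break
        else:
            # Unknown character: keep it so no input is silently dropped.
            word = sentence[i]
            tokens.append((word, "unknown", None))
        i += len(word)
    return tokens


def to_sql_like(tokens):
    """Split tokens into query targets (fields) and conditions (field, value)."""
    targets = [meaning for _, role, meaning in tokens if role == "field"]
    conditions = [(meaning, word) for word, role, meaning in tokens
                  if role == "value"]
    return {"select": targets, "where": conditions}


tokens = segment("查询唐代的陶器的名称")
query = to_sql_like(tokens)
```

The intermediate result names fields and values but no tables or joins, which is roughly the gap that the conversion from SQL-like statements to standard SQL has to close. Forward maximum matching cannot resolve segmentation ambiguity on its own; that is the problem the thesis addresses with database semantics.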
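
The associated-path step can be pictured as a search over the links between tables. The sketch below assumes the associated dictionary can be read as a graph of foreign-key links and uses breadth-first search to find the shortest chain of joins. The table names, join conditions, and function names are hypothetical, and the thesis may use a different search method.

```python
from collections import deque

# Hypothetical associated dictionary: (table_a, table_b) -> join condition.
LINKS = {
    ("relic", "site"): "relic.site_id = site.id",
    ("site", "region"): "site.region_id = region.id",
    ("relic", "dynasty"): "relic.dynasty_id = dynasty.id",
}


def neighbors(table):
    """Yield (other_table, join_condition) for every link touching table."""
    for (a, b), condition in LINKS.items():
        if a == table:
            yield b, condition
        elif b == table:
            yield a, condition


def join_path(start, goal):
    """Breadth-first search for the shortest chain of joins from start to goal."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        table, path = queue.popleft()
        if table == goal:
            return path
        for nxt, condition in neighbors(table):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [(nxt, condition)]))
    return None  # the two tables are not connected


def build_sql(select, start, goal, where):
    """Expand a target and a condition on different tables into joined SQL."""
    path = join_path(start, goal)
    if path is None:
        raise ValueError(f"no join path from {start} to {goal}")
    joins = "".join(f" JOIN {table} ON {cond}" for table, cond in path)
    return f"SELECT {select} FROM {start}{joins} WHERE {where}"


sql = build_sql("relic.name", "relic", "region", "region.name = ?")
```

The `?` placeholder keeps the user-supplied value out of the SQL string, so it can be bound as a parameter when the statement is executed.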
Keywords/Search Tags:Natural Language Query, Word Segmentation, System Dictionary, WSDS Algorithm, Digital Museum, Man-Machine Interaction