Natural language interfaces to databases (NLIDBs) provide users with a way to access information stored in databases directly in natural language. NLIDBs involve many kinds of subjects, such as AI, NLP, DB, HCI, etc. Over the past thirty years, although there have been signification advances in the area, the NLIDB systems did not gain rapid and wide commercial acceptance for the problems of portability and usability.This thesis attempts to develop a new methodology based on the database semantics to solve the key problems in NLIDBs. I argue that previous approaches to NLIDB are problematic, mainly because they do not pay more attention to benefit from different subjects synthetically.This thesis first presents a formal definition and classification about NLIDB, and then gives a general abstract model involved in NLIDB to outline the research scope and highlight the key and tough points of NLIDB. Based on the above discussion, two kinds of Chinese natural language interfaces are depicted in our project, namely Chiql, a template-based system, and NChiql, a restricted natural language based system.This thesis provides the portable architecture of NChiql, which emphasizes on the portability, usability, adaptability, robust and intelligence. In order to achieve these goals, this thesis presents a semantic conceptual model (SCM) which attempts to integrate the knowledge of language, specific domain and database. Static and dynamic knowledge acquisition mechanism is adopted to construct SCM.Based on the domain concepts and database semantic in SCM, this thesis depicts a word segmentation algorithm, which can handle the lexical ambiguity and unknown words by applying backtracking, related semantic...
|