Using inductive logic programming to automate the construction of natural language parsers

Posted on:1996-09-25

Degree:Ph.D

Type:Dissertation

University:The University of Texas at Austin

Candidate:Zelle, John Marvin

Full Text:PDF

GTID:1468390014486947

Subject:Computer Science

Abstract/Summary:

Designing computer systems to understand natural language input is a difficult task. In recent years there has been considerable interest in corpus-based methods for constructing natural language parsers. These empirical approaches replace hand-crafted grammars with linguistic models acquired through automated training over language corpora. A common thread among such methods to date is the use of propositional or probabilistic representations for the learned knowledge. This dissertation presents an alternative approach based on techniques from a subfield of machine learning known as inductive logic programming (ILP). ILP, which investigates the learning of relational (first-order) rules, provides an empirical method for acquiring knowledge within traditional symbolic parsing frameworks.; This dissertation details the architecture, implementation and evaluation of C scHILL, a computer system for acquiring natural language parsers by training over corpora of parsed text. C scHILL treats language acquisition as the learning of search-control rules within a logic program that implements a shift-reduce parser. Control rules are induced using a novel ILP algorithm which handles difficult issues arising in the induction of search-control heuristics. Both the control-rule framework and the induction algorithm are crucial to C scHILL's success.; The main advantage of C scHILL over propositional counterparts is its flexibility in handling varied representations. C scHILL has produced parsers for various analyses including case-role mapping, detailed syntactic parse trees, and a logical form suitable for expressing first-order database queries. All of these tasks are accomplished within the same framework, using a single, general learning method that can acquire new syntactic and semantic categories for resolving ambiguities.; Experimental evidence from both artificial and real-world corpora demonstrates that C scHILL learns parsers as well or better than previous artificial neural network or probabilistic approaches on comparable tasks. In the database query domain, which goes beyond the scope of previous empirical approaches, the learned parser outperforms an existing hand-crafted system. These results support the claim that ILP techniques as implemented in C scHILL represent a viable alternative with significant potential advantages over neural-network, propositional, and probabilistic approaches to empirical parser construction.

Keywords/Search Tags:

Natural language, Parser, Schill, Logic, Using, ILP, Approaches, Empirical

Related items

1	The Design And Implementation Of Modular Manipulator Programming Language Parser
2	Integrating top-down and bottom-up approaches in inductive logic programming: Applications in natural language processing and relational data mining
3	Similarity-based approaches to natural language processing
4	Analysis Of Mathematical Natural Language Structure And Its Application Based On Parser
5	The design and implementation of a parser and front-end for the ISO C++ language and validation of the parser
6	Logic-based natural language understanding in intelligent tutoring systems
7	Sequence-based High-order Dependency Parser For Non-projective Languages
8	Constructing A Chinese Semantic Auto Parser Based On An ILP Algorithm Combining Top-down And Bottom-up Methods
9	Research On The Transformation Method From Natural Language Text To PPTL Formula
10	Design And Implementation Of Language LP For Log Parser