Tractable models of natural language semantics for recognizing spoken directions

Posted on:2004-09-09

Degree:Ph.D

Type:Thesis

University:University of Pennsylvania

Candidate:Schuler, William Edward

Full Text:PDF

GTID:2465390011976945

Subject:Computer Science

Abstract/Summary:

The development of speaker-independent mixed-initiative spoken language interfaces, in which users not only answer questions but also ask questions and give instructions, is currently limited by the performance of language models based largely on word co-occurrences. Even under ideal circumstances, with large application-specific corpora on which to train, conventional language models are not sufficiently predictive to correctly analyze a wide variety of inputs from a wide variety of speakers, such as might be encountered in a general-purpose interface for directing robots, office assistants, or other agents with complex capabilities. This thesis explores the use of statistical models of language conditioned on the meanings or denotations of input utterances in the context of an interface's underlying application environment or world model, as an extension to the 'semantic grammars' used in existing spoken language interfaces (which rely on co-occurrences among words or word classes). Since there are an exponential number of possible parse tree analyses attributable to any string of words, and many possible word strings attributable to any utterance, this use of model-theoretic interpretation must involve some kind of sharing of partial results between competing analyses if interpretation is to be performed on large numbers of possible analyses in a practical interactive application. This thesis presents a formal result that model-theoretic semantic interpretation can be factored (cut into well-behaved partial results) and shared (re-used between possible analyses) in polynomial time, in much the same way that simple syntactic structure is factored into context-free rules and shared in standard dynamic programming parsing algorithms. This polynomial bound holds even for analyses containing non-immediate variable scopings (including intra-sentential anaphora and quantifier raising) and generalized quantifiers, which are traditionally analyzed to have second-order (exponential) denotations. The thesis also presents the practical result that this approach does indeed yield a statistically significant improvement in accuracy in analyzing a corpus of spoken directions to 3-D animated agents.

Keywords/Search Tags:

Spoken, Language, Models

Related items

1	Young Spoken Language Teaching Present Situation Investigation And Countermeasure Research
2	Comparison And Research On The Beijing Spoken Language Textbook "Collection Of Language Zier" Of The Mid-19th Century And Modern Spoken Language Textbook
3	From "Spoken Language" To "Poetical Language"
4	Expression and artifact in utterance: The function of models for language in two basic writing classrooms
5	The Comparison Of Complexity Between Spoken Language And Written Language
6	A Study On The Vocabulary Distribution And Growth Pattern For Spoken English
7	Research On The Grammar Exercises Of Elementary Spoken Textbooks Of Short-term Spoken Chinese(threshold)
8	Refiguring language programs in the United States through appropriation of educational models found in Canada, Hong Kong, South Africa and Costa Rica: A hermeneutic analysis of language acquisition
9	The Study On The Texts Of The Intermediate Spoken Textbook Of Teaching Chinese As A Second Language
10	Research On Objects Mentioned By Spoken Language Automatically Guide Visual Attention