Font Size: a A A

Weighted tree automata and transducers for syntactic natural language processing

Posted on:2011-07-22Degree:Ph.DType:Thesis
University:University of Southern CaliforniaCandidate:May, Jonathan David LouisFull Text:PDF
GTID:2448390002962280Subject:Computer Science
Abstract/Summary:
Weighted finite-state string transducer cascades are a powerful formalism for models of solutions to many natural language processing problems such as speech recognition, transliteration, and translation. Researchers often directly employ these formalisms to build their systems by using toolkits that provide fundamental algorithms for transducer cascade manipulation, combination, and inference. However, extant transducer toolkits are poorly suited to current research in NLP that makes use of syntax-rich models. More advanced toolkits, particularly those that allow the manipulation, combination, and inference of weighted extended top-down tree transducers, do not exist. In large part, this is because the analogous algorithms needed to perform these operations have not been defined. This thesis solves both these problems, by describing and developing algorithms, by producing an implementation of a functional weighted tree transducer toolkit that uses these algorithms, and by demonstrating the performance and utility of these algorithms in multiple empirical experiments on machine translation data.
Keywords/Search Tags:Transducer, Weighted, Algorithms, Tree
Related items