Natural language generation using an information-slim representation

Posted on:2007-05-18

Degree:Ph.D.

Type:Dissertation

University:University of Southern California

Candidate:Soricut, Radu

GTID:1458390005482600

Subject:Computer Science

Abstract/Summary:

In this dissertation, I propose a new natural language generation paradigm, based on direct transformation of textual information into well-formed textual output. I support this language generation paradigm with theoretical contributions in the field of formal languages, new algorithms, empirical results, and software implementations. At the core of this work is a novel representation formalism for probability distributions over finite languages. Due to its convenient representation and computational properties, this formalism supports a wide range of language generation needs, from sentence realization to text planning.

Based on this formalism, I describe, implement, and analyze theoretically a family of algorithms that perform language generation using direct transformations of text. These algorithms use stochastic models of language to drive the generation process. I perform extensive empirical evaluations using my implementation of these algorithms. These evaluations show state-of-the-art performance in automatic translation, and significant improvements in state-of-the-art performance in abstractive headline generation and coherent discourse generation.

Keywords/Search Tags:

Generation, Using, Representation

Related items

1	Research On Detector Generation And Self Representation Methods
2	Research On Different Grained Topic Representation Generation
3	Modele de generation de mouvements rapides en representation de signatures manuscrites
4	A Discussion Of Mass Media's Representation And Construction To The Post-80s Generation
5	A Research On Paraphrase Detection And Generation Based On Sentence Representation
6	The Second Generation ID Card Based On Sparse Representation And Deep Learning
7	A Study On Deep Learning Based Semantic Representation Of Natural Language
8	Representation and generation of terrain using mathematical modeling
9	Conditional Generation And Semantic Editing Of High-resolution Images
10	Generation And Application Of Entity Descriptions Based On Large-scale Knowledge Base