
Learning Deep Models with Linguistically-Inspired Structure

Posted on: 2019-07-19
Degree: Ph.D
Type: Dissertation
University: Cornell University
Candidate: Niculae, Vlad
Full Text: PDF
GTID: 1478390017489174
Subject: Computer Science
Abstract/Summary:
Many applied machine learning tasks involve structured representations. This is particularly the case in natural language processing (NLP), where the discrete, compositional nature of words and sentences leads to natural combinatorial representations such as trees, sequences, segments, or alignments, among others. It is no surprise that structured output models have been successful and popular in NLP applications since their inception. At the same time, deep, hierarchical neural networks with latent representations are increasingly widely and successfully applied to language tasks. As compositions of differentiable building blocks, deep models conventionally perform smooth, soft computations, resulting in dense hidden representations. In this work, we focus on models with structure and sparsity in both their outputs and their latent representations, without sacrificing differentiability for end-to-end gradient-based training. We develop methods for sparse and structured attention mechanisms, for differentiable sparse structure inference, for latent neural network structure, and for sparse structured output prediction. We find our methods to be empirically useful on a wide range of applications including sentiment analysis, natural language inference, neural machine translation, sentence compression, and argument mining.
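The dissertation's own code is not reproduced in this record. As a rough illustration of the kind of sparse mapping the abstract's "sparse and structured attention mechanisms" refers to, the sketch below implements sparsemax (Martins and Astudillo, 2016), a sparse drop-in replacement for softmax that projects a score vector onto the probability simplex and can assign exact zeros. The function name and the NumPy formulation are illustrative choices, not code from the thesis.

import numpy as np

def sparsemax(z):
    # Euclidean projection of the score vector z onto the probability simplex.
    # Unlike softmax, the result may contain exact zeros (sparse attention weights).
    z = np.asarray(z, dtype=float)
    z_sorted = np.sort(z)[::-1]           # scores in decreasing order
    cssv = np.cumsum(z_sorted)            # cumulative sums of sorted scores
    k = np.arange(1, len(z) + 1)
    support = k * z_sorted > cssv - 1     # coordinates that stay in the support
    k_z = k[support][-1]                  # support size
    tau = (cssv[support][-1] - 1) / k_z   # threshold subtracted from every score
    return np.maximum(z - tau, 0.0)

# Example: sparsemax([2.0, 1.5, 0.1]) gives [0.75, 0.25, 0.0],
# assigning zero weight to the lowest-scoring item, whereas softmax
# would keep all three weights strictly positive.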
Keywords/Search Tags: Models, Natural, Language, Representations, Structured