Automatic language identification with recurrent neural networks

Posted on:1998-11-15

Degree:D.Sc

Type:Dissertation

University:University of Massachusetts Lowell

Candidate:Braun, Jerome J

Full Text:PDF

GTID:1468390014977317

Subject:Computer Science

Abstract/Summary:

PDF Full Text Request

Automatic Language Identification (LID) means the capability of a machine to determine the natural language from a spoken utterance. LID is an important domain in Speech Processing and its significance is growing. As a basic research area, LID is of interest as a mechanism automating one of the capabilities of the human brain. Practical applications of LID include systems that must recognize the talker's language within their primary functionality. LID is an important enabling technology that can augment and enhance speech recognition facilities, e.g., in multi-lingual multimedia and translation systems. Approaches to LID include, among others, Hidden Markov Modeling techniques, phonotactics, prosody, and Large Vocabulary Continuous Speech Recognition (LVCSR). In spite of a surge in LID efforts during recent years, Automatic Language Identification remains an open research area. While some approaches offer solutions to particular application scenarios, this dissertation is concerned with a general, essential LID task (i.e., the LID without recognition capabilities at word-level and above), exploiting general, language-related, speech phenomena.; In this dissertation, a novel approach to the essential Automatic Language Identitication is proposed. The Recurrent Neural Network (RNN) architecture is proposed as the fundamental LID mechanism. The motivation for the RNN-based approach (as opposed to feedforward networks, e.g., MLP) includes addressing the long-term intra-utterance context, proposed as a critical element for the essential LID. Our approach also postulates a non-uniform distribution of LID-specific information, and introduces the concept of Perceptually Significant Regions (PSRs) that contain elevated levels of such information within the utterance. Our approach proposes a novel method called Perceptually Guided Training (PGT) for exploitation of this non-uniformity. The developmental and experimental aspects of this research include the LIREN/PGT (Language Identification with REcurrent Neural networks and PGT) environment. The LID training experiments show the efficacy of the PGT method by demonstrating improvement of the training process behavior. This research also includes the investigation of a number of other issues in LID training, and it proposes a number of algorithmic enhancements related to the LID Recurrent Neural Network training.

Keywords/Search Tags:

LID, Language identification, Recurrent neural, Automatic language, Training

PDF Full Text Request

Related items

1	Automatic language identification with sequences of language-independent phoneme clusters
2	Mongolian Language Model Based On Recurrent Neural Network
3	Research On Automatic Answering Technique Of English Test
4	Deep Learning Based Spoken Language Identification
5	The Study Of GMM-based Language Identification
6	The Optimization And Implementation Of The Efficiency And Performance Of Chinese Language Model Based On Recurrent Neural Network
7	Acoustic-Based Research On Automatic Language Identification
8	Research On Sign Language Recogniton Method Based On Convolutional Neural Networks And Recurrent Neural Networks
9	Research On Automatic Language Identification And Its Application
10	New approaches to automatic language identification