Advances in children's speech recognition with application to interactive literacy tutors

Posted on:2007-06-04

Degree:Ph.D

Type:Thesis

University:University of Colorado at Boulder

Candidate:Hagen, Andreas

GTID:2448390005479355

Subject:Computer Science

Abstract/Summary:

Speech technology offers great opportunity in the field of automated literacy and reading tutors for children. In such applications, speech recognition can be used to track the child's reading position, detect oral reading miscues, and even play an important role in assessing comprehension or engaging the child in interactive dialogs for learning. Despite such promise, speech recognition systems exhibit higher error rates for children due to variability in vocal tract length, formant frequency, pronunciation, and grammar. When children read out loud, these problems are compounded by speech production behaviors that arise from difficulty in recognizing words, which causes pauses, repeated syllables, and other phenomena. This thesis presents advances in speech recognition that improve accuracy and modeling capability in the context of an interactive literacy tutor for children, in the form of a novel set of speech recognition techniques that can be applied to improve oral reading tracking. First, it is demonstrated that speech recognition error rates for interactive read-aloud can be reduced by more than 45% through a combination of advances in both statistical language modeling and acoustic modeling. Next, this thesis proposes extending the baseline system by introducing a novel token-passing search architecture targeting subword-unit-based speech recognition. The proposed subword-unit-based framework is shown to provide accuracy equivalent to that of a whole-word-based speech recognizer while enabling detection of oral reading events and finer-grained speech analysis during recognition. The efficacy of the approach is demonstrated using data collected from children in 3rd through 5th grade: 39.4% of partial words with reasonable evidence in the speech signal are detected at a low false alarm rate of 0.9%. Subword-unit-based speech recognition is then extended to a large vocabulary task, and its advantages for tight search beams are demonstrated in comparison with word-based recognition. Finally, subword units are shown to represent a valuable pool of potential distractors in the language modeling part of pronunciation verification tasks.

Keywords/Search Tags:

Speech, Children, Literacy, Reading, Interactive, Advances

Related items

1	Research On Mobile Application Of Literacy Reading In 3-6 Years Old Children Based On User Experience
2	Gendered literacy through social media: A study of the KidLitosphere blogs
3	Research On Problems Of Digital Reading Literacy Of Readers From The Perspective Of Reading Influence
4	The Research Of Interactive Interface Design On Literacy App For Pre-School Children Which Is Based On User Research
5	Research On The Design Of Children's Reading Service System Based On The Sharing Concept
6	TV Program And The Foster Of Children's IT Literacy
7	Application Of Children's Graded Reading Standards In Children's Book Publishing In China
8	Investigation And Analysis Of Reading Promotion Of Children's Library In Hefei From The Perspective Of Graded Reading
9	Research On User-Oriented Children Digital Reading Promotion
10	Construction Of Children's Reading Service Model In Public Library Guided By The Improvement Of Reading Ability