Feature extraction and feature reduction for spoken letter recognition

Posted on:2017-08-09

Degree:M.S

Type:Thesis

University:The University of North Carolina at Greensboro

Candidate:Wendell, Tyler James

Full Text:PDF

GTID:2458390008452959

Subject:Computer Science

Abstract/Summary:

The complexity of finding the relevant features for the classification of spoken letters is due to the phonetic similarities between letters and their high dimensionality. Spoken letter classification in machine learning literature has often led to very convoluted algorithms to achieve successful classification. The success in this work can be found in the high classification rate as well as the relatively small amount of computation required between signal retrieval to feature selection. The relevant features spring from an analysis of the sequential properties between the vectors produced from a Fourier transform. The study mainly focuses on the classification of fricative letters f and s, m and n, and the eset (b,c,d,e,g,p,t,v,z) which are highly indistinguishable, especially when transmitted over the modern VoIP digital devices. Another feature of this research is the dataset produced did not include signal processing that reduces noise which is shown to produce equivalent and sometimes better results. All pops and static noises that appear were kept as part of the sound files. This is in contrast to other research that recorded their dataset with high grade equipment and noise reduction algorithms. To classify the audio files, the machine learning algorithm that was used is called the random forest algorithm. This algorithm was successful because the features produced were largely separable in relatively few dimensions. Classification accuracies were in the 92%-97% depending on the dataset.

Keywords/Search Tags:

Feature, Classification, Spoken

Related items

1	Robust Spoken Language Understanding Across Domains And Languages
2	Research On Lattice Based Spoken Document Retrieval
3	Spoken Keyword Spotting Method And System Design Based On CRNN-CTC
4	Study On Emotion Recognition For Spoken And Written Language Considering Physiological And Behavioral Traits
5	Dialog Act Classification In Chinese Spoken Language And Its Application Under The Internet
6	Research On Chinese Spoken Term Detection Based On Deep Learning
7	Research Of Spoken Language Understanding Method Based On Deep Neural Network
8	Dependency Parsing Of Spoken Chinese Based On Graph-based Model
9	Research On Spoken Term Detection Technology In Continuous Speech Based On Sample Template
10	Topic Classification Of Spoken Document Based On LSH