Spoken Dialogue System For Service Robots And Research About Language Model

Posted on:2015-11-09

Degree:Master

Type:Thesis

Country:China

Candidate:Z Liu

Full Text:PDF

GTID:2298330428999878

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the development of speech recognition technology, it has been applied in various areas. For service robots, speech recognition technology is used mainly in spoken dialogue system. In this study, we designed a spoken dialogue system specific to "KeJia robot" and explored the implementation process of it. In addition, we also studied how to improve recurrent neural network language model using unsupervised word clustering.The research about spoken dialogue system for service robots mainly involves two parts:speech recognition and dialogue management. In terms of speech recognition, we first introduced the theory of speech recognition in details, and then the corpus we collected for the application of KeJia robot, we described the detail steps of training an acoustic model and did the testing and analysis on the corpus we collected, The experiments showed that context dependent triphone model has the highest word accuracy of98.39%and sentence accuracy of90.83%related to it. Devices on robot usually have a limited computing ability, so we took advantage of the states information that the robot provides to design the dynamic language model mechanism that can compress the search space when do speech recognition decoding. The experiments showed that the best performance of our dynamic language model based speech recognition module can get the highest sentence accuracy of87.95%, which is12.05%higher than the speech recognition module that don’t use dynamic language model. In terms of dialogue management, we used python to build the dialogue management framework based on hierarchical state machine. Then we introduced the Adding of Multi-model information and the Verification and validation mechanism. Finally we introduced the practical application of our dialogue management framework on the task of "cocktailparty" of "KeJia robot".In addition, we explored the application of word clustering on the recurrent neural network language model. Recurrent neural network language model has been proved to be one state of the art language model. Researchers found that adding part of speech tag information to the input layer of recurrent neural network language model can improve the model efficiently. However part of speech tag require hand-annotated data to train the tag model, which consumes a lot and the extra tagger also makes the model more complicated, we proposed adding the results of brown clustering, instead of part of speech tag information to the input layer of the recurrent network language model to solve the problem. In the Penn Treebank corpus, the relative improvement over the original recurrent neural network language model reaches8%-9%.

Keywords/Search Tags:

service robots, spoken dialogue system, speech recognition, dialoguemanagement, acoustic model, recurrent neural network languagemodel, brown clustering

PDF Full Text Request

Related items

1	Uyghur Speech Recognition Based On Deep Recurrent Neural Network
2	Research On Acoustic Modeling Of Speech Recognition Based On Recurrent Neural Network
3	A Study Of Key Technologies To Freely Spoken Mandarin Speech Evaluation
4	Acoustic Model Of Speech Recognition Based On Lightweight Neural Network And Its Application In Robot
5	Research And Realization On Spoken Dialogue System For Service Robot
6	Chinese Speech Keyword Detection Technology Research
7	Research On Dialogue Management Base On Real User Interaction
8	Research On Acoustic Modeling For Spontaneous Spoken Speech Recognition
9	Research And Implementation Of Speech Recognition Algorithm Based On Recurrent Neural Network
10	Structured Recurrent Neural Network And Its Applications In Automatic Speech Recognition