Font Size: a A A

Robust Spoken Language Understanding Across Domains And Languages

Posted on:2008-12-23Degree:DoctorType:Dissertation
Country:ChinaCandidate:W L WuFull Text:PDF
GTID:1118360242975982Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Spoken dialogue interface has attracted extensive attention in both the research commu-nity and the commercial application due to its great theoretical and practical value. SpokenLanguage Understanding (SLU) is one of the key technologies for implementing spokendialogue systems.One challenge for spoken language understanding is the robustness problem since thespeech recognition error is inevitable and the spoken language is mostly grammatically in-correct or ill-formed. The other challenge is the portability issue. Currently, the developmentof spoken language systems relies often heavily on human effort, which is one of the mainbottlenecks for rapid development of spoken dialogue systems. For example, the linguisticexperts handcraft the semantic grammar for parsing. Therefore, the key issue is to reduce theneed for the manual works in the development of SLU systems and automate the whole pro-cess as much as possible, which helps to reduce the whole development cost and increasesthe portability of the spoken dialogue system.This dissertation proposes a robust and portable approach for spoken language under-standing. The advantage of the proposed approach is that it is mainly data-driven and requiresonly minimally annotated corpus for training while keeping the understanding robustness anddeepness of spoken language. The research works in this thesis include:This thesis proposes a novel spoken language understanding approach, which mainlyconsists of two successive classifiers. Firstly, the topic classifier is used to identifythe topic of an input utterance. With the restriction of the recognized target topic, thesemantic classifiers are trained to extract the corresponding slot-value pairs. The twokinds of classifiers can be automatically learned from minimally labelled training sen-tences. This SLU approach has good robustness for spoken language whilst keepingthe understanding deepness.A robust chart-based local parser is used to preprocess the input utterance to recognizethe concepts, which are relevant to the application domain. This robust local parser has the ability of skipping noise words or rule symbols ensuring that the SLU system hasthe low level robustness. To avoid the side-effect resulting from the skipping ability, amachine learning system is embedded into the parser for pruning. The preprocessingstep not only facilitates the labelling of training sentences but also reduces the numberof semantic slot classifiers.For the problem of topic classification, we investigate different kinds of features andcompare their corresponding performances. The strategy of combining diverse clas-sifiers is applied to improve the precision of topic classification. At the same time,the slot-filling task is also modelled as a classification problem so called semantic slotclassification. Initially, the literal context features are used for semantic slot classi-fication. Then, the consistency of the semantic slot in a sentence is checked. If theslots clash, the semantic slot re-classification is carried out to correct the misclassifiedslots. Two learning algorithms are employed for semantic slot classification, i.e., thedecision list and winnow algorithm.To further reduce the cost of labelling training utterances, weakly supervised learningtechniques are employed to train the topic and semantic classifiers. Firstly, the strategyof combining active learning and self-training is adopted to train the topic classifier.Secondly, a practical method is proposed for bootstrapping the topic-dependent seman-tic classifiers from a small amount of labelled sentences. The two weakly supervisedstrategies allow our SLU framework to begin with a small amount of labelled data andmake use of a larger amount of unlabelled data to improve the performance.The proposed SLU approach in this dissertation has been evaluated in different domainsand languages. The experimental results show that the performance of our system is betterthan the rule-based parser and comparable to the state-of-the-art data-driven SLU systems.Furthermore, our system requires less labelled data and hence significantly reduce the devel-opment cost.
Keywords/Search Tags:spoken dialogue system, spoken language understanding, topic classification, weakly supervised learning, active learning, bootstrapping
PDF Full Text Request
Related items