
Research On The Construction Of Deaf Primary School Text Corpus Based On Chinese Word Segmentation Technology

Posted on: 2020-08-12  Degree: Master  Type: Thesis
Country: China  Candidate: C Zhou
GTID: 2417330578453142  Subject: Education Technology
Abstract/Summary:
Reading is one of the main ways humans acquire knowledge and information, and it matters both for individual development and for the development of society as a whole. For people with hearing impairment, reading is one of the few available means of acquiring knowledge and social information. However, from elementary school onward, the reading ability of hearing-impaired students begins to lag behind that of their hearing peers. The reason is that, during reading, hearing-impaired students lack the speech processing of the content that hearing readers rely on. Text is merely a set of symbols to them, which makes it difficult to process the information in reading materials. Studies have shown, however, that adding sign language expressions to reading materials deepens hearing-impaired students' processing of those materials and thus improves their reading efficiency.
Building on this finding, this research proposes converting reading materials into sign language expression sequences to compensate for the lack of speech coding in hearing-impaired readers, and implements a text corpus system for deaf primary schools. The study explores the differences in grammar and word formation between Chinese and Chinese sign language, uses a conditional random field model to train a word segmentation model for the standard sign language vocabulary, and constructs a file system that supports retrieval, which together make up the text corpus system. The study finds that Chinese sign language is influenced by the Chinese language environment and shares many characteristics with Chinese. Nevertheless, Chinese sign language remains an independent language with its own word formation and sentence construction rules. The study therefore trains a word segmentation model, based on the conditional random field model, that reflects the characteristics of sign language vocabulary, and uses it as the main tool for converting text sentences into sign language sequences in the corpus system. To meet users' need for corpus retrieval, the study also builds a hash table based on the relationships between words and vocabulary and between words and sentences, so that the system can quickly locate a target sentence.
Given search conditions entered by the user, the resulting system quickly locates the target sentence in the text together with the sentences before and after it, matches the words in the sentence to sign language vocabulary through the word segmentation model, and finally outputs the corresponding sequence of sign language vocabulary. The study's novel contribution is to use a word segmentation tool to match Chinese sentences with sign language vocabulary in order to construct the corpus of primary school texts. This approach can reduce the cost of building a sign language corpus and makes later expansion of the corpus easier, thereby providing hearing-impaired students with more reading opportunities, which in turn helps them adapt quickly to written reading and improve their reading skills.
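The retrieval flow described in the abstract (segment a sentence, index words to sentences in a hash table, return the target sentence with its neighbours and a sign vocabulary sequence) can be illustrated with a minimal sketch. This is not the thesis implementation. The thesis trains a conditional random field segmenter, whereas the sketch below substitutes simple forward maximum matching against a dictionary so that it stays self-contained. The vocabulary, the `SIGN_*` labels, the sample sentences, and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical sign language vocabulary: Chinese word -> sign label.
SIGN_VOCAB = {
    "我": "SIGN_I",
    "去": "SIGN_GO",
    "喜欢": "SIGN_LIKE",
    "读书": "SIGN_READ",
    "学校": "SIGN_SCHOOL",
}


def segment(sentence, vocab, max_len=4):
    """Forward maximum matching: a dictionary stand-in for a trained CRF segmenter."""
    words, i = [], 0
    while i < len(sentence):
        # Try the longest candidate first; fall back to a single character.
        for size in range(min(max_len, len(sentence) - i), 0, -1):
            piece = sentence[i:i + size]
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def build_index(sentences, vocab):
    """Hash table mapping each word to the positions of sentences containing it."""
    index = defaultdict(list)
    for pos, sentence in enumerate(sentences):
        for word in set(segment(sentence, vocab)):
            index[word].append(pos)
    return index


def search(word, sentences, index, vocab):
    """Return each matching sentence, its neighbours, and its sign sequence."""
    results = []
    for pos in index.get(word, []):
        context = sentences[max(0, pos - 1):pos + 2]
        signs = [vocab.get(w, "UNKNOWN") for w in segment(sentences[pos], vocab)]
        results.append({"sentence": sentences[pos], "context": context, "signs": signs})
    return results


# Hypothetical sample text, one sentence per entry.
TEXT = ["我去学校", "我喜欢读书", "学校很大"]
INDEX = build_index(TEXT, SIGN_VOCAB)
```

Calling `search("喜欢", TEXT, INDEX, SIGN_VOCAB)` looks the word up in the hash table rather than scanning the text, which is the property the abstract relies on for fast location of the target sentence. Words with no entry in the vocabulary are marked `UNKNOWN` here; how the actual system handles such words is not stated in the abstract.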
Keywords/Search Tags:Chinese sign language, Chinese, corpus, Chinese word segmentation technology, Conditional random field model