
Research On Extraction Methods Of Kazakh Common-used Words And Investigation Of Elementary School Textbooks' Words

Posted on: 2013-01-12
Degree: Master
Type: Thesis
Country: China
Candidate: Y L Wang
GTID: 2218330374466473
Subject: Computer application technology
Abstract/Summary:
Research on commonly used words in Kazakh plays an important role in language standardization, the compilation of structured dictionaries, Kazakh-language teaching in national elementary and middle schools, the teaching of Kazakh as a second language, and natural language information processing. Elementary school textbooks are a basic resource for Kazakh teaching, so a survey of the words they use is significant for teaching and bears directly on the effectiveness of Kazakh language teaching.

This thesis defines Kazakh commonly used words and analyzes their three basic features: field generality, regional generality and time generality. The corresponding quantitative indexes, "field general usage", "regional general usage" and "time general usage", measure how general a word is. An analysis of the traditional calculation of field general usage reveals some defects, and mathematical methods are used to improve the calculation method for Kazakh vocabulary. The improved field general usage formula is then used to calculate the lexical general usage of Kazakh commonly used words, so that the improved method has a greater influence on their ranking position. Statistical methods are used to investigate the general usage of Kazakh words: the statistics of lexical general usage are built on frequency statistics of Kazakh words. An automatic extraction system for Kazakh commonly used words is designed and implemented.

As objects of investigation, this thesis selects the two sets of elementary school Kazakh textbooks currently in general use for nine-year compulsory education in the Xinjiang Uygur Autonomous Region: the "nine-year compulsory education new curriculum standard experimental teaching books for ordinary classes" (referred to as the "ordinary class edition") and the "nine-year compulsory education new curriculum standard experimental teaching books for bilingual classes" (referred to as the "bilingual class edition"). It investigates word frequency, word stems and word suffixes in the two sets of textbooks, and makes a contrastive analysis of word usage in the two sets volume by volume.

Experimental results show that, in extracting Kazakh commonly used words, the improved calculation formula has a stronger influence on word ranking position than the traditional one, and that the calculation method for lexical general usage is more scientific and effective. The statistical investigation shows that the two sets of textbooks differ considerably in their use of words, because they are aimed at different learners and have different emphases.
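The abstract does not state the formula for field general usage, so the following is only a minimal sketch of the general idea: score a word by combining its frequency with how evenly it is spread across fields. It assumes a Juilland-style usage coefficient (mean relative frequency multiplied by a dispersion term) as a stand-in, not the thesis's traditional or improved formula. The function names, the field labels and the single-letter tokens (placeholders for segmented Kazakh words) are all hypothetical.

```python
import math
from collections import Counter


def relative_frequencies(tokens):
    """Map each word to its share of the tokens in one field's sub-corpus."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def general_usage(word, field_freqs):
    """Juilland-style usage score: dispersion across fields times mean frequency.

    Assumption: a stand-in measure for illustration, not the thesis's formula.
    """
    k = len(field_freqs)
    if k < 2:
        raise ValueError("need at least two fields to measure dispersion")
    freqs = [f.get(word, 0.0) for f in field_freqs]
    mean = sum(freqs) / k
    if mean == 0:
        return 0.0
    # Population standard deviation of the word's per-field frequencies.
    std = math.sqrt(sum((f - mean) ** 2 for f in freqs) / k)
    # Dispersion is 1 for a perfectly even spread and 0 when the word
    # occurs in a single field only; clamp tiny negative rounding errors.
    dispersion = max(0.0, 1 - (std / mean) / math.sqrt(k - 1))
    return dispersion * mean


def rank_by_general_usage(fields):
    """Return (word, score) pairs, highest score first, ties broken by word."""
    field_freqs = [relative_frequencies(tokens) for tokens in fields.values()]
    vocabulary = set().union(*field_freqs)
    scores = {w: general_usage(w, field_freqs) for w in vocabulary}
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical toy corpus: three fields with placeholder tokens.
FIELDS = {
    "news": ["a", "b", "a", "c"],
    "literature": ["a", "c", "c", "d"],
    "science": ["a", "e", "e", "e"],
}
ranking = rank_by_general_usage(FIELDS)
```

In this toy corpus the word "a" occurs in every field and ranks first, while "e" is frequent but confined to one field and scores close to zero. That difference between raw frequency and general usage is what a dispersion-aware index is meant to capture; regional and time general usage could be sketched the same way by partitioning the corpus by region or period instead of field.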
Keywords/Search Tags: Kazakh, Common-used words, Lexical general usage, Textbooks, Words Investigation