Font Size: a A A

A Research On The Extraction And Analysis Of The Newspaper Theme Words Group Based On The Dynamic Circulating Corpus

Posted on:2007-01-05Degree:DoctorType:Dissertation
Country:ChinaCandidate:Y L ShiFull Text:PDF
GTID:1115360185968406Subject:Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
Teaching Chinese to foreigners is a great undertaking for the Chinese nation. More and more foreigners come to China to acquire latest information from mainstream Newspapers and other media. This research was drove by the requirement of the teaching reform on Newspaper Reading Course in BLCU. This paper disserts how to build a Newspaper resource database and extract theme words group from it based on the large-scale Chinese mainstream Newspaper Dynamic Circulation Corpus, all the study is under the theory of Dynamic Updating of Language and Knowledge. First, we established a classified Newspaper resource database on the DCC corpus, and we got 19 domain word lists from natural language in the database. Then we extract the general words by making the 19 domain word list across together. The most important research is the extraction of theme words group by the means of making the vocabulary apart. The theme words group is delaminated into different layer — A domain theme words group; B subdomain theme words group; C hypogynous theme words group; and single text theme words group. In the course of the experiment, all the theme words are strongly reflect the feature of the domain, subdomain, hypogynous theme and single text. We can use these different layer feature words to measure the extent of the theme semantic relevancy, we also try to explore the way to weigh the degree of the text difficulty. The research of the theme words group is benefit to the Newspaper Reading Course in the actual teaching. It provides a scientific and applied research platform to the Teaching Chinese to foreigners, and also, it provides a new landscape to the vocabulary study.Research route:Newspaper resource database--general words lists-- the extraction of themewords group and relevant research-- theme-centered teachingThis paper focuses on the extraction of theme words group and relevant research as follows:1 Built a Newspaper resource database based on the large-scale Chinese mainstream Newspaper Dynamic Circulation CorpusDynamic information resource system is from the material process of the instruction. Dynamic information is another kind of education information, It is very significant for studying and teaching. The range of content is wide, and its representation is diversity. This resource database has a total of 170,633,995 characters, 33545 text files. It is fills up the blank of the research of the Teaching Chinese to foreigners.2 Built a Classed Newspaper teaching system based on the Newspaper resource database After study many authoritative classify system and several Newspaper teaching material, webuilt a layered classed Newspaper teaching frame. This frame contains 19 different domains,91 subdomains, 189 hypogynous themes, basically cover all the main domains in the Newspaper and press. It is benefit to the teaching on the Newspaper and other courses.3 Extract a Newspaper and press general words list from the 19 domain words lists...
Keywords/Search Tags:Newspaper resource database, the method of making vocabulary apart, general words lists, theme words group, theme semantic relevancy, theme-centered teaching pattern
PDF Full Text Request
Related items