| Code-switching is a phenomenon that inevitably arises from language contact and language influence,involving the use of different languages or language varieties.In recent years,researchers from various fields such as sociolinguistics,psycholinguistics,foreign language teaching,syntax,and pragmatics have conducted in-depth studies on the phenomenon of codeswitching.Taking the Ordos region as an example,this article establishes a Mongolian-Chinese code-switching corpus of spoken language and studies the types and distribution characteristics of code-switching in this region.Ordos is an area where multiple ethnic groups reside,with the Han and Mongolian ethnic groups being the two largest in terms of population.Therefore,Mongolian-Chinese code-switching is a common language phenomenon in the Ordos region.Through field investigations and online interviews,this article collected spoken language data from natural conversations between Mongolian-Chinese bilingual speakers in the Ordos region,transcribed and annotated the data,and constructed a MongolianChinese code-switching corpus.We also designed and implemented a corpus management application platform.This corpus and platform software will provide data resources and analysis systems for research on Mongolian-Chinese code-switching and related studies based on codeswitching.The article consists of five parts.The introduction mainly introduces the background and significance of this research,as well as the research status of code-switching and corpus construction.Chapter 1 describes the data collection plan and data sources for the Mongolian-Chinese codeswitching corpus of spoken language in the Ordos region.Chapter 2introduces the transcription and annotation standards for audio data and word class tagging,as well as the problems encountered in generating and processing text data and the solutions adopted.Chapter 3 designs and implements a corpus management and application system based on the B/S architecture using frontend and backend development tools such as nodejs,vscode,jdk,and idea,with the constructed audio database and text database as data resources.Chapter 4 uses Poplack’s code-switching classification method to study the types of Mongolian-Chinese code-switching in the Ordos region,and discusses the social distribution characteristics of Mongolian-Chinese code-switching in the Ordos region from several aspects,such as age of onset,gender,cultural level,and occupation. |