
Quantitative Syntactic Research Based On The Mongolian Academic Corpus

Posted on: 2023-05-31
Degree: Doctor
Type: Dissertation
Country: China
Candidate: Y T Wu
GTID: 1525306788494674
Subject: Chinese Language and Literature
Abstract/Summary:
In the era of big data, corpus-based language research has become a research hotspot, and corpus-based syntactic research is an important part of natural language research. As a knowledge base that provides linguistic information, a corpus plays an important role in natural language processing. As the theories and methods of different disciplines have come to influence one another, mathematical methods such as probability theory and statistics have been applied to language research, forming a new branch, quantitative linguistics. Quantitative linguistics takes the language material produced in real language activity as its research object and uses quantitative methods to explore the structural patterns and evolutionary laws of language. Based on the modern Mongolian Academic Corpus, this dissertation uses quantitative methods to study the syntactic features of Mongolian academic language, including sentence length, dependency features, the syntactic functions of parts of speech, and syntactic network features. It attempts to answer the following questions: (1) What are the syntactic features and rules of Mongolian academic language? (2) What is the distribution of syntactic functions of each part of speech in modern Mongolian academic language? (3) Do syntactic features differ between different styles of Mongolian? (4) What are the features of the syntactic network of modern Mongolian academic language?

The dissertation consists of six chapters. The first chapter introduces the significance of the research, the literature review, the theory and methods, and the content and steps of the research. The second chapter introduces the design, creation and processing of the modern Mongolian academic corpus. It also presents statistics on vocabulary, parts of speech and sentence patterns, and summarizes the features and rules of Mongolian academic language in these three respects. The third chapter analyzes the syntactic features and rules of Mongolian academic language using the following indicators from the corpus: average sentence length, sentence length distribution, dependency types and their distribution, dependency distance and its probability distribution, dependency direction, and the syntactic functions of parts of speech. The fourth chapter compares syntactic features across Mongolian dependency treebanks of different styles. It examines stylistic differences in average sentence length, sentence length distribution, dependency types and their distribution, dependency distance and its probability distribution, dependency direction, and the functions of parts of speech, in order to highlight the features and rules of Mongolian academic language. The fifth chapter uses complex network methods to construct the syntactic network of modern Mongolian academic language from the dependency treebank, and analyzes the network's basic parameters and complex network characteristics. The sixth chapter summarizes the research and discusses follow-up work.

This dissertation creates a modern Mongolian academic corpus, which expands the scope of research based on Mongolian corpora. The results obtained by quantitative methods provide objective data to support theoretical conclusions and thereby promote the further development of language research.
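
Several of the indicators named in the abstract can be computed directly from a dependency treebank. The following is a minimal sketch in Python, not the dissertation's own procedure. It assumes the common definition of dependency distance as the absolute difference between the linear positions of a dependent and its head, and it treats a dependency as head-final when the head follows the dependent. The input format, the function names, and the tokens `w1` to `w4` are hypothetical placeholders and do not come from the Mongolian Academic Corpus.

```python
from collections import Counter

# Each sentence is a list of (word, head) pairs. Positions are 1-based and
# head == 0 marks the root. The tokens are placeholders, not corpus data.
toy = [
    [("w1", 2), ("w2", 3), ("w3", 0)],
    [("w1", 3), ("w4", 3), ("w3", 0)],
]

def signed_distances(sentence):
    """Head position minus dependent position for every non-root token."""
    return [head - pos
            for pos, (_, head) in enumerate(sentence, start=1)
            if head != 0]

def mean_dependency_distance(sentences):
    """Mean absolute dependency distance over all non-root dependencies."""
    dists = [abs(d) for s in sentences for d in signed_distances(s)]
    return sum(dists) / len(dists)

def head_final_ratio(sentences):
    """Share of dependencies whose head follows the dependent."""
    dists = [d for s in sentences for d in signed_distances(s)]
    return sum(d > 0 for d in dists) / len(dists)

def network_degrees(sentences):
    """Degree of each word type in an undirected syntactic network.

    Nodes are word types; an edge links a dependent to its head.
    Repeated edges are counted once and self-loops are skipped.
    """
    edges = set()
    for s in sentences:
        for word, head in s:
            if head != 0:
                head_word = s[head - 1][0]
                if head_word != word:
                    edges.add(frozenset((word, head_word)))
    degrees = Counter()
    for edge in edges:
        for node in edge:
            degrees[node] += 1
    return degrees

print(mean_dependency_distance(toy))  # 1.25
print(head_final_ratio(toy))          # 1.0
print(network_degrees(toy)["w3"])     # 3
```

On the toy input the four dependencies have distances 1, 1, 2 and 1, so the mean is 1.25. A real study would read the sentences from an annotated treebank file and might define the network over lemmas or weighted edges instead of word forms.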
Keywords/Search Tags:Mongolian academic style, dependency treebank, syntactic features, complex network, quantitative research