Automatic Construction Method For Domain Concepts Based On Wikipedia Semantic Knowledge Base

Posted on:2018-08-27

Degree:Master

Type:Thesis

Country:China

Candidate:Q Y Zhang

Full Text:PDF

GTID:2348330512996460

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

With the development of network teaching mode,which is represented by the large open network course MOOC(massive open online courses),In order to meet the needs of learners fine-grained,high quality of learning resource retrieval requirements and use of information extraction method to construction a dynamic update for the machine algorithm using the domain concepts of semantic knowledge base,to support the MOOC system of learning resources is of great significance according to the semantic retrieval.There are some defects in existing semantic knowledge base including static and limited,which are difficult to meet the needs of large-scale network text information retrieval.Therefore,we studied the methods of automatically constructing and dynamic updating semantic knowledge base,which are as follows:(1)The method of combination about LDA topic model and TF-IDF algorithm is proposed better resolve these problems,which are extracting keywords not accurately representing the semantics of the concepts and not obviously be distinguished use information extraction method.We make full use of the advantages of LDA topic model to extract keywords semantic representation of comprehensive,TF-IDF algorithm to extract the keywords weight value distinguish obvious.Experiments show that this method can significantly improve effect than others.(2)To resolve problem that the semantic representation of concept knowledge base is not comprehensive,we use the keywords to represent the semantics of the concept itself,by hierarchy relationships of concepts in Wikipedia,link relationships between concepts and explanatory text of other concepts and link relationships between explanatory texts of different concepts to build concept semantic knowledge base.Therefore,link relationships between concepts and the semantic relationship of the concept itself are both used to construct concept semantic knowledge base,which are more comprehensive.(3)Graph’s random walk algorithm is used to compute the semantic similarity between concepts in order to resolve the problem of inaccurate computation the semantic similarity between concepts.We make full use of the advantages of the random walk algorithm to achieve the stable value after several iterations of the probability value.The accuracy of this experiment can achieve more than 84%.The validity of using the random walk algorithm to compute the semantic similarity of the knowledge base is verified.(4)To resolve the problem that the concept node of semantic knowledge base scale is relatively small and need to be updated dynamically,we make full use of the advantages of Word2vec’s Skip-gram model,which can skip some symbols.The dynamic update of knowledge base is achieved by acquiring a synonym to expand the number of nodes.

Keywords/Search Tags:

Semantic knowledge base, LDA topic modle, Semantic similarity, Random walk, Distributed Word Embedding

PDF Full Text Request

Related items

1	Research On WS-LDA Topic Model Based On Word Embedding And Semantic Similarity
2	Research On Lexical Semantic Similarity Measurement Based On Knowledge Integration
3	Trend Analysis Of Network Popular Words By Using Semantic Knowledge
4	The Calculation Method Of User Advisory Text Similarity Research
5	Research On Semantic Cue Discovery Based On Random Walking In Open Domain Knowledge Network
6	Research On Chinese Word Semantic Similarity Computation
7	Improved Text Topic Representation And Learning Method
8	Research On Short Text Topic Information Mining Technology
9	Research Of Collective Entity Linking Based On Joint Embedding Of Word And Entity
10	Research Of Word Semantic Similarity Based On Domain Knowledge