Research On Semantic Similarity Calculation Of Linked Data

Posted on:2015-01-03

Degree:Master

Type:Thesis

Country:China

Candidate:L M Jia

Full Text:PDF

GTID:2298330431495510

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

As the semantic web technology matures, the network resource environmentdevelops in the direction of intelligence and structuration. In current network resourceenvironment, the most of the information in this network is present in the form thatpeople can understand, not to be understood and processed by computer. Thisimpedes the development of network resource environment. The proposing ofsemantic web can solve these problems well. By adding the formal semanticinformation to the documents on the World Wide Web, semantic web can make thecomputer understand and deal with these documents to realize the automation of dataprocessing, and improve the efficiency of information searching.The appearance of linked data brings the substantial development of semanticweb and is recommended as the best practice of semantic web by W3C. By thestructural description and links between the data, Linked data is associated with thedata of scattered field to form the global giant data space that is data network. Thisdata network offers the protection to the maximum share, reusing and release of data,and offers the new opportunity to the knowledge discovery. With the increasingnumber of linked data in the data network, how to use the characteristics of linkeddata to carry on the knowledge discovery become the key issue to be solved in thecurrent study.This paper mainly does research into the semantic similarity of knowledgediscovery research. By researching and analyzing the existing semantic similaritycalculation methods about RDF data, the semantic similarity calculation method oflinked data based on multiple factors is presented. This method analyzed emphaticallythe three important factors influencing the similarity calculation, such as types ofattribute value, attribute weight and correlation, and for each of influence factorscorresponding similarity calculation formula is given. Finally the method is verifiedthrough specific instances. The experimental results show that, this method this paperproposed makes full use of the semantic information between concepts, and calculation results can better reveal the similarity relation between concepts in linkeddata.In the different application, the importance of attributes is different, and theweights of attributes can be changed. This paper carries on further researches aboutthe calculation problem of attributes weights, and presents a semantic similaritycalculation method of linked data based on dynamic weight. This method improvesthe semantic similarity calculation method proposed by Song D based on the Tverskycalculation model. First, this method dynamically computes the attribute weightaccording to quantity of different attribute values, distribution of attribute values, andvalidity of attribute. This method can distinguish the instances well that methodproposed by Song D and Tversky calculation model cannot distinguish, and improvethe accuracy of semantic similarity calculation about Linked data. Finally, in order toverify the validity and stability of this method proposed in this paper, ACM andFOAF testing datasets are used to experiment. Comparing to existing sentencesimilarity computing methods, the result of experiment shows that the methodproposed in this paper is superior to other similarity calculation methods in thevalidity and stability.

Keywords/Search Tags:

Semantic web, Linked data, Semantic similarity, Instance attributes, Dynamic weight

PDF Full Text Request

Related items

1	Linked Data Based Semantic Query System Of Traditional Chinese Medicine
2	Research On Semantic Similarity Computation Method Of Linked Data Based On Multi-granularity
3	Enterprise Data Environment Construction Combined With Semantic Linked Open Data
4	Research On Linked Data Model Based On The Semantic Similarity Analysis
5	Research On Semantic Similarity Measure Method For RDF Graphs
6	Research And Implementation Of Semantic Similarity Parallel Calculation Based On Association Of Tourism Data
7	Research On Ontology-Based Semantic Text Categorization
8	Research On Semantic Integration Of Digital Resources At Libraries Using Linked Data
9	Semantic Interconnection Pattern Based On Linked Data And Its Application In Organization Of Medical Scientific Research Information
10	A Pattern-based Approach For Extracting Linked Data From Chinese Government Documents