Research Of Word Similarity Computing In Ontology Automatic Generation

Posted on:2009-11-25

Degree:Master

Type:Thesis

Country:China

Candidate:F H Zhang

GTID:2178360248454780

Subject:Computer application technology

Abstract/Summary:

With the rapid development of network technology, people can obtain more and more information resources from every domain of society over the network. How to retrieve valuable information from the web quickly and accurately has become a hot issue at home and abroad. The similarity of vocabulary terms is an important question in information retrieval, and term similarity calculation has a wide range of applications in machine translation, automatic response systems, word sense disambiguation, natural language processing, and other fields. With the emergence of the semantic web, word similarity computing also plays an important role in ontology integration and semantic information retrieval. Against this background, this thesis first introduces the theory and concepts of ontology, and then analyzes the current state of research on word similarity and several representative word similarity algorithms.

The thesis presents an improved similarity calculation method based on word vectors. A dictionary consists of a series of words together with their explanatory notes. Certain designated words act as key words. Each key word is explained by its explanatory note, and the words in that note are in turn explained by their own explanatory notes, so a hierarchical vocabulary network is formed. The probability with which each word appears in the corresponding explanatory note is then calculated, and these probabilities are saved to a matrix file (the frequency matrix). Word vectors are calculated through the formula (I - aA)C = (1 - a)A. The similarity of two words is defined as the probability that one key word is estimated from the explanatory note of another key word; it expresses how closely a key word represents the explanatory note of the other. The greater the value, the closer the two words, and vice versa. The words are taken from the word vector file, and the resulting similarity values are finally stored as vector files. The method is verified on sample data, and the experimental results show that this similarity calculation is feasible. It lays a foundation for word classification.
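
The calculation described above can be sketched in Python on a toy example. This is a minimal sketch and not the thesis's implementation: the abstract does not define the symbols, so the code assumes that A is the frequency matrix (row i holds the relative frequency of each word in the explanatory note of word i), that a is a weight between 0 and 1, and that the rows of C are the word vectors. The toy dictionary, the value a = 0.5, the function names, and reading the similarity of two words directly from an entry of C are all hypothetical choices made for illustration.

```python
from collections import Counter

# Hypothetical toy dictionary: each headword maps to the words of its
# explanatory note. Every word used in a note is itself a headword, so the
# vocabulary network is closed.
TOY_DICTIONARY = {
    "car": ["vehicle", "road", "vehicle"],
    "truck": ["vehicle", "road", "cargo"],
    "vehicle": ["road", "cargo"],
    "road": ["vehicle"],
    "cargo": ["truck", "road"],
}


def frequency_matrix(dictionary):
    """Return (words, A); A[i][j] is the relative frequency of word j
    in the explanatory note of word i (assumed meaning of A)."""
    words = sorted(dictionary)
    index = {w: k for k, w in enumerate(words)}
    A = [[0.0] * len(words) for _ in words]
    for head, note in dictionary.items():
        for w, n in Counter(note).items():
            A[index[head]][index[w]] = n / len(note)
    return words, A


def matmul(X, Y):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]


def word_vectors(A, a=0.5, iterations=200):
    """Solve (I - aA)C = (1 - a)A by iterating the equivalent form
    C = (1 - a)A + aAC, which converges for 0 < a < 1 when the rows
    of A sum to 1."""
    n = len(A)
    base = [[(1 - a) * v for v in row] for row in A]
    C = [row[:] for row in base]
    for _ in range(iterations):
        AC = matmul(A, C)
        C = [[base[i][j] + a * AC[i][j] for j in range(n)] for i in range(n)]
    return C


words, A = frequency_matrix(TOY_DICTIONARY)
C = word_vectors(A)


def similarity(w1, w2):
    """Assumed reading of the abstract: the entry of C linking w1 to w2."""
    return C[words.index(w1)][words.index(w2)]


print(similarity("car", "vehicle"), similarity("car", "truck"))
```

Under these assumptions each row of C sums to 1, so an entry such as similarity("car", "vehicle") can be read as a probability. For a larger vocabulary the same linear system could be solved directly, for example with numpy.linalg.solve, instead of by iteration.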

Keywords/Search Tags:

ontology, word vector, word similarity

Related items

1	Study On Multi-sense Word Vector And Semantic Similarity
2	Research On Key Techniques Of Cross-Language Text Similarity Detection Based On Word Vector
3	Research On Ontology Matching Based On Word Embedding And Structural Similarity
4	Algorithm Study On Based-Feature Word Similarity In Ontology
5	Constructing Ontology For Unstructured Chinese Text
6	Semantic Similarity Measurement Of Short Text By Convolutional Neural Network Based On Multi-Dimensional Attention On Word Vector
7	The Research Of Micro-Blog New Emotion Words Recognition And Orientation Judgment Based On Word2Vec
8	Research On Text Similarity Algorithm Based On VSM Combined With Word Semantics
9	Computation Of Word Similarity And Its Application In Question Answering System
10	The Research On Measuring Text Similarity Based On Word Vector Enhanced Tree Kernel Model