G-hash: Towards fast kernel-based similarity search in large graph databases

Posted on:2010-05-19

Degree:M.S

Type:Thesis

University:University of Kansas

Candidate:Wang, Xiaohong

Full Text:PDF

GTID:2448390002487710

Subject:Computer Science

Abstract/Summary:

Structured data such as graphs and networks have posed significant challenges to fundamental aspects of data management including efficient storage, indexing, and similarity search. With the fast accumulation of graph databases, similarity search in graph databases has emerged as an important research topic. Graph similarity search has applications in a wide range of domains including cheminformatics, bioinformatics, sensor network management, social network management, and XML documents, among others.;Our objective in this thesis is to enable fast similarity search in large graph databases with graph kernel functions. In particular, we propose to develop (i) a novel kernel-based similarity measurement and (ii) an efficient indexing structure for graph data management. In our method we use a hash table to support efficient storage and fast search of the extracted local features from graph data. Using the hash table, we have developed a graph kernel function to capture the intrinsic similarity of graphs and for fast similarity query processing. We have demonstrated the utility of the proposed methods using large chemical structure graph databases.

Keywords/Search Tags:

Related items

1	Research On Node Similarity Measurement Method For Large Scale Dynamic Graph
2	Similarity Nodes Query Processing Approach In The Evolution Process Of Large Dynamic Graph
3	Research On Large Scale Graph Analytic System For Supporting Fast Random Walks
4	Design And Implementation Of RDF Graph Management Tool
5	Similarity Top-k Query For Large-scale Dynamic Graph
6	Research On Distributed Storage And Retrieval Technology Of Large-scale Knowledge Graph
7	Research On Fast Graph Clustering Algorithm On Large-Scale Data
8	Research And Implementation Of Virtual Machine Management System For Large Scale Graph Processing
9	Research On Large Graph Aggregation Algorithm Based On Finite Memory
10	Mining, indexing and similarity search in large graph data sets