The Research Of Keyword Queries Over Relational Databases Based On Team

Posted on:2013-08-25

Degree:Master

Type:Thesis

Country:China

Candidate:Y Tao

GTID:2248330395450937

Subject:Computer software and theory

Abstract/Summary:

Relational databases store structured data, but a user who wants to query one must know SQL and understand the database schema. This is a high barrier for ordinary users. Keyword queries, by contrast, are widely used to retrieve information on the Internet; they are simple to use and popular with non-expert users. As a result, keyword query over relational databases has attracted considerable interest in recent years.

Most existing work aims to return individual tuples as answers to a query. After analyzing this work, we observe that answering a query with a combination of several tuples can be more valuable to users. This thesis therefore introduces the concept of team-based keyword queries over relational databases, where a "team" is a combination of several tuples.

We use an inverted index to solve the problem. We first devise a direct solution: enumerate all combinations of tuples drawn from the inverted lists of the query keywords, remove the teams that do not satisfy the constraints given in the user input, and finally remove redundancy from the results. This Naive algorithm is inefficient because it searches the entire solution space, and in some cases the computational cost is so high that the results cannot be computed at all.

To address this inefficiency, we analyze the process in depth, make several key observations, and propose several pruning strategies: removing tuples that contain all of the query keywords, applying the constraints earlier, and keeping the optimal candidate. In integrating these strategies we reduce the amount of computation as far as possible, obtaining a heuristic pruning algorithm. The pruning algorithm no longer searches the entire solution space; it searches only part of it at the cost of some additional checks, and thereby reduces the computational cost.

We also add a mechanism for ranking the results. Several properties are identified and used to rank the results in a reasonable way, so that they are more acceptable to users. An empirical evaluation on both real and synthetic data sets verifies that the pruning algorithm performs well in most cases.
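The abstract does not give the algorithms in detail, so the following Python sketch is only one possible reading of the Naive approach and of two of the three pruning strategies. Everything concrete in it is an assumption made for illustration: the function names, the whitespace tokenizer, the use of a maximum team size (`max_size`) as the user constraint, and the interpretation of "redundancy" as a team that strictly contains another valid team. The "keep the optimal candidate" strategy and the ranking mechanism are not modeled.

```python
from itertools import product


def build_inverted_index(tuples):
    """Map each keyword to the sorted ids of the tuples that contain it."""
    index = {}
    for tid, text in tuples.items():
        for word in set(text.lower().split()):
            index.setdefault(word, []).append(tid)
    return {word: sorted(ids) for word, ids in index.items()}


def drop_redundant(teams):
    """Assumed redundancy rule: drop teams that strictly contain another team."""
    minimal = [t for t in teams if not any(other < t for other in teams)]
    return sorted(sorted(t) for t in minimal)


def naive_team_query(index, keywords, max_size):
    """Enumerate every combination first, then filter, then de-duplicate."""
    lists = [index.get(k.lower(), []) for k in keywords]
    # Pick one tuple per keyword; a tuple picked several times counts once.
    teams = {frozenset(combo) for combo in product(*lists)}
    # The (hypothetical) size constraint is applied only after enumeration.
    teams = {t for t in teams if len(t) <= max_size}
    return drop_redundant(teams)


def pruned_team_query(index, keywords, max_size):
    """Same answers as the naive version, but with a smaller search."""
    lists = [index.get(k.lower(), []) for k in keywords]
    if max_size < 1 or any(not ids for ids in lists):
        return []
    # Strategy 1: a tuple holding every keyword is an answer by itself, so
    # it is set aside and removed from the lists before combining.
    full = set(lists[0]).intersection(*lists[1:])
    results = {frozenset([t]) for t in full}
    lists = [[t for t in ids if t not in full] for ids in lists]

    def extend(i, team):
        # Strategy 2: check the constraint on partial teams, so oversized
        # branches are abandoned before they are fully enumerated.
        if len(team) > max_size:
            return
        if i == len(lists):
            results.add(team)
            return
        for t in lists[i]:
            extend(i + 1, team | {t})

    extend(0, frozenset())
    return drop_redundant(results)


# Toy relation: tuple id -> text content (made-up data).
rows = {1: "database index", 2: "keyword search", 3: "keyword database", 4: "index"}
idx = build_inverted_index(rows)
print(naive_team_query(idx, ["keyword", "database"], max_size=2))
print(pruned_team_query(idx, ["keyword", "database"], max_size=2))
```

On this toy data both functions return the same teams: tuple 3 alone, and the pair of tuples 1 and 2. The pruned version simply reaches them without first materializing the full Cartesian product of the inverted lists.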

Keywords/Search Tags:

combination of tuples, team query, inverted lists, keyword query, relational databases

Related items

1	Research On Update And Query Optimization In Probabilistic Relational Databases With Integrity Constraints
2	Research On Keyword Query Over Relational Databases
3	Processing Recommender Top-N Queries In Relational Databases
4	A Study On Performance Optimization Of Keyword-Query Over Relational Databases
5	Scalable Query Technology Over Probabilistic Databases
6	Research And Application On Keyword Searches Of Relational Databases For Agricultural Information
7	Study On Keywords-Based Approximate Search Techniques On Relational Databases
8	Research On Query Interface Technologies To Relational Databases
9	Research On Technologies Of Keyword Search Over Relational Databases
10	Top-k Query For Relational Databases With Conditional Preferences