Algorithms For Building Compact Representatives And Processing Ranking Querie

Posted on:2018-04-05

Degree:Ph.D

Type:Dissertation

University:The University of Texas at Arlington

Candidate:Asudeh Naee, Abolfazl

Full Text:PDF

GTID:1448390002496133

Subject:Computer Science

Abstract/Summary:

PDF Full Text Request

Ranked retrieval model has rapidly replaced the traditional Boolean retrieval model as the de facto way for query processing when a large portion of (big) data matches a given query. Returning all the query results in these cases is not efficient nor informative. Unlike the Boolean retrieval model, the ranked retrieval model orders the matching tuples according to an often proprietary ranking function and returns the top-k of them. In this dissertation, we study ranked retrieval model and propose exact and approximate algorithms for (i) building representatives for fast query processing, and (ii) online processing of ranking queries. We study the problem both in the general cases and in the special environment of web databases, a natural fit for the ranked retrieval model.;We start the dissertation by building representatives that serve as indices for ranking query processing. A critical observation is that skyline, also known as Pareto-optimal, (resp. k sky-band) is a set that contains the top-1 (resp. top-k) for every possible ranking function following the monotonic order of attribute values. Thus, first, we study the problem crowdsourcing Pareto-optimal object finding, in the case where objects do not have explicit attributes and preference relations on objects are strict partial orders. Then, we initiate the research into the novel problem of skyline discovery over hidden web databases, which enables a wide variety of innovative third-party applications over one or multiple web databases.;A major problem with the ranking queries representatives, i.e., skyline and convex hull, is that as in real-world applications the representative can be a significant portion of the data, its performance in the ranking query processing is greatly reduced. Thus, computing a subset limited to r tuples that minimize the user's dissatisfaction with the result from the limited set is of interest. We make several fundamental theoretical as well as practical advances in developing such a compact set.;Finally, considering the limitations of top-k indices, while focusing on the client-server databases, we propose query reranking third-party service that uses public interface of the database to enable the on-the-fly processing of ranking queries.

Keywords/Search Tags:

Processing, Ranking, Query, Retrieval model, Representatives, Building

PDF Full Text Request

Related items

1	Research And Application On Expansion Term Ranking Model For Query Understanding
2	Document Ranking Methods For Supporting Implicit Temporal Queries In Information Retrieval
3	Based On Xml Chinese Web-retrieval Model
4	Research Of Relevant Document Retrieval Technology For Question Answering System
5	Federated query processing using ontology structure and ranking in a service oriented environment
6	Research On Information Retrieval Ranking Optimization Methods
7	Research On Music Information Retrieval Technology Based On Content And Semantic
8	Research On Ranking And Query Expansion Based On Polysemy
9	The Research Of Skyline Query Processing In The Literature Searching And Ranking
10	Using Statistical Language Modeling For Ad Hoc Information Retrieval