Efficient processing of complex features for information retrieval

Posted on:2009-11-21

Degree:Ph.D

Type:Dissertation

University:University of Massachusetts Amherst

Candidate:Strohman, Trevor

Full Text:PDF

GTID:1448390002492182

Subject:Computer Science

Abstract/Summary:

Text search systems research has primarily focused on simple occurrences of query terms within documents to compute document relevance scores. However, recent research shows that additional document features are crucial for improving retrieval effectiveness.;We develop a series of techniques for efficiently processing queries with feature-based models. Our TupleFlow framework, an extension of MapReduce, provides a basis for custom binned indexes, which efficiently store feature data. Our work in binning probabilities shows how to effectively map language model probabilities into the space of small positive integers, which helps improve speeds without reducing query effectiveness. We also show new efficient query processing results for both document-sorted and score-sorted indexes. All of our work is evaluated using the largest available research dataset.

Keywords/Search Tags:

Processing

Related items

1	Study Of Digital Image Processing Circuits Based On Multi-Processing Units
2	Research On Big Data Processing System Based On MapReduce Parallel Processing Framework
3	Research And Application Of Parallel Processing Technology In Radar Signal Processing
4	Implementation Of Advertisement Detection System For Stream Processing
5	The Research Of Distributed RDF Data Processing Architecture
6	Millimeter-wave Man-radar Information Processing System
7	The Design And Implementation Of Graph Processing Middleware On Infosphere Streams
8	Design And Implementation Of Signal Processing System Of Multifunctional Low-altitude Three-dimensional Radar
9	Research On Optronic Radar Signal Processing
10	Parallel Signal Processing And Lts Lmplementation For Radar Systems