Research On Loss Function Of Learning To Rank

Posted on:2012-05-18

Degree:Master

Type:Thesis

Country:China

Candidate:J J Wu

Full Text:PDF

GTID:2218330368487986

Subject:Computer application technology

Abstract/Summary:

With the rapid growth of information, information retrieval technology is becoming crucially important. Traditional information retrieval models fall into two main categories: query-dependent models based on content, such as TF-IDF, probabilistic models, and language models; and query-independent models based on link analysis, such as PageRank and HITS. Ranking is at the core of both categories. Learning to rank makes it easy to combine different information retrieval models into a single, stronger model, which improves performance.

Learning to rank lies at the intersection of information retrieval and machine learning. Depending on how they treat sets of ratings, learning to rank methods fall into three groups: pointwise, pairwise, and listwise approaches. The loss function measures the loss incurred by the ranking function during training, and thus plays a pivotal role in learning to rank.

This thesis studies the loss function of learning to rank. First, it analyzes the weakness of relying exclusively on one kind of input instance (i.e., instances sampled from a single input space). The pairwise approach is used as an example to illustrate the improvement: the pointwise loss is incorporated into the pairwise loss function to enrich the objective, so that the real loss during training is measured better. Second, a new listwise approach based on query-level regression is proposed; its ranking function is modeled by a neural network and optimized by gradient descent. Finally, a framework for incorporating loss functions is proposed and tested, and three weighting schemes for the incorporation are given and tested on top of it. The three methods, which use different merging strategies, are then compared with pointwise, pairwise, and listwise approaches, as well as with other similar approaches that also take multiple input instances into account. This offers a new idea for improving learning to rank approaches.

Experimental results on LETOR show that the approaches proposed in this thesis outperform the existing learning to rank methods.
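
The idea of enriching a pairwise objective with a pointwise term can be illustrated with a minimal sketch. The abstract does not state which losses or weights the thesis uses, so everything below is an assumption made for illustration: squared error as the pointwise loss, a logistic loss on score differences as the pairwise loss, and a single fixed weight `alpha` (a hypothetical parameter) that mixes the two.

```python
import math


def pointwise_loss(scores, labels):
    """Mean squared error between predicted scores and relevance labels."""
    return sum((s - y) ** 2 for s, y in zip(scores, labels)) / len(scores)


def pairwise_loss(scores, labels):
    """Mean logistic loss over ordered pairs (i, j) where label i > label j.

    The loss is small when the more relevant document gets the higher score.
    """
    total, count = 0.0, 0
    for i in range(len(scores)):
        for j in range(len(scores)):
            if labels[i] > labels[j]:
                total += math.log1p(math.exp(-(scores[i] - scores[j])))
                count += 1
    return total / count if count else 0.0


def combined_loss(scores, labels, alpha=0.5):
    """Weighted sum of the two losses for one query.

    alpha is an assumed fixed mixing weight: alpha=1 gives the pure
    pointwise loss, alpha=0 the pure pairwise loss.
    """
    return (alpha * pointwise_loss(scores, labels)
            + (1.0 - alpha) * pairwise_loss(scores, labels))


if __name__ == "__main__":
    labels = [2, 1, 0]          # relevance labels for one query
    good = [2.1, 0.9, 0.2]      # scores that respect the label order
    bad = [0.2, 0.9, 2.1]       # scores in reversed order
    print(combined_loss(good, labels), combined_loss(bad, labels))
```

With a fixed `alpha`, scores that both match the labels and preserve their order are penalized least. The three weighting schemes mentioned in the abstract would correspond to different ways of choosing this mixing weight, which this sketch does not attempt to reproduce.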

Keywords/Search Tags:

Information Retrieval, Learning to Rank, Loss Function, Gradient Descent, Query-Level Regression, Incorporation

Related items

1	Research Of Learning To Rank In Information Retrieval
2	A Research Of Stochastic Gradient Descent Algorithm
3	Imbalanced Stochastic Gradient Descent Online Algorithm For SVM
4	The Research And Application Of Stochastic Gradient Descent And Dual Coordinate Descent Algorithm
5	Researches On Information Retrieval Model Based On The Algorithm Of Learning To Rank
6	Application Of Gradient Descent Method In Machine Learning
7	Research On Support Vector Machine Based On Improved Loss Function
8	Study On Learning To Rank And Query Reformulation Based Information Retrieval Model
9	A regression framework for learning to rank in web information retrieval
10	A Ranking Algorithm ListNet Based On Stochastic Gradient Descent