A Ranking Algorithm ListNet Based On Stochastic Gradient Descent

Posted on:2012-09-09

Degree:Master

Type:Thesis

Country:China

Candidate:Y H Zheng

Full Text:PDF

GTID:2218330362452228

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

With the rapid development of the Internet, search engine is more and more important. It's very important that how to effectively search useful information, a good search engine can help users save a lot of time. Search engine's core part is how to rank web page associated to inquires. Page ranking's stand or fall is becoming more and more important, which decides whether a search engine is good. Learning to rank is the current popular page ranking field, and has good performance. According to the ranking method, it can be divided into three categories: pointwise, pairwise and listwise. And listwise approach usually performs better than the other approaches. ListNet is a classical and effect listwise approach. But also has shortcomings: ListNet algorithm convergence slowly, and its running time directly depends on the size of training set and is not suited to train a data set with large quantity queries , not to mention mass datasets.Nowadays, as there are more and more large data sets, and even mass datasets, existing algorithms of learning to rank have not been suited to train a data set with large quantity queries , not to mention mass datasets. Because the existing algorithms need to read the training data into memory for one time, but large data sets, and even mass datasets is not possible to be read into memory once. Therefore, the motivation of this paper is to design a algorithm of learning to rank that can be applied to train the large datasets, and even mass datasets.This paper proposes the improvement ideas of ListNet algorithm. The original algorithm use neural network as model, and now we use SVM. We use Stochastic Gradient Descent algorithm instead of the original gradient descent algorithm. We combine ListNet and Pegasos to make up the shortfall, make the improved algorithm convergence faster, make the running time does not directly depends on the size of the training set and make the improved algorithm can be suited to train a data set with large quantity queries, even mass datasets.In this paper, we use LETOR MQ2007,MQ2008, OHSUMED TD2003, TD2004 data sets to do the experiments. First, we compare running time of the improved algorithm and ListNet's. Second, we compare the accuracy of the improved algorithm and ListNet's. Last, we compare the accuracy of the improved algorithm and other listwise approaches. Experimental results show that relative to ListNet, the improved algorithm has absolute advantage in the training time, and the improved algorithm performs better than ListNet and other listwise approaches in data set with large quantity queries.

Keywords/Search Tags:

learning to rank, listwise, Stochastic Gradient Descent, ListNet

PDF Full Text Request

Related items

1	The Reseach And Application Of Stochastic Gradient Descent And Dual Coordinate Descent Algorithm
2	A Research Of Stochastic Gradient Descent Algorithm
3	Optimization Algorithms Of Neural Networks Weights Based On Stochastic Gradient Descent
4	A Research And Application On Stochastic Gradient Descent Algorithm In Distributed Cluster
5	Research On Distributed Stochastic Gradient Descent Algorithm
6	Research On Improving The Convergence Performance Of Stochastic Gradient Descent In Distributed Machine Learning
7	Imbalanced Stochastic Gradient Descent Online Algorithm For SVM
8	Dynamic Regret Of Online Gradient Descent:Analyses And Applications
9	Applied Research On Gradient Descent Algorithm In Deep Learning
10	Research On Privacy Preserving Methods In Stochastic Gradient Descent