A Study Of Boltzmann Machines For Classification And Ranking Tasks

Posted on:2015-03-04

Degree:Master

Type:Thesis

Country:China

Candidate:Q Yu

Full Text:PDF

GTID:2348330485994223

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Boltzmann Machine(BM) is a kind of stochastic graph model usually used for distribution estimation. This paper aims to study the novel adaptions and applications of BM in data classification and information retrieval. This paper proposes and analyzes a R�nyi divergence based generalization for discriminative learning objective of Classification Restricted Boltzmann Machine(ClassRBM). Specifically, we extend the Conditional Log Likelihood(CLL) objective to a general learning criterion. We prove that, some existing popular training methods can be derived from this generalization, via adjusting the parameters to specific values. Moreover, we show that this generalized criterion actually extends the CLL objective with a R�nyi divergence-based regularization. Besides, we can replace the uniform distribution used in this divergence-based regularization by some sample-based distribution and we call the appended loss as general margin. The proposed generalization enables an effective model selection procedure and experiments achieved significant performance improvement over the existing learning methods on data classification tasks. In information retrieval, we proposed a novel retrieval method making use of BM. We aim to generalize the multinomial distribution assumption in traditional language model by exploring the use of fully-visible Boltzmann Machines(BMs) for document modeling. BM is a stochastic recurrent network and is able to model the distribution of multi-dimensional variables. It yields a kind of Boltzmann distribution which is more general than multinomial distribution. We propose a Document Boltzmann Machine(DBM) that can naturally capture the intrinsic connections among terms and estimate query likelihood efficiently. We formally prove that under certain conditions(with 1-order parameters learnt only), DBM subsumes the traditional document language model. Its relations to other graphical models in IR, e.g., MRF model, are also discussed. Our experiments on the document reranking demonstrate the potential of the proposed DBM in information retrieval.

Keywords/Search Tags:

Boltzmann Machines, Data Classification, Information Retrieval

PDF Full Text Request

Related items

1	A Method Of Improving Restricted Boltzmann Machines Via Theta Pure Dependency
2	Data Mining Research In Web Information Retrieval And Classification
3	Image Classification Method Based On Abandoned Stacked Restricted Boltzmann Machine
4	Research Of Sparse Restricted Boltzmann Machine Based On Data Class Information Entropy
5	Research Of Deep Learning Method Based On Restricted Boltzmann Machines
6	Research On Learning Algorithms For Restricted Boltzmann Machines
7	Parameter Choice For Boltzmann Machines:Theories And Applications
8	Study On Boltzmann Machines For Robust Target Recognition
9	Classification Of SAR Images Based On SIFT And Restricted Boltzmann Machines
10	Data association techniques for bearings-only multi-target tracking using simulated annealing and implemented with Boltzmann machines