Research On Online Learning Algorithm Based On Optimized Product Quantization

Posted on:2021-04-04

Degree:Master

Type:Thesis

Country:China

Candidate:C Liu

Full Text:PDF

GTID:2428330623967771

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

With the rapid development of computers and the popularization of the Internet,information has exploded on the Internet.The development of information has led to the need for higher dimensions of expression data.In order to react to users more quickly,how to quickly perform matching search and feedback in a massive database in a short time has become a huge challenge at present.Based on this background,some algorithms about approximate nearest neighbor(ANN)have been proposed successively.Among those algorithms,quantization-based technology has achieved great success due to its high search performance,strong expression ability,and small memory space.However,most of the existing quantization methods are batch-based models,which are not suitable for processing streaming data,and the existing online product quantization model do not optimize the space decomposition.In order to solve the above problems,this paper proposes an online optimized product quantization(Online OPQ)model.The model can dynamically update the quantized codebook and rotation matrix,and optimize the space decomposition during the model update stage,which can reduce the quantization error and improve the search performance.Because the online OPQ is essentially an online model,only new data is needed during the update process without historical data.In the parameter initialization stage,an optimized product quantization(OPQ)algorithm is used to initialize the codebook,rotation matrix,and counter.During the updating of parameters,when learning new data,the online OPQ will first rotate the data to the optimal space.In the following,the code books are updated by the calculation process of the K-means algorithm.Then the codebooks of two adjacent batches can be obtained.In order to ensure the optimality of space,this paper proposes to introduce a correction matrix to the rotation matrix.The correction matrix is used to track the change of the center points.In order to ensure the orthogonality of the rotation matrix,the correction matrix also needs to be an orthogonal matrix.The process of solving the correction matrix is exactly the Orthogonal Procrustes problem.By solving this problem,the orthogonality of the correction matrix can be guaranteed.In addition,in order to measure the difference between the online OPQ model and the traditional OPQ codebook,a loss error boundary is theoretically derived.In this paper,a series of comparative experiments are performed with the baseline models on the public datasets.The experimental results show that compared with the baseline models,the performance of online OPQ on approximate nearest neighbor search is more effective.With more subspaces are decomposed,the expressiveness of the codebooks is much stronger and the quantization performance is much better.By tracking the change of center points,this method can optimize the decomposition space which can reduce the quantization error and further improve the search performance.

Keywords/Search Tags:

approximate nearest neighbor, product quantization, online learning model

PDF Full Text Request

Related items

1	Research On Approximate Nearest Neighbor Search Method Based On Product Quantization
2	Quantization For Approximate Nearest Neighbor Search
3	Binary Hashing And Quantization Methods For Approximate Nearest Neighbor Search
4	Research And Application Of Approximate Nearest Neighbor Algorithm Based On Navigable Small World
5	Iterative Quantization-based Hashing For Approximate Nearest Neighbor Retrieval
6	Research On Optimized Residual Vector Quantization For Nearest Neighbor Search In Image Retrieval
7	Research On Local Sensitive Hashing And Approximate Nearest Neighbor Algorithm
8	Research And Implementation Of Image Retrieval On Deep Learning
9	Research On Nearest Neighbor Search Method Based On Multi-stage Vector Quantization
10	Approximate Nearest Neighbor Search Based On Optimized Bit Allocation And Residual Rotation