Research On Learning-based Database Query Optimization

Posted on:2024-09-21

Degree:Master

Type:Thesis

Country:China

Candidate:W Q Zhou

Full Text:PDF

GTID:2558307079959549

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

With the rapid development of communication technology and the internet,data volume has proliferated and database query efficiency has gradually become a bottleneck for large-scale business performance.In database query optimization,cardinality estimation and join order selection are the two most crucial components.Traditional cardinality estimation algorithms are unable to utilize the underlying data distribution,resulting in significant errors.Additionally,traditional join order selection is unable to efficiently search the vast solution space,leading to excessively long execution time for query plans.This thesis deeply investigates the current research status of database query optimization and the frontier technology of artificial intelligence,and innovatively proposes a learningbased query optimization algorithm.In order to solve the problem of huge errors in traditional cardinality estimation,this thesis proposes a learning based cardinality estimation algorithm GTR based on Graph Transformer.By studying the current query optimizer structure and combining the cuttingedge results in the field of graph neural networks,this thesis encodes queries into graphstructured data,utilizes Graph Transformer based on graph attention mechanism to extract query features,perceives the underlying data distribution patterns,mines the correlation information between attributes and tables,and finally significantly improves the accuracy of cardinality estimation.In order to solve the problem that traditional join order selection has difficulty in producing excellent execution plans,this thesis proposes a join order selection algorithm TARL based on Tree Attention and reinforcement learning.The thesis abstracts the join order selection as a Markov decision process and solves it through reinforcement learning.Based on the Attention mechanism,combined with the tree structure and data transmission characteristics of query execution plans,this thesis proposes Tree Attention to extract features from the execution plan,and combines it with GTR to extract features from the original query,which together make up the value network model.By using the DQN reinforcement learning algorithm framework and interacting with the database,the model is trained with real-time delay as a guide to predict long term rewards of joins,generating excellent query execution plan and significantly reducing query latency.Finally,this thesis combines the learning based cardinality estimation algorithm GTR and join order selection algorithm TARL with Ti DB to implement a prototype system.The experimental results show that the cardinality estimation algorithm GTR and the join order selection algorithm TARL have significantly improved multiple indicators compared to Ti DB and other learning based algorithms,sensibly improving the database query efficiency.

Keywords/Search Tags:

Query Optimization, Cadinality Estimation, Join Order Selection, Graph Neural Network, Reinforcement Learning

PDF Full Text Request

Related items

1	Research And Implementation Of Database Query Optimization Based On Graph Neural Network
2	Join Order Selection Optimization With Deep Graph-based Representation
3	Research On Database Query Optimization Method Based On Deep Reinforcement Learning
4	Benchmarking Join Order Selection For Complex Joins
5	Research On Database Index Selection Based On Learned Cost Estimator And Reinforcement Learning
6	Research On Optimization Of Multi-table Join Order Of Database Based On Monte Carlo Tree Search
7	Research And Implementation Of Deep Learning Computation Graph Optimization Based On Reinforcement Learning
8	Querying Optimization Research Based On XML Database
9	Research Of Query Optimization Based On Join Index
10	Research On Data Query Optimization Algorithm Of Distributed Database