Research On Program Code Classification Based On Deep Learning

Posted on:2022-09-08

Degree:Master

Type:Thesis

Country:China

Candidate:Z Z Tang

Full Text:PDF

GTID:2518306329998799

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The rapid growth of user demand has led to the rapid development of mobile Internet.A large number of applications have been developed to meet the needs of users.Understanding the existing program code is the basic step of many software development tasks.How to quickly analyze the functions of the code,to reduce the time of develop and maintenance program as much as possible,has become a hot issue in the field of software engineering.It has very important practical and economic significance.Traditional program classification tasks can only rely on a large number of human resources for manual labeling.Some scholars learn from the field of natural language processing,introduce deep learning into the field of programming language processing,improve the speed of understanding of program code.Previous studies have used traditional recurrent neural networks or convolutional neural networks as classification models.Because the program has strong local semantic structure characteristics and long-term dependence,this strategy effect is not good.Some researchers have proposed to characterize the code by using vector expression based on abstract syntax tree,and to preserve the local semantic structure features in the code through abstract syntax tree.Based on previous research,this paper puts forward a program code classification research model based on ConvTransformer.It preserves the local semantic structure and long-term dependencies of program code by adding location coding to word embedding and using Transformer's attention mechanism.In this paper,the program language category classification experiment and the program function category classification experiment are carried out,in order to compare a variety of models.In the program language category classification experiment.the classification accuracy rate reached 97.5%,and in the program function category classification experiment,the accuracy rate reached 85.6%.These two experiments prove the effectiveness and advancement of the model.

Keywords/Search Tags:

Program language processing, Code classification, Transformer, Attention

PDF Full Text Request

Related items

1	Research On Code Recommendation Based On Program Analysis And Neural Network Language Model
2	Text Semantic Similarity Algorithm Based On Transformer
3	Research On Syntactic Analyzer And Code Transformer For Quantum Programming Language NDQJava-2Processing System
4	Research On Sentiment Classification Based On BiGRU And Aspect Attention Module
5	Research On Comment Localization Technology Based On Program Analysis And Natural Language Processing
6	Text Sentiment Classification Based On Sentence Structure Information
7	Natural language program analysis: Combining natural language processing with program analysis to improve software maintenance tools
8	Research On Vulnerability Mining Based On Deep Learning And Programming Language Processing
9	Transformer specification language: A system for generating analyzers and its applications
10	Research On Text Classification Method Combining Attention Mechanism And Bi-GRU