Research On LSTM-GRU-based Source Code Vulnerability Detection Method For Out-of-bounds Write

Posted on:2022-06-23

Degree:Master

Type:Thesis

Country:China

Candidate:C F Li

Full Text:PDF

GTID:2518306536996909

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Out-of-bounds Write(CWE787),is a vulnerability that can modify the memory data outside the boundary of the buffer by modify the index or execute the pointer algorithm.If a write operation is performed later,it will have serious consequences such as information leakage,equipment being charged,software crashes and even system crashes.In order to detect Out-of-bounds Write in more depth,this paper proposes a source code detection method for Out-of-bounds Write vulnerabilities based on LSTM-GRU.First,aiming at the problem of vulnerability metrics,by analyzing the source code of software vulnerabilities directly,an AST-based representation method of Out-of-bounds Write features is proposed.Aiming at the expression of the lexer rules and parser rules of the AST,based on the common features of Out-of-bounds Write,an improved lexer and parser rules of AST are built.Combining the improved rules with the Out-of-bounds Write text vector generation algorithm,while preserving the grammatical structure of the source code,the feature extraction of Out-of-bounds Write is enhanced,and the Out-of-bounds Write text vector stream is obtained.After that,the word embedding method is used to convert the Out-of-bounds Write text vector stream into a digital tensor form of the Out-ofbounds Write digital vector stream.Secondly,in view of the problem that traditional software vulnerability detection methods cannot obtain source code context semantic information,a natural language processing-based source code detection method for out-of-bounds writing vulnerabilities is proposed.Integrating the LSTM and GRU network and the maximum pooling method in natural language processing,the LSTM-GRU-based Out-of-bounds Write source code detection model is built,and the contextual semantic information extraction and feature extraction of the cross-border write vulnerability source code are achieved.The purpose of out-of-bounds writing vulnerability detection.Finally,an out-of-bounds write vulnerability detection experiment was conducted on the SARD's Out-of-bounds Write source code data set,which verified the effectiveness of the LSTM-GRU-based source code detection method for Out-of-bounds Write in the detection of Out-of-bounds Write.

Keywords/Search Tags:

Out-of-bounds Write, Vulnerability detection, AST, Word embedding, NLP

PDF Full Text Request

Related items

1	Research And Implementation Of Software Source Code Security Vulnerability Representation Learning And Detection Technology
2	Study On The Deep Learning Models And Algorithms Of DGA Domain Name Detection Based On Word Embedding Methods
3	Design And Implementation Of SQL Injection Vulnerability Detection System
4	Research On Chinese Word Segmentation Method Based On Word Embedding
5	Automatic Detection And Analysis Of Chinese Lexical Changes Based On Diachronic Corpora
6	Research On Word Sense Disambiguation Method Based On Word Embedding
7	Research On Binary Software Vulnerability Mining Technology Based On Machine Learning
8	Research On The Representation Of Word Embedding Based On Knowledge Fusion
9	Web Access Behavior Analysis And Study Based On Word Embedding Technology
10	Research And Application On Word Embedding Of Low Frequency Words