TY - GEN
T1 - Optimized Mutation of Grey-box Fuzzing
T2 - 12th IEEE Data Driven Control and Learning Systems Conference, DDCLS 2023
AU - Shao, Jiawei
AU - Zhou, Yan
AU - Liu, Guohua
AU - Zheng, Dezhi
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - As a vulnerability discovery technique, fuzzing has been widely used in the field of software testing in recent years. Traditional fuzzing has several drawbacks, including poor efficiency, low code coverage, and a heavy dependence on expert experience. By introducing deep reinforcement learning, one can train the mutator of the fuzzer to move in a desired direction, such as maximizing code coverage or finding more code paths. This paper proposes a reinforcement learning-based fuzzing method to enhance code coverage and explore potential code vulnerabilities. First, the concept of the input field is introduced to the seed file, reducing invalid operations by marking whether each byte of the seed file is a valid byte. Then, mutation is optimized by modeling grey-box fuzzing as a reinforcement learning problem and training the mutator's behavior on test cases. By observing the rewards obtained from mutating an initial program input with a specific set of actions, the fuzzing agent learns a policy that generates new, higher-reward inputs. Finally, experimental results show that the proposed deep reinforcement learning-based fuzzing method outperforms the baseline random fuzzing algorithms.
AB - As a vulnerability discovery technique, fuzzing has been widely used in the field of software testing in recent years. Traditional fuzzing has several drawbacks, including poor efficiency, low code coverage, and a heavy dependence on expert experience. By introducing deep reinforcement learning, one can train the mutator of the fuzzer to move in a desired direction, such as maximizing code coverage or finding more code paths. This paper proposes a reinforcement learning-based fuzzing method to enhance code coverage and explore potential code vulnerabilities. First, the concept of the input field is introduced to the seed file, reducing invalid operations by marking whether each byte of the seed file is a valid byte. Then, mutation is optimized by modeling grey-box fuzzing as a reinforcement learning problem and training the mutator's behavior on test cases. By observing the rewards obtained from mutating an initial program input with a specific set of actions, the fuzzing agent learns a policy that generates new, higher-reward inputs. Finally, experimental results show that the proposed deep reinforcement learning-based fuzzing method outperforms the baseline random fuzzing algorithms.
KW - Fuzzing
KW - Reinforcement Learning
KW - Seed Mutation
KW - Software Testing
UR - http://www.scopus.com/inward/record.url?scp=85165962610&partnerID=8YFLogxK
U2 - 10.1109/DDCLS58216.2023.10166955
DO - 10.1109/DDCLS58216.2023.10166955
M3 - Conference contribution
AN - SCOPUS:85165962610
T3 - Proceedings of 2023 IEEE 12th Data Driven Control and Learning Systems Conference, DDCLS 2023
SP - 1296
EP - 1300
BT - Proceedings of 2023 IEEE 12th Data Driven Control and Learning Systems Conference, DDCLS 2023
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 May 2023 through 14 May 2023
ER -