Text filtering and ranking for security bug report prediction

Peters, Fayola; Tun, Thein Than; Yu, Yijun; Nuseibeh, Bashar

Nuseibeh_2019_Text.pdf (3.87 MB)

conference contribution

posted on 2019-11-29, 15:13 authored by Fayola Peters, Thein Than Tun, Yijun Yu, Bashar Nuseibeh

Security bug reports can describe security-critical vulnerabilities in software products. Bug tracking systems may contain thousands of bug reports, of which relatively few are security related. Finding unlabelled security bugs among them can therefore be challenging. To help security engineers identify these reports quickly and accurately, text-based prediction models have been proposed. These models often mislabel security bug reports for a number of reasons, such as class imbalance, where the ratio of non-security to security bug reports is very high. More critically, we have observed that the presence of security-related keywords in both security and non-security bug reports can lead to the mislabelling of security bug reports. This paper proposes FARSEC, a framework for filtering and ranking bug reports that reduces the presence of security-related keywords. Before building prediction models, our framework identifies and removes non-security bug reports that contain security-related keywords. We demonstrate that FARSEC improves the performance of text-based prediction models for security bug reports in 90% of cases. Specifically, we evaluate it with 45,940 bug reports from Chromium and four Apache projects. With our framework, we mitigate the class imbalance issue and reduce the number of mislabelled security bug reports by 38%.
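
The filtering idea described in the abstract can be illustrated with a minimal Python sketch. It is not the FARSEC implementation: choosing keywords by raw term frequency, scoring a report by the fraction of its tokens that are keywords, the 0.5 threshold, and every function name below are assumptions made for illustration only.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase a report and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def security_keywords(security_reports, top_k=2):
    """Assumed keyword selection: the top_k most frequent terms
    found in the reports already labelled as security related."""
    counts = Counter(tok for r in security_reports for tok in tokenize(r))
    return {term for term, _ in counts.most_common(top_k)}


def keyword_score(report, keywords):
    """Assumed score: fraction of a report's tokens that are keywords."""
    tokens = tokenize(report)
    if not tokens:
        return 0.0
    return sum(tok in keywords for tok in tokens) / len(tokens)


def filter_training_set(security_reports, other_reports, top_k=2, threshold=0.5):
    """Drop non-security reports whose keyword score reaches the threshold,
    so they no longer blur the two classes before a model is trained."""
    keywords = security_keywords(security_reports, top_k)
    kept = [r for r in other_reports if keyword_score(r, keywords) < threshold]
    return keywords, kept


if __name__ == "__main__":
    # Toy data, invented for this example.
    security = ["overflow exploit overflow", "exploit overflow heap"]
    other = ["button colour wrong", "overflow exploit test", "heap page layout broken"]
    keywords, kept = filter_training_set(security, other)
    print(sorted(keywords))
    print(kept)
```

Removing the keyword-heavy non-security reports also shrinks the majority class, which is one way to read the abstract's remark that the framework mitigates class imbalance.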

Funding

Study on Aerodynamic Characteristics Control of Slender Body Using Active Flow Control Technique

Japan Society for the Promotion of Science

History

Publication

IEEE Transactions on Software Engineering;45 (6), pp. 615-631

Publisher

IEEE Computer Society

Note

peer-reviewed

Other Funding information

SFI, ERC

Rights

© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Language

English

Keywords

security cross words; security related keywords; security bug reports; text filtering; ranking; prediction models; transfer learning

Licence

CC BY-NC-SA 1.0
