Learning Software Bug Reports: A Systematic Literature Review
Long, Guoming, Gong, Jingzhi, Fang, Hui, Chen, Tao
arXiv.org Artificial Intelligence
Recent advances in artificial intelligence, especially machine learning (ML), have significantly impacted software engineering research, including bug report analysis. ML aims to automate the understanding, extraction, and correlation of information from bug reports. Despite its growing importance, there has been no comprehensive review in this area. In this paper, we present a systematic literature review covering 1,825 papers, of which 204 were selected for detailed analysis. We derive eight key findings: 1) CNN, LSTM, and $k$NN are used extensively for bug report analysis, while advanced models like BERT are underutilized due to their complexity. 2) Word2Vec and TF-IDF are popular for feature representation, with deep learning approaches on the rise. 3) Stop word removal is the most common preprocessing step, with structural methods rising after 2020. 4) Eclipse and Mozilla are the most frequently evaluated software projects. 5) Bug categorization is the most common task, followed by bug localization and severity prediction. 6) There is increasing attention on specific bug types, such as non-functional and performance bugs. 7) Common evaluation metrics are F1-score, Recall, Precision, and Accuracy, with $k$-fold cross-validation preferred for model evaluation. 8) Many studies lack robust statistical tests. We also identify six promising future research directions to provide useful insights for practitioners.
Jul-22-2025
- Country:
- Africa > Middle East
- Morocco (0.04)
- Asia
- China
- Beijing > Beijing (0.04)
- Guangdong Province > Guangzhou (0.04)
- Hubei Province > Wuhan (0.04)
- Inner Mongolia > Hohhot (0.04)
- Jiangsu Province > Nanjing (0.04)
- Shaanxi Province > Xi'an (0.04)
- Sichuan Province > Chengdu (0.04)
- Tianjin Province > Tianjin (0.04)
- India > West Bengal
- Kolkata (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Middle East > Jordan (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Portugal > Coimbra
- Coimbra (0.04)
- Sweden
- Västra Götaland > Gothenburg (0.04)
- Västmanland County > Västerås (0.04)
- France (0.04)
- United Kingdom > England
- Leicestershire > Loughborough (0.04)
- West Midlands > Birmingham (0.04)
- West Yorkshire > Leeds (0.40)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Galicia > Madrid (0.04)
- Germany (0.04)
- Italy > Tuscany
- Florence (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- North America
- Canada > Ontario
- National Capital Region > Ottawa (0.04)
- Toronto (0.04)
- United States
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- Arizona > Maricopa County
- Scottsdale (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Florida > Orange County
- Orlando (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Texas > Travis County
- Austin (0.04)
- Oceania > New Zealand
- North Island > Auckland Region > Auckland (0.04)
- South America > Argentina
- Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (0.93)
- New Finding (1.00)