C2RUST-BENCH: A Minimized, Representative Dataset for C-to-Rust Transpilation Evaluation
Sirlanci, Melih, Yagemann, Carter, Lin, Zhiqiang
–arXiv.org Artificial Intelligence
Despite the effort in vulnerability detection over the last two decades, memory safety vulnerabilities continue to be a critical problem. Recent reports suggest that the key solution is to migrate to memory-safe languages. To this end, C-to-Rust transpilation becomes popular to resolve memory-safety issues in C programs. Recent works propose C-to-Rust transpilation frameworks; however, a comprehensive evaluation dataset is missing. Although one solution is to put together a large enough dataset, this increases the analysis time in automated frameworks as well as in manual efforts for some cases. In this work, we build a method to select functions from a large set to construct a minimized yet representative dataset to evaluate the C-to-Rust transpilation. We propose C2RUST-BENCH that contains 2,905 functions, which are representative of C-to-Rust transpilation, selected from 15,503 functions of real-world programs.
arXiv.org Artificial Intelligence
Apr-22-2025
- Country:
- Asia > Middle East
- Oman (0.04)
- North America > United States
- Ohio (0.04)
- Asia > Middle East
- Genre:
- Research Report (0.84)
- Industry:
- Technology: