CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching
–Neural Information Processing Systems
Binary source code matching, especially on function-level, has a critical role in the field of computer security. Given binary code only, finding the corresponding source code improves the accuracy and efficiency in reverse engineering. Given source code only, related binary code retrieval contributes to known vulnerabilities confirmation. However, due to the vast difference between source and binary code, few studies have investigated binary source code matching. Previously published studies focus on code literals extraction such as strings and integers, then utilize traditional matching algorithms such as the Hungarian algorithm for code matching.
Neural Information Processing Systems
Feb-2-2026, 06:21:46 GMT