NALA_MAINZ at BLP-2025 Task 2: A Multi-agent Approach for Bangla Instruction to Python Code Generation
Saadi, Hossain Shaikh, Alam, Faria, Sanz-Guerrero, Mario, Bui, Minh Duc, Mager, Manuel, von der Wense, Katharina
–arXiv.org Artificial Intelligence
This paper presents JGU Mainz's winning system for the BLP-2025 Shared Task on Code Generation from Bangla Instructions. We propose a multi-agent-based pipeline. First, a code-generation agent produces an initial solution from the input instruction. The candidate program is then executed against the provided unit tests (pytest-style, assert-based). Only the failing cases are forwarded to a debugger agent, which reruns the tests, extracts error traces, and, conditioning on the error messages, the current program, and the relevant test cases, generates a revised solution. Using this approach, our submission achieved first place in the shared task with a $Pass@1$ score of 95.4. We also make our code public.
arXiv.org Artificial Intelligence
Nov-24-2025
- Country:
- Europe > Germany
- Rheinland-Pfalz > Mainz (0.61)
- Saarland (0.04)
- North America > United States
- Colorado > Boulder County
- Boulder (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- Colorado > Boulder County
- Europe > Germany
- Genre:
- Research Report (0.64)
- Technology: