AITopics

Genre: Research Report > New Finding (0.59)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Neural Information Processing SystemsFeb-17-2026, 16:34:07 GMT

b6edb87876bec4ac2260bffa083cb992-Paper-Conference.pdf

large language model, machine learning, translation, (22 more...)

Country:

Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
North America > United States > Iowa > Story County > Ames (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry: Information Technology (0.46)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(3 more...)

Neural Information Processing SystemsOct-11-2025, 00:37:28 GMT

b6edb87876bec4ac2260bffa083cb992-Paper-Conference.pdf

blockidx, coderosetta, translation, (15 more...)

Country:

Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
North America > United States > Iowa > Story County > Ames (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry: Information Technology (0.46)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(3 more...)

Boruch-Gruszecki, Aleksander, Zi, Yangtian, Wu, Zixuan, Oberoi, Tejas, Anderson, Carolyn Jane, Biswas, Joydeep, Guha, Arjun

Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

arXiv.org Artificial IntelligenceAug-8-2025

Large language models (LLMs) already excel at writing code in high-resource languages such as Python and JavaScript, yet stumble on low-resource languages that remain essential to science and engineering. Besides the obvious shortage of pre-training data, post-training itself is a bottleneck: every new language seems to require new datasets, test harnesses, and reinforcement-learning (RL) infrastructure. We introduce Agnostics, a language-agnostic post-training pipeline that eliminates this per-language engineering. The key idea is to judge code solely by its externally observable behavior, so a single verifier can test solutions written in any language. Concretely, we (i) use an LLM to rewrite existing unit-test datasets into an I/O format, (ii) supply a short configuration that tells the verifier how to compile and run a target language, and (iii) apply reinforcement learning with verifiable rewards (RLVR) in a robust code execution environment. Applied to five low-resource languages--Lua, Julia, R, OCaml, and Fortran--Agnostics (1) improves Qwen-3 4B to performance that rivals other 16B-70B open-weight models; (2) scales cleanly to larger and diverse model families (Qwen-3 8B, DeepSeek Coder 6.7B Instruct, Phi 4 Mini); and (3) for ${\le} 16$B parameter models, sets new state-of-the-art pass@1 results on MultiPL-E and a new multi-language version LiveCodeBench that we introduce. We will release the language-agnostic training datasets (Ag-MBPP-X, Ag-Codeforces-X, Ag-LiveCodeBench-X), training code, and ready-to-use configurations, making RL post-training in any programming language as simple as editing a short YAML file.

large language model, machine learning, programming language, (20 more...)

2508.04865

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.67)

Industry: Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Neural Information Processing SystemsMay-27-2025, 13:49:13 GMT

CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming

large language model, natural language, translation, (10 more...)

Country: Asia > Middle East > Iran > Tehran Province > Tehran (0.08)

Genre: Research Report > New Finding (0.61)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.73)

Diehl, Patrick, Nader, Nojoud, Moraru, Maxim, Brandt, Steven R.

LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages

arXiv.org Artificial IntelligenceMar-24-2025

Large Language Models (LLMs) have made significant advances in various code-related tasks, particularly in generating source code from natural language descriptions (Zhao et al. (2023); Chang et al. (2024)). Their effectiveness is primarily driven by their extensive number of model parameters, the use of large and diverse datasets, and the immense computational resources employed during training (Kaplan et al. (2020)). These models are typically trained on vast corpora sourced from the web. LLMs are capable of capturing intricate patterns, linguistic subtleties, and semantic relationships. A wide range of models are available for code generation. There are general-purpose models like ChatGPT (Ouyang et al. (2022)), GPT -4 (Achiam et al. (2023)), and LLaMA (Touvron et al. (2023a)) which are designed for a broad range of applications, as well as specialized models such as StarCoder, Code LLaMA (Roziere et al. (2023)), DeepSeek-Coder, and Code Gemma that are optimized for code-related tasks. The integration of code generation with the latest advances in LLM technology is now an essential tool for many businesses, as well as an essential target for LLM developers as programming languages are considered to be different dialects of natural language (Athiwaratkun et al. (2022)).

large language model, machine learning, natural language, (17 more...)

2503.19217

Country:

North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Energy (1.00)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceFeb-7-2025

Native Fortran Implementation of TensorFlow-Trained Deep and Bayesian Neural Networks

Furlong, Aidan, Zhao, Xingang, Salko, Bob, Wu, Xu

Over the past decade, the investigation of machine learning (ML) within the field of nuclear engineering has grown significantly. With many approaches reaching maturity, the next phase of investigation will determine the feasibility and usefulness of ML model implementation in a production setting. Several of the codes used for reactor design and assessment are primarily written in the Fortran language, which is not immediately compatible with TensorFlow-trained ML models. This study presents a framework for implementing deep neural networks (DNNs) and Bayesian neural networks (BNNs) in Fortran, allowing for native execution without TensorFlow's C API, Python runtime, or ONNX conversion. Designed for ease of use and computational efficiency, the framework can be implemented in any Fortran code, supporting iterative solvers and UQ via ensembles or BNNs. Verification was performed using a two-input, one-output test case composed of a noisy sinusoid to compare Fortran-based predictions to those from TensorFlow. The DNN predictions showed negligible differences and achieved a 19.6x speedup, whereas the BNN predictions exhibited minor disagreement, plausibly due to differences in random number generation. An 8.0x speedup was noted for BNN inference. The approach was then further verified on a nuclear-relevant problem predicting critical heat flux (CHF), which demonstrated similar behavior along with significant computational gains. Discussion regarding the framework's successful integration into the CTF thermal-hydraulics code is also included, outlining its practical usefulness. Overall, this framework was shown to be effective at implementing both DNN and BNN model inference within Fortran, allowing for the continued study of ML-based methods in real-world nuclear applications.

fortran, implementation, prediction, (13 more...)

2502.06853

Country:

North America > United States > Tennessee > Knox County > Knoxville (0.14)
North America > United States > Tennessee > Anderson County > Oak Ridge (0.05)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
(3 more...)

Genre: Research Report (0.50)

Industry:

Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.95)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

TehraniJamsaz, Ali, Bhattacharjee, Arijit, Chen, Le, Ahmed, Nesreen K., Yazdanbakhsh, Amir, Jannesari, Ali

CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming

arXiv.org Artificial IntelligenceOct-27-2024

Recent advancements in Large Language Models (LLMs) have renewed interest in automatic programming language translation. Encoder-decoder transformer models, in particular, have shown promise in translating between different programming languages. However, translating between a language and its high-performance computing (HPC) extensions remains underexplored due to challenges such as complex parallel semantics. In this paper, we introduce CodeRosetta, an encoder-decoder transformer model designed specifically for translating between programming languages and their HPC extensions. CodeRosetta is evaluated on C++ to CUDA and Fortran to C++ translation tasks. It uses a customized learning framework with tailored pretraining and training objectives to effectively capture both code semantics and parallel structural nuances, enabling bidirectional translation. Our results show that CodeRosetta outperforms state-of-the-art baselines in C++ to CUDA translation by 2.9 BLEU and 1.72 CodeBLEU points while improving compilation accuracy by 6.05%. Compared to general closed-source LLMs, our method improves C++ to CUDA translation by 22.08 BLEU and 14.39 CodeBLEU, with 2.75% higher compilation accuracy. Finally, CodeRosetta exhibits proficiency in Fortran to parallel C++ translation, marking it, to our knowledge, as the first encoder-decoder model for this complex task, improving CodeBLEU by at least 4.63 points compared to closed-source and open-code LLMs.

large language model, machine learning, programming language, (21 more...)

2410.20527

Country:

Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
North America > United States > Iowa > Story County > Ames (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(2 more...)

Diehl, Patrick, Nader, Noujoud, Brandt, Steve, Kaiser, Hartmut

Evaluating AI-generated code for C++, Fortran, Go, Java, Julia, Matlab, Python, R, and Rust

arXiv.org Artificial IntelligenceJul-5-2024

This study evaluates the capabilities of ChatGPT versions 3.5 and 4 in generating code across a diverse range of programming languages. Our objective is to assess the effectiveness of these AI models for generating scientific programs. To this end, we asked ChatGPT to generate three distinct codes: a simple numerical integration, a conjugate gradient solver, and a parallel 1D stencil-based heat equation solver. The focus of our analysis was on the compilation, runtime performance, and accuracy of the codes. While both versions of ChatGPT successfully created codes that compiled and ran (with some help), some languages were easier for the AI to use than others (possibly because of the size of the training sets used). Parallel codes -- even the simple example we chose to study here -- also difficult for the AI to generate correctly.

chat gpt 4, correct result, solver, (12 more...)

2405.13101

Country:

North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.15)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
North America > United States > New York (0.04)

Genre: Research Report (0.64)

Industry:

Energy (0.69)
Government > Regional Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

The GuardianJan-13-2024, 16:00:02 GMT

The hard truth about AI? It might produce some better software John Naughton

As you have doubtless noticed, we are in the middle of a feeding frenzy about something called generative AI. Legions of hitherto normal people – and economists – are surfing a wave of irrational exuberance about its transformative potential. For anyone suffering from the fever, two antidotes are recommended. The first is the hype cycle monitor produced by consultants Gartner, which shows the technology currently perched on the "peak of inflated expectations", before a steep decline into the "trough of disillusionment". The other is Hofstadter's law, about the difficulty of estimating how long difficult tasks will take, which says that "It always takes longer than you expect, even when you take into account Hofstadter's law".

better software john naughton, generative ai, hard truth, (9 more...)

The Guardian

Country:

North America > United States > New York (0.05)
Asia > South Korea (0.05)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)