api change
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
Liang, Linxi, Gong, Jing, Liu, Mingwei, Wang, Chong, Ou, Guangsheng, Wang, Yanlin, Peng, Xin, Zheng, Zibin
Large Language Models (LLMs) have become pivotal tools for automating code generation in software development. However, these models face significant challenges in producing version-aware code for rapidly evolving languages like Rust, where frequent Application Programming Interfaces (API) changes across versions lead to compatibility issues and correctness errors. Existing benchmarks lack systematic evaluation of how models navigate API transitions, relying on labor-intensive manual curation and offering limited version-specific insights. To address this gap, we present RustEvo, a novel framework for constructing dynamic benchmarks that evaluate the ability of LLMs to adapt to evolving Rust APIs. RustEvo automates dataset creation by synthesizing 588 API changes (380 from Rust standard libraries, 208 from 15 third-party crates) into programming tasks mirroring real-world challenges. These tasks cover four API evolution categories: Stabilizations, Signature Changes, Behavioral Changes, and Deprecations, reflecting their actual distribution in the Rust ecosystem. Experiments on state-of-the-art (SOTA) LLMs reveal significant performance variations: models achieve a 65.8% average success rate on stabilized APIs but only 38.0% on behavioral changes, highlighting difficulties in detecting semantic shifts without signature alterations. Knowledge cutoff dates strongly influence performance, with models scoring 56.1% on before-cutoff APIs versus 32.5% on after-cutoff tasks. Retrieval-Augmented Generation (RAG) mitigates this gap, improving success rates by 13.5% on average for APIs released after model training. Our findings underscore the necessity of our evolution-aware benchmarks to advance the adaptability of LLMs in fast-paced software ecosystems. The framework and the benchmarks are publicly released at https://github.com/SYSUSELab/RustEvo.
- South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.05)
- Asia > China (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- (5 more...)
Keras 2.3.0 is the last major release of multi-backend Keras - JAXenter
Keras, the deep learning library written in Python, has a new release. Version 2.3.0 is now the first release that supports TensorFlow 2.0. This version adds a few breaking changes and API changes and maintains TensorFlow 1.14 and 1.13 compatibility. For those new to the API, a quick introduction: Keras is a deep learning that's user friendly and uses models as a way to organize layers. It allows for fast prototyping and supports convolutional networks and recurrent networks.
TensorFlow 1.6: Increased support and bug fixes - JAXenter
It's only been two months since the last release from TensorFlow. However, they've certainly been busy. This update focuses mostly on bug fixes, API changes, and a few new features. Let's take a look at what's new in this ML favorite. This release focuses on improved support, documentation, and a few other API changes. TensorFlow 1.6 has added a second version of their Getting Stared document, aimed specifically at ML newcomers.