Software Engineering
Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation
Due to the page limit in the submitted paper, we provide more detailed information on our proposed benchmark dataset LIG-MM and our proposed framework LLM-SE. The supplementary material is organized as follows.
Dataset Documentation: We have documented our dataset for intended researchers as required. The website of our benchmark dataset is available at the following link: https://anonymous. The link to download the fine-tuned models is https://mega.nz/file/M9FEWCjD#
Dataset Statistics: As mentioned in our paper, the benchmark programs in existing papers are mostly numerical programs. To fill this gap in benchmarks for general loop invariant generation, we propose LIG-MM, a loop invariant generation benchmark of memory-manipulation programs. Table 1 below shows the basic statistics of the code in LIG-MM. Our programs come from four main sources: course codes, competition codes, previous relevant work, and actual system codes. The programs are modified into a unified format for easier use. Multiple examples are shown in Sec. 3, and the licenses of the benchmarks can be found in Sec. 4.
Course codes.
Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation
Chang Liu
Program verification is vital for ensuring software reliability, especially in the context of increasingly complex systems. Loop invariants, which remain true before and after each iteration of a loop, are crucial for this verification process. Traditional provers and machine-learning-based methods for generating loop invariants often require expert intervention or extensive labeled data, and typically handle only numerical property verification.
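To make the notion concrete, the following is an illustrative sketch (not a program drawn from LIG-MM) of a memory-manipulating loop with its invariant stated as runtime assertions; the Python rendering, names, and ghost state are our own additions.

# Illustrative sketch: length of a singly linked list. The invariant --
# "count equals the number of nodes traversed so far, and cur points to
# the first untraversed node" -- holds before and after every iteration.

class Node:
    def __init__(self, val, nxt=None):
        self.val = val
        self.nxt = nxt

def length(head):
    count, cur = 0, head
    visited = set()                   # ghost state, used only to check the invariant
    while cur is not None:
        assert count == len(visited)  # invariant holds before the body
        visited.add(id(cur))
        count, cur = count + 1, cur.nxt
        assert count == len(visited)  # invariant holds after the body
    return count

# usage: length(Node(1, Node(2, Node(3)))) == 3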
Neur2BiLO: Neural Bilevel Optimization
Bilevel optimization deals with nested problems in which a leader takes the first decision to minimize their objective function while accounting for a follower's best-response reaction. Constrained bilevel problems with integer variables are particularly notorious for their hardness. While exact solvers have been proposed for mixed-integer linear bilevel optimization, they tend to scale poorly with problem size and are hard to generalize to the non-linear case. On the other hand, problem-specific algorithms (exact and heuristic) are limited in scope.
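For reference, the generic bilevel problem the abstract alludes to can be written as follows; the notation is the standard textbook form rather than anything taken from the paper:

\[
\begin{aligned}
\min_{x \in X} \quad & F(x, y^{*}(x)) \\
\text{s.t.} \quad & y^{*}(x) \in \arg\min_{y \in Y(x)} f(x, y),
\end{aligned}
\]

where the leader chooses x anticipating the follower's best response y*(x); mixed-integer variants additionally require some components of x and y to be integral, which is the source of the hardness noted above.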
Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
Discrete structures play an important role in applications like programming language modeling and software engineering. Current approaches to predicting complex structures typically consider autoregressive models for their tractability, with some sacrifice in flexibility. Energy-based models (EBMs), on the other hand, offer a more flexible and thus more powerful approach to modeling such distributions, but require partition function estimation. In this paper, we propose ALOE, a new algorithm for learning conditional and unconditional EBMs for discrete structured data, where parameter gradients are estimated using a learned sampler that mimics local search. We show that the energy function and sampler can be trained efficiently via a new variational form of power iteration, achieving a better trade-off between flexibility and tractability. Experimentally, we show that learning local search leads to significant improvements in challenging application domains. Most notably, we present an energy model guided fuzzer for software testing that achieves comparable performance to well-engineered fuzzing engines like libFuzzer.
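As a rough illustration of the local-search idea (a minimal NumPy sketch, not the authors' ALOE algorithm: the linear energy and the Metropolis-style sampler standing in for the learned sampler are our simplifications):

import numpy as np

def energy(theta, x):
    # toy linear energy over binary vectors; ALOE supports far richer energies
    return -float(x @ theta)

def local_search_sample(theta, x0, steps=100, rng=None):
    # local exploration: propose single-bit flips, accept with prob min(1, exp(-dE))
    rng = np.random.default_rng() if rng is None else rng
    x = x0.copy()
    for _ in range(steps):
        i = rng.integers(len(x))
        prop = x.copy()
        prop[i] ^= 1
        dE = energy(theta, prop) - energy(theta, x)
        if dE < 0 or rng.random() < np.exp(-dE):
            x = prop
    return x

def nll_grad(theta, data, rng=None):
    # standard EBM gradient: E_data[dE/dtheta] - E_model[dE/dtheta];
    # for the linear energy above, dE/dtheta = -x
    samples = np.stack([local_search_sample(theta, x, rng=rng) for x in data])
    return -(data.mean(axis=0) - samples.mean(axis=0))

# one SGD step on a batch of binary vectors: theta -= 0.1 * nll_grad(theta, data)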
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Language model (LM) agents are increasingly being used to automate complicated tasks in digital environments. Just as humans benefit from powerful software applications, such as integrated development environments, for complex tasks like software engineering, we posit that LM agents represent a new category of end users with their own needs and abilities, and would benefit from specially built interfaces to the software they use. We investigate how interface design affects the performance of language model agents. As a result of this exploration, we introduce SWE-agent: a system that enables LM agents to autonomously use computers to solve software engineering tasks. SWE-agent's custom agent-computer interface (ACI) significantly enhances an agent's ability to create and edit code files, navigate entire repositories, and execute tests and other programs. We evaluate SWE-agent on SWE-bench and HumanEvalFix, achieving state-of-the-art performance on both with a pass@1 rate of 12.5% and 87.7%, respectively, far exceeding the previous state-of-the-art achieved with non-interactive LMs. Finally, we provide insight on how the design of the ACI can impact agents' behavior and performance.
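To give a flavor of what an agent-computer interface looks like, here is a hypothetical sketch; these three commands and their behavior are our invention for illustration, not SWE-agent's actual ACI.

import pathlib, subprocess

def aci_open(path, line=1, window=40):
    # show a bounded, line-numbered window of a file, so the agent never
    # receives more context than it can use
    lines = pathlib.Path(path).read_text().splitlines()
    start = max(line - 1, 0)
    chunk = lines[start:start + window]
    return "\n".join(f"{start + i + 1}| {s}" for i, s in enumerate(chunk))

def aci_edit(path, start, end, replacement):
    # replace lines start..end (1-indexed, inclusive) and echo the result back
    p = pathlib.Path(path)
    lines = p.read_text().splitlines()
    lines[start - 1:end] = replacement.splitlines()
    p.write_text("\n".join(lines) + "\n")
    return aci_open(path, line=start)

def aci_run(cmd):
    # run a program and return truncated output as concise feedback
    out = subprocess.run(cmd, shell=True, capture_output=True, text=True)
    return (out.stdout + out.stderr)[-2000:]

The design point, per the abstract, is that compact, structured feedback (windows, line numbers, truncated logs) suits LM agents better than a raw human-oriented terminal.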
Appendix A: Source codes
Source codes for reproducing our experimental results are available at https://github.com/ To keep the dataset size consistent across environments, we use the number of expert demonstrations N ∈ {20, 50}. We provide the size of the dataset for each environment in Table 4. Following de Haan et al. [12], we consider confounded Atari environments, where images are augmented with previous actions (see Figure 4). We provide source code for loading images from the dataset, preprocessing images, and augmenting the action numbers onto the images in Section A. For the experiments with selected environments in Figure 7, we randomly chose 8 confounded Atari environments, i.e., BankHeist, Enduro, KungFuMaster, Pong, PrivateEye, RoadRunner, Seaquest, and UpNDown, due to the high computational cost of considering all environments.
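A minimal sketch of the confounding step described above (hypothetical: de Haan et al. render the previous action onto the frame, while this simplification just stamps an intensity-coded corner patch; the actual code is in the linked repository):

import numpy as np

def confound(frame, prev_action, patch=8):
    # overwrite a small corner patch whose intensity encodes the previous
    # action, so a naive imitator can latch onto it as a causal shortcut
    out = frame.copy()
    out[:patch, :patch] = (prev_action * 17) % 256
    return out

# usage: obs = confound(obs, prev_action) before feeding obs to the policy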
Tangent: Automatic differentiation using source-code transformation for dynamically typed array programming
Bart van Merriënboer, Dan Moldovan, Alexander Wiltschko
The need to efficiently calculate first- and higher-order derivatives of increasingly complex models expressed in Python has stressed or exceeded the capabilities of available tools. In this work, we explore techniques from the field of automatic differentiation (AD) that can give researchers expressive power, performance and strong usability. These include source-code transformation (SCT), flexible gradient surgery, efficient in-place array operations, and higher-order derivatives. We implement and demonstrate these ideas in the Tangent software library for Python, the first AD framework for a dynamic language that uses SCT.
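For orientation, basic use of Tangent looks like the following; this is a minimal sketch based on the project's documented grad API, and since Tangent is research software, details may have changed.

import tangent

def f(x):
    return x * x + 3.0 * x

df = tangent.grad(f)   # source-code transformation: Tangent emits a new
                       # Python function that computes df/dx
print(df(2.0))         # 2 * 2.0 + 3.0 = 7.0

# higher-order derivatives come from transforming the transformed code:
d2f = tangent.grad(df)

Because the derivative is ordinary generated Python source rather than a taped graph, it can be read, edited, and debugged like hand-written code, which is the usability argument the abstract makes for SCT.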