AITopics | prev

e93b673c55d6768cdd39ce90de8c4d4c-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsFeb-18-2026, 13:30:14 GMT

invariant, large language model, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Florida > Pinellas County > St. Petersburg (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
(4 more...)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation

Neural Information Processing SystemsFeb-18-2026, 13:30:11 GMT

We collect 312 programs from various sources, including daily programs from college homework, the international competition (SV -COMP), benchmarks from previous papers (SLING), and programs from real-world software systems (Linux Kernel, GlibC, LiteOS, and Zephyr).

large language model, loop invariant, machine learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Florida > Pinellas County > St. Petersburg (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Diffusion models have achieved remarkable performance in generative modeling, yet their theoretical foundations are often intricate, and the gap between mathematical formulations in papers and practical open-source implementations can be difficult to bridge. Existing tutorials primarily focus on deriving equations, offering limited guidance on how diffusion models actually operate in code. To address this, we present a concise implementation of approximately 300 lines that explains diffusion models from a code-execution perspective. Our minimal example preserves the essential components -- including forward diffusion, reverse sampling, the noise-prediction network, and the training loop -- while removing unnecessary engineering details. This technical report aims to provide researchers with a clear, implementation-first understanding of how diffusion models work in practice and how code and theory correspond. Our code and pre-trained models are available at: https://github.com/disanda/GM/tree/main/DDPM-DDIM-ClassifierFree.

artificial intelligence, machine learning, torch, (19 more...)

arXiv.org Artificial Intelligence

2512.07201

Country: Asia > China (0.14)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Recognising, Anticipating, and Mitigating LLM Pollution of Online Behavioural Research

Rilla, Raluca, Werner, Tobias, Yakura, Hiromu, Rahwan, Iyad, Nussberger, Anne-Marie

arXiv.org Artificial IntelligenceNov-4-2025

Online behavioural research faces an emerging threat as participants increasingly turn to large language models (LLMs) for advice, translation, or task delegation: LLM Pollution. We identify three interacting variants through which LLM Pollution threatens the validity and integrity of online behavioural research. First, Partial LLM Mediation occurs when participants make selective use of LLMs for specific aspects of a task, such as translation or wording support, leading researchers to (mis)interpret LLM-shaped outputs as human ones. Second, Full LLM Delegation arises when agentic LLMs complete studies with little to no human oversight, undermining the central premise of human-subject research at a more foundational level. Third, LLM Spillover signifies human participants altering their behaviour as they begin to anticipate LLM presence in online studies, even when none are involved. While Partial Mediation and Full Delegation form a continuum of increasing automation, LLM Spillover reflects second-order reactivity effects. Together, these variants interact and generate cascading distortions that compromise sample authenticity, introduce biases that are difficult to detect post hoc, and ultimately undermine the epistemic grounding of online research on human cognition and behaviour. Crucially, the threat of LLM Pollution is already co-evolving with advances in generative AI, creating an escalating methodological arms race. To address this, we propose a multi-layered response spanning researcher practices, platform accountability, and community efforts. As the challenge evolves, coordinated adaptation will be essential to safeguard methodological integrity and preserve the validity of online behavioural research.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2508.0139

Country: Europe (0.28)

Genre:

Research Report > Experimental Study (0.67)
Research Report > New Finding (0.67)

Industry:

Law (0.70)
Information Technology > Security & Privacy (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation Anonymous Author(s) Affiliation Address email 1 Overview of Supplementary Material

Neural Information Processing SystemsOct-10-2025, 20:11:52 GMT

Dataset Documentation: We have documented our dataset for intended researchers as required. The link to download the models after fine-tuning is https://mega.nz/file/M9FEWCjD# To fill the lack of benchmarks for general loop invariant generation, we propose LIG-MM, a loop invariant generation benchmark of memory manipulation programs. Table 1 below shows the basics of the code in LIG-MM. Multiple examples are shown in Sec. 3, and the Table 1: Statistics of our proposed LIG-MM benchmark.

assertion, invariant, loop invariant, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Florida > Pinellas County > St. Petersburg (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
(4 more...)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation

Neural Information Processing SystemsOct-10-2025, 20:11:49 GMT

We collect 312 programs from various sources, including daily programs from college homework, the international competition (SV -COMP), benchmarks from previous papers (SLING), and programs from real-world software systems (Linux Kernel, GlibC, LiteOS, and Zephyr).

assertion, invariant, loop invariant, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Florida > Pinellas County > St. Petersburg (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

ONNX-Net: Towards Universal Representations and Instant Performance Prediction for Neural Architectures

Qin, Shiwen, Auras, Alexander, Cohen, Shay B., Crowley, Elliot J., Moeller, Michael, Ericsson, Linus, Lukasik, Jovita

arXiv.org Artificial IntelligenceOct-7-2025

Neural architecture search (NAS) automates the design process of high-performing architectures, but remains bottlenecked by expensive performance evaluation. Most existing studies that achieve faster evaluation are mostly tied to cell-based search spaces and graph encodings tailored to those individual search spaces, limiting their flexibility and scalability when applied to more expressive search spaces. In this work, we aim to close the gap of individual search space restrictions and search space dependent network representations. We present ONNX-Bench, a benchmark consisting of a collection of neural networks in a unified format based on ONNX files. ONNX-Bench includes all open-source NAS-bench-based neural networks, resulting in a total size of more than 600k {architecture, accuracy} pairs. This benchmark allows creating a shared neural network representation, ONNX-Net, able to represent any neural architecture using natural language descriptions acting as an input to a performance predictor. This text-based encoding can accommodate arbitrary layer types, operation parameters, and heterogeneous topologies, enabling a single surrogate to generalise across all neural architectures rather than being confined to cell-based search spaces. Experiments show strong zero-shot performance across disparate search spaces using only a small amount of pretraining samples, enabling the unprecedented ability to evaluate any neural network architecture instantly.

artificial intelligence, machine learning, search space, (18 more...)

arXiv.org Artificial Intelligence

2510.04938

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

bcd0049c35799cdf57d06eaf2eb3cff6-Supplemental.pdf

Neural Information Processing SystemsAug-17-2025, 03:28:07 GMT

artificial intelligence, machine learning, neuron, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois > Cook County > Chicago (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Filters

Collaborating Authors

prev

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

e93b673c55d6768cdd39ce90de8c4d4c-Supplemental-Datasets_and_Benchmarks_Track.pdf

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation

bcd0049c35799cdf57d06eaf2eb3cff6-Supplemental.pdf

5d516fc09b53e9a7fade4fbad703e686-Supplemental-Conference.pdf

Understanding Diffusion Models via Code Execution

Recognising, Anticipating, and Mitigating LLM Pollution of Online Behavioural Research

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation Anonymous Author(s) Affiliation Address email 1 Overview of Supplementary Material

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation

ONNX-Net: Towards Universal Representations and Instant Performance Prediction for Neural Architectures

bcd0049c35799cdf57d06eaf2eb3cff6-Supplemental.pdf