AITopics | iter

iter

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Dennis Whyte's fusion quest

MIT Technology Review | Jan-6-2026, 22:00:00 GMT

When the US Department of Energy announced that it would stop funding the tokamak at MIT's Plasma Science and Fusion Center, Dennis Whyte considered giving up on fusion research. But then he had a brainstorm--and challenged his students to bring the idea to life. This full-scale high-temperature superconducting magnet designed and built by Commonwealth Fusion Systems and MIT's Plasma Science and Fusion Center (PSFC) has demonstrated a record-breaking 20-tesla magnetic field. It is the strongest fusion magnet in the world. Ever since nuclear fusion was discovered in the 1930s, scientists have wondered if we could somehow replicate and harness the phenomenon behind starlight--the smashing together of hydrogen atoms to form helium and a stupendous amount of clean energy. Fusing hydrogen would yield times more energy than simply burning it. Unlike nuclear fission, which powers the world's 440 atomic reactors, hydrogen fusion produces no harmful radiation, only neutrons that are captured and added back to the reaction.

artificial intelligence, magnet, social media, (13 more...)

MIT Technology Review

Country:

Europe > Russia (0.14)
Asia > Russia (0.14)
North America > United States > Wisconsin (0.04)
(9 more...)

Genre: Personal > Honors (0.46)

Industry:

Energy > Power Industry > Utilities > Nuclear (0.88)
Government > Regional Government > North America Government > United States Government (0.55)

Technology:

Information Technology > Communications > Social Media (0.95)
Information Technology > Artificial Intelligence (0.71)

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Xia, Peng, Zeng, Kaide, Liu, Jiaqi, Qin, Can, Wu, Fang, Zhou, Yiyang, Xiong, Caiming, Yao, Huaxiu

arXiv.org Artificial Intelligence | Nov-21-2025

Large Language Model (LLM) Agents, often trained with Reinforcement Learning (RL), are constrained by a dependency on human-curated data, limiting scalability and tethering AI to human knowledge. Existing self-evolution frameworks offer an alternative but are typically restricted by the model's inherent capabilities and single-round interactions, hindering the development of complex curricula involving tool use or dynamic reasoning. We introduce Agent0, a fully autonomous framework that evolves high-performing agents without external data through multi-step co-evolution and seamless tool integration. Agent0 establishes a symbiotic competition between two agents initialized from the same base LLM: a curriculum agent that proposes increasingly challenging frontier tasks, and an executor agent that learns to solve them. We integrate external tools to enhance the executor's problem-solving capacity; this improvement, in turn, pressures the curriculum agent to construct more complex, tool-aware tasks. Through this iterative process, Agent0 establishes a self-reinforcing cycle that continuously produces high-quality curricula. Empirically, Agent0 substantially boosts reasoning capabilities, improving the Qwen3-8B-Base model by 18% on mathematical reasoning and 24% on general reasoning benchmarks. Code is available at https://github.com/aiming-lab/Agent0.

arxiv preprint arxiv, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2511.16043

Genre: Research Report (0.50)

Industry: Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Compiling to recurrent neurons

Velez-Ginorio, Joey, Amin, Nada, Kording, Konrad, Zdancewic, Steve

arXiv.org Artificial Intelligence | Nov-20-2025

Discrete structures are currently second-class in differentiable programming. Since functions over discrete structures lack overt derivatives, differentiable programs do not differentiate through them and limit where they can be used. For example, when programming a neural network, conditionals and iteration cannot be used everywhere; they can break the derivatives necessary for gradient-based learning to work. This limits the class of differentiable algorithms we can directly express, imposing restraints on how we build neural networks and differentiable programs more generally. However, these restraints are not fundamental. Recent work shows conditionals can be first-class, by compiling them into differentiable form as linear neurons. Similarly, this work shows iteration can be first-class -- by compiling to linear recurrent neurons. We present a minimal typed, higher-order and linear programming language with iteration called $\textsf{Cajal}\scriptstyle(\mathbb{\multimap}, \mathbb{2}, \mathbb{N})$. We prove its programs compile correctly to recurrent neurons, allowing discrete algorithms to be expressed in a differentiable form compatible with gradient-based learning. With our implementation, we conduct two experiments where we link these recurrent neurons against a neural network solving an iterative image transformation task. This determines part of its function prior to learning. As a result, the network learns faster and with greater data-efficiency relative to a neural network programmed without first-class iteration. A key lesson is that recurrent neurons enable a rich interplay between learning and the discrete structures of ordinary programming.
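As a rough illustration of why iteration can be expressed in a differentiable form, the sketch below treats a loop with a fixed linear body as repeated application of one linear map. This is not the Cajal compilation scheme from the paper; the function name `linear_recurrent_neuron` and the `ADD_THREE` matrix are hypothetical, chosen only to show that such a loop is a composition of linear maps and is therefore differentiable in both the initial state and the weights.

```python
def linear_recurrent_neuron(weight, state, steps):
    """Apply the linear update h <- W h a fixed number of times.

    weight: list of rows (a square matrix), state: list of floats.
    Hypothetical helper for illustration, not the paper's construction.
    """
    for _ in range(steps):
        state = [sum(w * h for w, h in zip(row, state)) for row in weight]
    return state


# Assumed example: "add 3" as a linear map on the pair (x, 1),
# so iterating it n times computes x + 3 * n.
ADD_THREE = [[1.0, 3.0],
             [0.0, 1.0]]

result = linear_recurrent_neuron(ADD_THREE, [2.0, 1.0], steps=4)
print(result)  # [14.0, 1.0]
```

Carrying the constant 1 in the state is a standard trick for writing an affine step as a purely linear one; the number of steps plays the role of the natural-number argument that drives the iteration.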

artificial intelligence, machine learning, neural network, (16 more...)

arXiv.org Artificial Intelligence

2511.14953

Country:

North America > United States > Pennsylvania (0.76)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

5d1f02132ef51602adf07000ca5b6138-Paper-Conference.pdf

Neural Information Processing Systems | Nov-18-2025, 20:48:22 GMT

code change, large language model, machine learning, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Austria > Vienna (0.14)
(18 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (0.67)

Industry: Information Technology (0.46)

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Delayed Gradient Averaging: Tolerate the Communication Latency in Federated Learning

Neural Information Processing Systems | Nov-16-2025, 05:48:35 GMT

Federated Learning is an emerging direction in distributed machine learning that enables jointly training a model without sharing the data.

artificial intelligence, latency, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)

Industry: Information Technology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

85ea6fd7a2ca3960d0cf5201933ac998-Paper.pdf

Neural Information Processing Systems | Nov-14-2025, 22:23:03 GMT

artificial intelligence, constraint, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Indiana (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Energy (0.47)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Robots (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

A Expansive Latent Space Trees Details and Implementation

Neural Information Processing Systems | Nov-14-2025, 19:59:06 GMT

As shown, ELAST achieves better success rates even for versions of CEM with higher average computation time per query.

artificial intelligence, elast, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)

A Proofs

Neural Information Processing Systems | Nov-14-2025, 13:01:45 GMT

When CondInstanceNorm++ is added, we name them "CondResBlock" and "CondRefineBlock". We use the ELU activation function [25] throughout all architectures. The latter is configured according to Technique 1-4. The learning rates and batch sizes are provided in Appendix B.1 and Table 4. EMA with momentum 0.9 is used to smooth the curves in Figure 1. We can interpolate between two different samples from NCSN/NCSNv2 via interpolating the Gaussian random noise injected by annealed Langevin dynamics. As indicated by Figs. 4 and 8, EMA can stabilize training and remove sample [...] FID scores should be interpreted with caution because they may not align well with human judgement.
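The excerpt mentions smoothing curves with an exponential moving average (EMA) at momentum 0.9. A minimal sketch of that kind of smoothing follows, assuming the common recurrence s_t = m * s_(t-1) + (1 - m) * x_t and assuming the first value seeds the average; the function name `ema_smooth` is hypothetical, and the excerpt does not say how the authors initialize or apply it.

```python
def ema_smooth(values, momentum=0.9):
    """Return the exponential moving average of a sequence of floats.

    Assumption: the running state is seeded with the first value,
    then updated as momentum * state + (1 - momentum) * x.
    """
    smoothed = []
    state = None
    for x in values:
        if state is None:
            state = x
        else:
            state = momentum * state + (1.0 - momentum) * x
        smoothed.append(state)
    return smoothed


curve = ema_smooth([1.0, 0.0, 0.0])
```

A higher momentum gives a smoother but more lagged curve, since each new point contributes only a fraction (1 - momentum) to the running value.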

artificial intelligence, machine learning, ncsnv2, (17 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County > Palo Alto (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

5a5aacae31b6d41edf49bc43bccb7c4f-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems | Nov-14-2025, 09:25:57 GMT

artificial intelligence, machine learning, metric value, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks (A OM-ImageNet Details, A.1 Overview)

Neural Information Processing Systems | Nov-13-2025, 13:06:07 GMT

Figure 1: Visual comparison between original images and projected images. All the classification models are trained using two P6000 GPUs with a batch size of 64 for 20 epochs. We study how different choices affect the robustness of the trained networks against unseen attacks. Table 4: Classification accuracy against unseen attacks applied to OM-ImageNet test set. Table 5: Classification accuracy against known (PGD-50 and OM-PGD-50) and unseen attacks. Brighter colors indicate larger absolute differences.

artificial intelligence, machine learning, unseen attack, (11 more...)

Neural Information Processing Systems

Country: