AITopics | Education

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai 1,2 Yifei Zhou

Neural Information Processing SystemsOct-9-2025, 19:27:25 GMT

While training with static demonstrations has shown some promise, we show that such methods fall short for controlling real GUIs due to their failure to deal with real world stochasticity and non-stationarity not captured in static observational data.

agent, digirl, trajectory, (17 more...)

Neural Information Processing Systems

Country:

South America > Chile (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Illinois (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry:

Information Technology > Services (0.68)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

16c628ab12dc4caca8e7712affa6c767-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 19:26:38 GMT

algorithm, arxiv preprint arxiv, assumption 4, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
(2 more...)

Add feedback

1697e3fb412da11dc9488249f9e7bbc9-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-9-2025, 19:26:07 GMT

dataset, high precision resolution, resolution, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.05)
Asia > China > Beijing > Beijing (0.05)
North America > United States (0.05)

Industry: Education (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)

Add feedback

163b048741e1deea2b3d9a46c2c88af3-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 19:19:41 GMT

selection, subgoal, subtask, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(4 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Information Technology (0.67)
Education (0.46)
Health & Medicine (0.45)

Technology:

Information Technology > Artificial Intelligence > Robots (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language (0.67)
(3 more...)

Add feedback

Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Neural Information Processing SystemsOct-9-2025, 19:17:38 GMT

Automating mathematical reasoning is a longstanding goal in artificial intelligence (Newell et al., 1957). A prominent line of work on the problem (Li et al., 2024) uses neural models to direct

benchmark, formalization, lean 4, (13 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.67)

Industry: Education > Educational Setting (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

14c018d2e72c521605b0567029ef0efb-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 19:09:22 GMT

In this work, we propose a novel calibration method that can be used to combat hallucinations.

aclanthology, computational linguistic, mistral-7b-v0, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Indonesia > Bali (0.04)
(14 more...)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.68)

Industry:

Education (0.67)
Government (0.46)
Law (0.46)
Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
(2 more...)

Add feedback

On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions

Neural Information Processing SystemsOct-9-2025, 19:08:53 GMT

In this paper, we study Adam in non-convex smooth scenarios with potential unbounded gradients and affine variance noise. We consider a general noise model which governs affine variance noise, bounded noise, and sub-Gaussian noise. We show that Adam with a specific hyper-parameter setup can find a stationary point with a O (1 / T) rate in high probability under this general noise model where T denotes total number iterations, matching the lower rate of stochastic first-order algorithms up to logarithm factors.

assumption, convergence, probability, (14 more...)

Neural Information Processing Systems

Country: