DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
Hao Bai 1,2 Yifei Zhou
–Neural Information Processing Systems
While training with static demonstrations has shown some promise, we show that such methods fall short for controlling real GUIs because they fail to handle the real-world stochasticity and non-stationarity that static observational data does not capture.
Oct-9-2025, 19:27:25 GMT
- Country:
- South America > Chile (0.04)
- North America > United States
- Illinois (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Asia
- Middle East > Jordan (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology > Services (0.68)
- Education > Educational Setting
- Online (0.46)
- Technology: