user interface
AndroidInTheWild: A Large-Scale Dataset For Android Device Control
There is a growing interest in device-control systems that can interpret human natural language instructions and execute them on a digital device by directly controlling its user interface. We present a dataset for device-control research, Android in the Wild (AitW), which is orders of magnitude larger than current datasets. The dataset contains human demonstrations of device interactions, including the screens and actions, and corresponding natural language instructions. It consists of 715k episodes spanning 30k unique instructions, four versions of Android (v10-13), and eight device types (Pixel 2 XL to Pixel 6) with varying screen resolutions. It contains multi-step tasks that require semantic understanding of language and visual context. This dataset poses a new challenge: actions available through the user interface must be inferred from their visual appearance, and, instead of simple UI element-based actions, the action space consists of precise gestures (e.g., horizontal scrolls to operate carousel widgets). We organize our dataset to encourage robustness analysis of device-control systems, i.e., how well a system performs in the presence of new task descriptions, new applications, or new platform versions. We develop two agents and report performance across the dataset.
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning
The financial markets, which involve over \$90 trillion market capitals, attract the attention of innumerable profit-seeking investors globally. Recent explosion of reinforcement learning in financial trading (RLFT) research has shown stellar performance on many quantitative trading tasks. However, it is still challenging to deploy reinforcement learning (RL) methods into real-world financial markets due to the highly composite nature of this domain, which entails design choices and interactions between components that collect financial data, conduct feature engineering, build market environments, make investment decisions, evaluate model behaviors and offers user interfaces. Despite the availability of abundant financial data and advanced RL techniques, a remarkable gap still exists between the potential and realized utilization of RL in financial trading. In particular, orchestrating an RLFT project lifecycle poses challenges in engineering (i.e.
Meta Poached Apple's Top Design Guys to Fix Its Software UI
Meta wants to make its AI hardware slicker and more fashion-forward. It also needs to make its software more usable. The way to do all that appears to be hiring design maestros away from Apple. Meta has made a big move to hire two prominent designers away from rival tech giant Apple, likely putting them to work on designing Meta's next generation of AI hardware and the software that runs on it. Alan Dye, formerly Apple's vice president of Human Interface Design, will join Meta to head up a new design studio within Meta's Reality Labs.
- Asia > Nepal (0.15)
- North America > United States > Louisiana (0.05)
- North America > United States > Virginia (0.05)
- (5 more...)
- Information Technology > Hardware (0.55)
- Information Technology > Services (0.48)
- Government > Immigration & Customs (0.48)
- Energy > Renewable > Geothermal (0.48)
Building the Web for Agents: A Declarative Framework for Agent-Web Interaction
Schultze, Sven, Kietzmann, Meike Verena, Schönfeld, Nils-Lucas, Stock-Homburg, Ruth
The increasing deployment of autonomous AI agents on the web is hampered by a fundamental misalignment: agents must infer affordances from human-oriented user interfaces, leading to brittle, inefficient, and insecure interactions. To address this, we introduce VOIX, a web-native framework that enables websites to expose reliable, auditable, and privacy-preserving capabilities for AI agents through simple, declarative HTML elements. VOIX introduces
- Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
- Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.04)
- North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
- (3 more...)
- Information Technology > Communications > Web (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
Context-aware Adaptive Visualizations for Critical Decision Making
Lopez-Cardona, Angela, Bruns, Mireia Masias, Attygalle, Nuwan T., Idesis, Sebastian, Salvatori, Matteo, Raftopoulos, Konstantinos, Oikonomou, Konstantinos, Duraisamy, Saravanakumar, Emami, Parvin, Latreche, Nacera, Sahraoui, Alaa Eddine Anis, Vakallelis, Michalis, Vanderdonckt, Jean, Arapakis, Ioannis, Leiva, Luis A.
Effective decision-making often relies on timely insights from complex visual data. While Information Visualization (InfoVis) dashboards can support this process, they rarely adapt to users' cognitive state, and less so in real time. We present Symbiotik, an intelligent, context-aware adaptive visualization system that leverages neurophysiological signals to estimate mental workload (MWL) and dynamically adapt visual dashboards using reinforcement learning (RL). Through a user study with 120 participants and three visualization types, we demonstrate that our approach improves task performance and engagement. Symbiotik offers a scalable, real-time adaptation architecture, and a validated methodology for neuroadaptive user interfaces.
- North America > United States (0.04)
- Europe > Sweden > Uppsala County > Uppsala (0.04)
- Europe > Spain (0.04)
- (5 more...)
- Research Report > New Finding (0.68)
- Questionnaire & Opinion Survey (0.47)
Proposing a Framework for Machine Learning Adoption on Legacy Systems
Rahman, Ashiqur, Alhoori, Hamed
The integration of machine learning (ML) is critical for industrial competitiveness, yet its adoption is frequently stalled by the prohibitive costs and operational disruptions of upgrading legacy systems. The financial and logistical overhead required to support the full ML lifecycle presents a formidable barrier to widespread implementation, particularly for small and medium-sized enterprises. This paper introduces a pragmatic, API-based framework designed to overcome these challenges by strategically decoupling the ML model lifecycle from the production environment. Our solution delivers the analytical power of ML to domain experts through a lightweight, browser-based interface, eliminating the need for local hardware upgrades and ensuring model maintenance can occur with zero production downtime. This human-in-the-loop approach empowers experts with interactive control over model parameters, fostering trust and facilitating seamless integration into existing workflows. By mitigating the primary financial and operational risks, this framework offers a scalable and accessible pathway to enhance production quality and safety, thereby strengthening the competitive advantage of the manufacturing sector.
- North America > United States > Illinois > DeKalb County > DeKalb (0.04)
- Europe > Switzerland > Basel-City > Basel (0.04)
- Asia > Singapore > Central Region > Singapore (0.04)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (0.94)
- Banking & Finance (0.68)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.68)
Does Interpretability of Knowledge Tracing Models Support Teacher Decision Making?
Khalid, Adia, Deriyeva, Alina, Paassen, Benjamin
Knowledge tracing (KT) models are a crucial basis for pedagogical decision-making, namely which task to select next for a learner and when to stop teaching a particular skill. Given the high stakes of pedagogical decisions, KT models are typically required to be interpretable, in the sense that they should implement an explicit model of human learning and provide explicit estimates of learners' abilities. However, to our knowledge, no study to date has investigated whether the interpretability of KT models actually helps human teachers to make teaching decisions. We address this gap. First, we perform a simulation study to show that, indeed, decisions based on interpretable KT models achieve mastery faster compared to decisions based on a non-interpretable model. Second, we repeat the study but ask $N=12$ human teachers to make the teaching decisions based on the information provided by KT models. As expected, teachers rate interpretable KT models higher in terms of usability and trustworthiness. However, the number of tasks needed until mastery hardly differs between KT models. This suggests that the relationship between model interpretability and teacher decisions is not straightforward: teachers do not solely rely on KT models to make decisions and further research is needed to investigate how learners and teachers actually understand and use KT models.
- Research Report > New Finding (1.00)
- Research Report > Experimental Study > Negative Result (0.46)
Generative Interfaces for Language Models
Chen, Jiaqi, Zhang, Yanzhe, Zhang, Yutong, Shao, Yijia, Yang, Diyi
Large language models (LLMs) are increasingly seen as assistants, copilots, and consultants, capable of supporting a wide range of tasks through natural conversation. However, most systems remain constrained by a linear request-response format that often makes interactions inefficient in multi-turn, information-dense, and exploratory tasks. To address these limitations, we propose Generative Interfaces for Language Models, a paradigm in which LLMs respond to user queries by proactively generating user interfaces (UIs) that enable more adaptive and interactive engagement. Our framework leverages structured interface-specific representations and iterative refinements to translate user queries into task-specific UIs. For systematic evaluation, we introduce a multidimensional assessment framework that compares generative interfaces with traditional chat-based ones across diverse tasks, interaction patterns, and query types, capturing functional, interactive, and emotional aspects of user experience. Results show that generative interfaces consistently outperform conversational ones, with up to a 72% improvement in human preference. These findings clarify when and why users favor generative interfaces, paving the way for future advancements in human-AI interaction.
- Europe > Austria > Vienna (0.14)
- North America > United States (0.04)
- Instructional Material > Course Syllabus & Notes (0.67)
- Research Report > New Finding (0.48)
- Health & Medicine (0.69)
- Education > Educational Setting (0.46)
- Information Technology > Human Computer Interaction (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)