AITopics

2509.04129

Country:

Europe (1.00)
Asia > India (0.68)
North America > United States > California (0.46)

Genre:

Overview (0.46)
Research Report (0.40)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

arXiv.org Artificial IntelligenceSep-5-2025

FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace

Yan, Yineng, Wang, Xidong, Cheng, Jin Seng, Hu, Ran, Guan, Wentao, Farahmand, Nahid, Lin, Hengte, Li, Yue

The emergence of agentic AI, powered by Large Language Models (LLMs), marks a paradigm shift from reactive generative systems to proactive, goal-oriented autonomous agents capable of sophisticated planning, memory, and tool use. This evolution presents a novel opportunity to address long-standing challenges in complex digital environments. Core tasks on Consumer-to-Consumer (C2C) e-commerce platforms often require users to navigate complex Graphical User Interfaces (GUIs), making the experience time-consuming for both buyers and sellers. This paper introduces a novel approach to simplify these interactions through an LLM-powered agentic assistant. This agent functions as a new, conversational entry point to the marketplace, shifting the primary interaction model from a complex GUI to an intuitive AI agent. By interpreting natural language commands, the agent automates key high-friction workflows. For sellers, this includes simplified updating and renewal of listings, and the ability to send bulk messages. For buyers, the agent facilitates a more efficient product discovery process through conversational search. We present the architecture for Facebook Marketplace Assistant (FaMA), arguing that this agentic, conversational paradigm provides a lightweight and more accessible alternative to traditional app interfaces, allowing users to manage their marketplace activities with greater efficiency. Experiments show FaMA achieves a 98% task success rate on solving complex tasks on the marketplace and enables up to a 2x speedup on interaction time.

artificial intelligence, large language model, natural language, (14 more...)

2509.0389

Country: North America > United States (0.70)

Genre:

Research Report (0.70)
Overview (0.48)

Industry: Information Technology > Services (0.71)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

The GuardianSep-4-2025, 06:00:44 GMT

Google Pixel 10 review: the new benchmark for a standard flagship phone

Google's new cheapest Pixel 10 has been upgraded with more cameras, a faster chip and some quality software that has brought it out of the shadow of its pricier Pro siblings to set a new standard of what you should expect from a base-model flagship phone. The Guardian's journalism is independent. We will earn a commission if you buy something through an affiliate link. The design is almost identical to the Pixel 9, except for some new bold colours and the all-important new third camera in the pill-shaped lump on the back. The satin aluminium and glass body feels like a quality piece of hardware and the design certainly stands out in a sea of samey slab phones. The 6.3in OLED screen is crisp, super-bright and smooth with a 120Hz refresh rate.

artificial intelligence, google, pixel 10, (14 more...)

The Guardian

Country: North America > United States (0.05)

Genre:

Research Report (0.40)
Overview (0.40)

Industry:

Media > Photography (0.70)
Information Technology (0.70)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence (1.00)

Securing AI Agents with Information-Flow Control

Costa, Manuel, Köpf, Boris, Kolluri, Aashish, Paverd, Andrew, Russinovich, Mark, Salem, Ahmed, Tople, Shruti, Wutschitz, Lukas, Zanella-Béguelin, Santiago

As AI agents become increasingly autonomous and capable, ensuring their security against vulnerabilities such as prompt injection becomes critical. This paper explores the use of information-flow control (IFC) to provide security guarantees for AI agents. We present a formal model to reason about the security and expressiveness of agent planners. Using this model, we characterize the class of properties enforceable by dynamic taint-tracking and construct a taxonomy of tasks to evaluate security and utility trade-offs of planner designs. Informed by this exploration, we present Fides, a planner that tracks confidentiality and integrity labels, deterministically enforces security policies, and introduces novel primitives for selectively hiding information. Its evaluation in AgentDojo demonstrates that this approach enables us to complete a broad range of tasks with security guarantees. A tutorial to walk readers through the the concepts introduced in the paper can be found at https://github.com/microsoft/fides

artificial intelligence, natural language, tool call, (15 more...)

2505.23643

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > Middle East > Palestine > Gaza Strip > Rafah Governorate > Rafah (0.04)
Asia > Middle East > Israel > Mediterranean Sea (0.04)
Africa > Cameroon > Gulf of Guinea (0.04)

Genre:

Research Report (0.63)
Overview (0.45)
Instructional Material > Course Syllabus & Notes (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Chen, Yifan, Vanden-Eijnden, Eric

Scale-Adaptive Generative Flows for Multiscale Scientific Data

arXiv.org Machine LearningSep-4-2025

Flow-based generative models can face significant challenges when modeling scientific data with multiscale Fourier spectra, often producing large errors in fine-scale features. We address this problem within the framework of stochastic interpolants, via principled design of noise distributions and interpolation schedules. The key insight is that the noise should not be smoother than the target data distribution -- measured by Fourier spectrum decay rates -- to ensure bounded drift fields near the initial time. For Gaussian and near-Gaussian distributions whose fine-scale structure is known, we show that spectrum-matched noise improves numerical efficiency compared to standard white-noise approaches. For complex non-Gaussian distributions, we develop scale-adaptive interpolation schedules that address the numerical ill-conditioning arising from rougher-than-data noise. Numerical experiments on synthetic Gaussian random fields and solutions to the stochastic Allen-Cahn and Navier-Stokes equations validate our approach and demonstrate its ability to generate high-fidelity samples at lower computational cost than traditional approaches.

artificial intelligence, machine learning, noise, (15 more...)

arXiv.org Machine Learning

2509.02971

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
North America > United States > New York (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

arXiv.org Machine LearningSep-4-2025

Feedback-Enhanced Online Multiple Testing with Applications to Conformal Selection

Lu, Lin, Huo, Yuyang, Ren, Haojie, Wang, Zhaojun, Zou, Changliang

We study online multiple testing with feedback, where decisions are made sequentially and the true state of the hypothesis is revealed after the decision has been made, either instantly or with a delay. We propose GAIF, a feedback-enhanced generalized alpha-investing framework that dynamically adjusts thresholds using revealed outcomes, ensuring finite-sample false discovery rate (FDR)/marginal FDR control. Extending GAIF to online conformal testing, we construct independent conformal $p$-values and introduce a feedback-driven model selection criterion to identify the best model/score, thereby improving statistical power. We demonstrate the effectiveness of our methods through numerical simulations and real-data applications.

artificial intelligence, machine learning, procedure, (17 more...)

arXiv.org Machine Learning

2509.03297

Country:

Asia > Middle East > Jordan (0.04)
Asia > China > Shanghai > Shanghai (0.04)
North America > United States > New York (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)

Genre:

Research Report (0.54)
Overview (0.45)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)

Embodied AI: Emerging Risks and Opportunities for Policy Action

Perlo, Jared, Robey, Alexander, Barez, Fazl, Floridi, Luciano, Mökander, Jakob

The field of embodied AI (EAI) is rapidly advancing. Unlike virtual AI, EAI systems can exist in, learn from, reason about, and act in the physical world. With recent advances in AI models and hardware, EAI systems are becoming increasingly capable across wider operational domains. While EAI systems can offer many benefits, they also pose significant risks, including physical harm from malicious use, mass surveillance, as well as economic and societal disruption. These risks require urgent attention from policymakers, as existing policies governing industrial robots and autonomous vehicles are insufficient to address the full range of concerns EAI systems present. To help address this issue, this paper makes three contributions. First, we provide a taxonomy of the physical, informational, economic, and social risks EAI systems pose. Second, we analyze policies in the US, EU, and UK to assess how existing frameworks address these risks and to identify critical gaps. We conclude by offering policy recommendations for the safe and beneficial deployment of EAI systems, such as mandatory testing and certification schemes, clarified liability frameworks, and strategies to manage EAI's potentially transformative economic and societal impacts.

eai system, large language model, machine learning, (19 more...)

2509.00117

Country:

Asia (1.00)
Europe > United Kingdom > England (0.93)
North America > United States > Massachusetts > Middlesex County (0.28)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.45)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey

Shao, Rui, Li, Wei, Zhang, Lingsen, Zhang, Renshan, Liu, Zhiyang, Chen, Ran, Nie, Liqiang

Robotic manipulation, a key frontier in robotics and embodied AI, requires precise motor control and multimodal understanding, yet traditional rule-based methods fail to scale or generalize in unstructured, novel environments. In recent years, Vision-Language-Action (VLA) models, built upon Large Vision-Language Models (VLMs) pretrained on vast image-text datasets, have emerged as a transformative paradigm. This survey provides the first systematic, taxonomy-oriented review of large VLM-based VLA models for robotic manipulation. We begin by clearly defining large VLM-based VLA models and delineating two principal architectural paradigms: (1) monolithic models, encompassing single-system and dual-system designs with differing levels of integration; and (2) hierarchical models, which explicitly decouple planning from execution via interpretable intermediate representations. Building on this foundation, we present an in-depth examination of large VLM-based VLA models: (1) integration with advanced domains, including reinforcement learning, training-free optimization, learning from human videos, and world model integration; (2) synthesis of distinctive characteristics, consolidating architectural traits, operational strengths, and the datasets and benchmarks that support their development; (3) identification of promising directions, including memory mechanisms, 4D perception, efficient adaptation, multi-agent cooperation, and other emerging capabilities. This survey consolidates recent advances to resolve inconsistencies in existing taxonomies, mitigate research fragmentation, and fill a critical gap through the systematic integration of studies at the intersection of large VLMs and robotic manipulation. We provide a regularly updated project page to document ongoing progress: https://github.com/JiuTian-VL/Large-VLM-based-VLA-for-Robotic-Manipulation

large language model, machine learning, reinforcement learning, (20 more...)

2508.13073

Country: Asia > China (0.27)

Genre: Overview (1.00)

Industry: Education > Educational Setting (0.45)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(4 more...)

Deep Research Agents: A Systematic Examination And Roadmap

Huang, Yuxuan, Chen, Yihang, Zhang, Haozheng, Li, Kang, Zhou, Huichi, Fang, Meng, Yang, Linyi, Li, Xiaoguang, Shang, Lifeng, Xu, Songcen, Hao, Jianye, Shao, Kun, Wang, Jun

The rapid progress of Large Language Models (LLMs) has given rise to a new category of autonomous AI systems, referred to as Deep Research (DR) agents. These agents are designed to tackle complex, multi-turn informational research tasks by leveraging a combination of dynamic reasoning, adaptive long-horizon planning, multi-hop information retrieval, iterative tool use, and the generation of structured analytical reports. In this paper, we conduct a detailed analysis of the foundational technologies and architectural components that constitute Deep Research agents. We begin by reviewing information acquisition strategies, contrasting API-based retrieval methods with browser-based exploration. We then examine modular tool-use frameworks, including code execution, multimodal input processing, and the integration of Model Context Protocols (MCPs) to support extensibility and ecosystem development. To systematize existing approaches, we propose a taxonomy that differentiates between static and dynamic workflows, and we classify agent architectures based on planning strategies and agent composition, including single-agent and multi-agent configurations. We also provide a critical evaluation of current benchmarks, highlighting key limitations such as restricted access to external knowledge, sequential execution inefficiencies, and misalignment between evaluation metrics and the practical objectives of DR agents. Finally, we outline open challenges and promising directions for future research. A curated and continuously updated repository of DR agent research is available at: {https://github.com/ai-agents-2030/awesome-deep-research-agent}.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

2506.18096

Genre:

Overview (1.00)
Research Report (0.82)
Workflow (0.69)

Industry: Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
(2 more...)

Bentley, Peter J., Lim, Soo Ling, Ishikawa, Fuyuki

Situating AI Agents in their World: Aspective Agentic AI for Dynamic Partially Observable Information Systems

Agentic LLM AI agents are often little more than autonomous chatbots: actors following scripts, often controlled by an unreliable director. This work introduces a bottom-up framework that situates AI agents in their environment, with all behaviors triggered by changes in their environments. It introduces the notion of aspects, similar to the idea of umwelt, where sets of agents perceive their environment differently to each other, enabling clearer control of information. We provide an illustrative implementation and show that compared to a typical architecture, which leaks up to 83% of the time, aspective agentic AI enables zero information leakage. We anticipate that this concept of specialist agents working efficiently in their own information niches can provide improvements to both security and efficiency.

large language model, machine learning, natural language, (17 more...)

2509.0338

Country: Asia > Japan > Honshū (0.29)

Genre:

Overview (0.68)
Research Report (0.50)

Industry:

Information Technology > Security & Privacy (0.93)
Government (0.70)
Health & Medicine > Therapeutic Area > Immunology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)