Goto

Collaborating Authors

 system environment




Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents

Song, Kevin, Jayarajan, Anand, Ding, Yaoyao, Su, Qidong, Zhu, Zhanda, Liu, Sihang, Pekhimenko, Gennady

arXiv.org Artificial Intelligence

Large Language Models (LLMs) agents augmented with domain tools promise to autonomously execute complex tasks requiring human-level intelligence, such as customer service and digital assistance. However, their practical deployment is often limited by their low success rates under complex real-world environments. To tackle this, prior research has primarily focused on improving the agents themselves, such as developing strong agentic LLMs, while overlooking the role of the system environment in which the agent operates. In this paper, we study a complementary direction: improving agent success rates by optimizing the system environment in which the agent operates. We collect 142 agent traces (3,656 turns of agent-environment interactions) across 5 state-of-the-art agentic benchmarks. By analyzing these agent failures, we propose a taxonomy for agent-environment interaction failures that includes 6 failure modes. Guided by these findings, we design Aegis, a set of targeted environment optimizations: 1) environment observability enhancement, 2) common computation offloading, and 3) speculative agentic actions. These techniques improve agent success rates on average by 6.7-12.5%, without any modifications to the agent and underlying LLM.




Attention-based Supply-Demand Prediction for Autonomous Vehicles

Zhang, Zikai, Li, Yidong, Dong, Hairong, You, Yizhe, Zhao, Fengping

arXiv.org Machine Learning

As one of the important functions of the intelligent transportation system (ITS), supply-demand prediction for autonomous vehicles provides a decision basis for its control. In this paper, we present two prediction models (i.e. ARLP model and Advanced ARLP model) based on two system environments that only the current day's historical data is available or several days' historical data are available. These two models jointly consider the spatial, temporal, and semantic relations. Spatial dependency is captured with residual network and dimension reduction. Short term temporal dependency is captured with LSTM. Long term temporal dependency and temporal shifting are captured with LSTM and attention mechanism. Semantic dependency is captured with multi-attention mechanism and autocorrelation coefficient method. Extensive experiments show that our frameworks provide more accurate and stable prediction results than the existing methods.


When it comes to controls, should we keep AI on a leash?

#artificialintelligence

In a previous article for DCD I suggested that if true data center thermal optimization is to be achieved, it requires a proven, safe process based on thousands of real-time sensors and expert spatial models. Inevitably this involves the collection of huge volumes of information, so it's critical that you can absolutely rely on the data you're gathering if you're to succeed in removing much of the uncertainty from data center cooling. That's particularly the case when you're considering the kind of control models needed to monitor critical cooling duty performance for your data center estate's multiple CRAC/AHU units. Of course, you'll want to track your data center cooling loads in real-time using standard temperature and current measurement sensors for both chilled water and direct expansion cooling systems. However, it's also important to continuously monitor air inlet and outlet temperatures – along with variables such as fan performance, filter quality, and alerts for potential CRAC/AHU blockages.


philferriere/dlwin

#artificialintelligence

There are certainly a lot of guides to assist you build great deep learning (DL) setups on Linux or Mac OS (including with Tensorflow which, unfortunately, as of this posting, cannot be easily installed on Windows), but few care about building an efficient Windows 10-native setup. Most focus on running an Ubuntu VM hosted on Windows or using Docker, unnecessary - and ultimately sub-optimal - steps. We also found enough misguiding/deprecated information out there to make it worthwhile putting together a step-by-step guide for the latest stable versions of Keras, Tensorflow, CNTK, MXNet, and PyTorch. Used either together (e.g., Keras with Tensorflow backend), or independently -- PyTorch cannot be used as a Keras backend, TensorFlow can be used on its own -- they make for some of the most powerful deep learning python libraries to work natively on Windows. If you must run your DL setup on Windows 10, then the information contained here will hopefully be useful to you.