Agents
Supplementary Material for " Hierarchical Adaptive Value Estimation for Multi-modal Visual Reinforcement Learning " Y angru Huang
The contents of this supplementary material are organized as follows: Section A provides additional experimental results, including more results with three modalities, performance under dynamic weathers, performance under several challenging or extreme environmental conditions ( e.g., increased number of vehicles and dazzling sunlight), results on DeepMind Control Suit, and ablation study of auxiliary losses and the design of re-fusion. Section B provides further discussions related to our approach. This includes a comparison between value-level dynamic fusion and feature-level dynamic fusion supported by empirical results, the advantages of hierarchical bi-level fusion over uni-level fusion, and the relationship and differences between our approach and the value decomposition techniques in multi-agent RL. Section C describes the details of the experimental setup, including network architectures, hyper-parameters, and hardware details. Section D states the potential negative societal impacts of our work.
MONAQ: Multi-Objective Neural Architecture Querying for Time-Series Analysis on Resource-Constrained Devices
The growing use of smartphones and IoT devices necessitates efficient time-series analysis on resource-constrained hardware, which is critical for sensing applications such as human activity recognition and air quality prediction. Recent efforts in hardware-aware neural architecture search (NAS) automate architecture discovery for specific platforms; however, none focus on general time-series analysis with edge deployment. Leveraging the problem-solving and reasoning capabilities of large language models (LLM), we propose MONAQ, a novel framework that reformulates NAS into Multi-Objective Neural Architecture Querying tasks. MONAQ is equipped with multimodal query generation for processing multimodal time-series inputs and hardware constraints, alongside an LLM agent-based multi-objective search to achieve deployment-ready models via code generation. By integrating numerical data, time-series images, and textual descriptions, MONAQ improves an LLM's understanding of time-series data. Experiments on fifteen datasets demonstrate that MONAQ-discovered models outperform both handcrafted models and NAS baselines while being more efficient.