AITopics

--Under-actuated compliant robotic systems offer a promising approach to mitigating actuation and control challenges by harnessing pre-designed, embodied dynamic behaviors. This paper presents Flix-Walker, a novel, untethered, centimeter-scale quadrupedal robot inspired by compliant under-actuated mechanisms. Flix-Walker employs flexible, helix-shaped beams as legs, which are actuated by vibrations from just two motors to achieve three distinct mobility modes. We analyze the actuation parameters required to generate various locomotion modes through both simulation and prototype experiments. The effects of system and environmental variations on locomotion performance are examined, and we propose a generic metric for selecting control parameters that produce robust and functional motions. Under-actuated, compliant systems exploit structural dynamics to produce complex robotic motions for locomotion and manipulation, while reducing actuation demands. Leveraging these dynamic behaviors diminishes the need for active actuation, lowers controller complexity, reduces actuator count, and simplifies fabrication [1], [2]. Legged robots offer superior maneuverability in cluttered terrain compared to wheeled or tracked platforms [3], [4].

artificial intelligence, flix-walker, robot, (18 more...)

2507.02547

Country:

North America > United States > Michigan > Wayne County > Detroit (0.04)
North America > United States > Arizona > Maricopa County > Mesa (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Energy (0.68)

Technology: Information Technology > Artificial Intelligence > Robots > Locomotion (0.88)

WebSailor: Navigating Super-human Reasoning for Web Agent

Li, Kuan, Zhang, Zhongwang, Yin, Huifeng, Zhang, Liwen, Ou, Litu, Wu, Jialong, Yin, Wenbiao, Li, Baixuan, Tao, Zhengwei, Wang, Xinyu, Shen, Weizhou, Zhang, Junkai, Zhang, Dingchu, Wu, Xixi, Jiang, Yong, Yan, Ming, Xie, Pengjun, Huang, Fei, Zhou, Jingren

Transcending human cognitive limitations represents a critical frontier in LLM training. Proprietary agentic systems like DeepResearch have demonstrated superhuman capabilities on extremely complex information-seeking benchmarks such as BrowseComp, a feat previously unattainable. We posit that their success hinges on a sophisticated reasoning pattern absent in open-source models: the ability to systematically reduce extreme uncertainty when navigating vast information landscapes. Based on this insight, we introduce WebSailor, a complete post-training methodology designed to instill this crucial capability. Our approach involves generating novel, high-uncertainty tasks through structured sampling and information obfuscation, RFT cold start, and an efficient agentic RL training algorithm, Duplicating Sampling Policy Optimization (DUPO). With this integrated pipeline, WebSailor significantly outperforms all opensource agents in complex information-seeking tasks, matching proprietary agents' performance and closing the capability gap.

arxiv preprint arxiv, large language model, machine learning, (16 more...)

2507.02592

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
South America > Colombia (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)

Genre: Research Report (1.00)

Industry: Energy (1.00)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

ArtGS:3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects

Yu, Qiaojun, Yuan, Xibin, jiang, Yu, Chen, Junting, Zheng, Dongzhe, Hao, Ce, You, Yang, Chen, Yixing, Mu, Yao, Liu, Liu, Lu, Cewu

Articulated object manipulation remains a critical challenge in robotics due to the complex kinematic constraints and the limited physical reasoning of existing methods. In this work, we introduce ArtGS, a novel framework that extends 3D Gaussian Splatting (3DGS) by integrating visual-physical modeling for articulated object understanding and interaction. ArtGS begins with multi-view RGB-D reconstruction, followed by reasoning with a vision-language model (VLM) to extract semantic and structural information, particularly the articulated bones. Through dynamic, differentiable 3DGS-based rendering, ArtGS optimizes the parameters of the articulated bones, ensuring physically consistent motion constraints and enhancing the manipulation policy. By leveraging dynamic Gaussian splatting, cross-embodiment adaptability, and closed-loop optimization, ArtGS establishes a new framework for efficient, scalable, and generalizable articulated object modeling and manipulation. Experiments conducted in both simulation and real-world environments demonstrate that ArtGS significantly outperforms previous methods in joint estimation accuracy and manipulation success rates across a variety of articulated objects. Additional images and videos are available on the project website: https://sites.google.com/view/artgs/home

artificial intelligence, manipulation, object-oriented architecture, (16 more...)

2507.026

Country: Asia > China (0.29)

Genre: Research Report (0.82)

Industry: Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.35)

Empowering Intelligent Low-altitude Economy with Large AI Model Deployment

Lyu, Zhonghao, Gao, Yulan, Chen, Junting, Du, Hongyang, Xu, Jie, Huang, Kaibin, Kim, Dong In

--Low-altitude economy (LAE) represents an emerging economic paradigm that redefines commercial and social aerial activities. Large artificial intelligence models (LAIMs) offer transformative potential to further enhance the intelligence of LAE services. However, deploying LAIMs in LAE poses several challenges, including the significant gap between their computational/storage demands and the limited onboard resources of LAE entities, the mismatch between lab-trained LAIMs and dynamic physical environments, and the inefficiencies of traditional decoupled designs for sensing, communication, and computation. T o address these issues, we first propose a hierarchical system architecture tailored for LAIM deployment and present representative LAE application scenarios. Next, we explore key enabling techniques that facilitate the mutual co-evolution of LAIMs and low-altitude systems, and introduce a task-oriented execution pipeline for scalable and adaptive service delivery. Then, the proposed framework is validated through real-world case studies. Finally, we outline open challenges to inspire future research. The low-altitude economy (LAE) is rapidly emerging as a critical engine of global industrial innovation and economic growth.

communication, machine learning, natural language, (18 more...)

2505.22343

Country: Asia > China (0.29)

Genre: Research Report (0.50)

Industry: Energy > Power Industry (0.68)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Burange, Rahul A., Shinde, Harsh K., Mutyalwar, Omkar

Landslide Detection and Mapping Using Deep Learning Across Multi-Source Satellite Data and Geographic Regions

Abstract: Landslides pose severe threats to infrastructure, economies, and human lives, necessitating accurate detection and predictive mapping across diverse geographic regions. With advancements in deep learning and remote sensing, automated landslide detection has become increasingly effective. This study presents a comprehensive approach integrating multi-source satellite imagery and deep learning models to enhance landslide identification and prediction. We leverage Sentinel-2 multispectral data and ALOS PALSAR-derived slope and Digital Elevation Model (DEM) layers to capture critical environmental features influencing landslide occurrences. Various geospatial analysis techniques are employed to assess the impact of terrain characteristics, vegetation cover, and rainfall on detection accuracy. Additionally, we evaluate the performance of multiple state-of-the-art deep learning segmentation models, including U-Net, DeepLabV3+, and Res-Net, to determine their effectiveness in landslide detection. The proposed framework contributes to the development of reliable early warning systems, improved disaster risk management, and sustainable land-use planning. Our findings provide valuable insights into the potential of deep learning and multi-source remote sensing in creating robust, scalable, and transferable landslide prediction models. Landslides represent a significant natural hazard, causing substantial environmental and socio-economic damage worldwide. The increasing frequency of extreme weather events, deforestation, and rapid urbanization have exacerbated the risks associated with landslides, highlighting the need for effective detection and monitoring strategies. Traditional landslide mapping techniques, including field surveys and manual interpretation of satellite imagery, are time-consuming, costly, and often constrained by limited spatial coverage.

artificial intelligence, landslide detection, machine learning, (14 more...)

doi: 10.2139/ssrn.5225437

2507.01123

Country:

Asia > India (0.14)
Asia > Japan (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.98)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

AirV2X: Unified Air-Ground Vehicle-to-Everything Collaboration

Gao, Xiangbo, Wu, Yuheng, Yang, Fengze, Luo, Xuewen, Wu, Keshu, Chen, Xinghao, Wang, Yuping, Liu, Chenxi, Zhou, Yang, Tu, Zhengzhong

While multi-vehicular collaborative driving demonstrates clear advantages over single-vehicle autonomy, traditional infrastructure-based V2X systems remain constrained by substantial deployment costs and the creation of "uncovered danger zones" in rural and suburban areas. We present AirV2X-Perception, a large-scale dataset that leverages Unmanned Aerial Vehicles (UAVs) as a flexible alternative or complement to fixed Road-Side Units (RSUs). Drones offer unique advantages over ground-based perception: complementary bird's-eye-views that reduce occlusions, dynamic positioning capabilities that enable hovering, patrolling, and escorting navigation rules, and significantly lower deployment costs compared to fixed infrastructure. Our dataset comprises 6.73 hours of drone-assisted driving scenarios across urban, suburban, and rural environments with varied weather and lighting conditions. The AirV2X-Perception dataset facilitates the development and standardized evaluation of Vehicle-to-Drone (V2D) algorithms, addressing a critical gap in the rapidly expanding field of aerial-assisted autonomous driving systems. The dataset and development kits are open-sourced at https://github.com/taco-group/AirV2X-Perception.

artificial intelligence, machine learning, natural language, (17 more...)

2506.19283

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Industry:

Energy (1.00)
Transportation > Ground > Road (0.89)
Automobiles & Trucks (0.89)
Information Technology > Robotics & Automation (0.69)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
(3 more...)

Alwarafy, Abdulmalik, Ciftler, Bekir Sait, Abdallah, Mohamed, Hamdi, Mounir, Al-Dhahir, Naofal

Hierarchical Multi-Agent DRL-Based Framework for Joint Multi-RAT Assignment and Dynamic Resource Allocation in Next-Generation HetNets

This paper considers the problem of cost-aware downlink sum-rate maximization via joint optimal radio access technologies (RATs) assignment and power allocation in next-generation heterogeneous wireless networks (HetNets). We consider a future HetNet comprised of multi-RATs and serving multi-connectivity edge devices (EDs), and we formulate the problem as mixed-integer non-linear programming (MINP) problem. Due to the high complexity and combinatorial nature of this problem and the difficulty to solve it using conventional methods, we propose a hierarchical multi-agent deep reinforcement learning (DRL)-based framework, called DeepRAT, to solve it efficiently and learn system dynamics. In particular, the DeepRAT framework decomposes the problem into two main stages; the RATs-EDs assignment stage, which implements a single-agent Deep Q Network (DQN) algorithm, and the power allocation stage, which utilizes a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm. Using simulations, we demonstrate how the various DRL agents efficiently interact to learn system dynamics and derive the global optimal policy. Furthermore, our simulation results show that the proposed DeepRAT algorithm outperforms existing state-of-the-art heuristic approaches in terms of network utility. Finally, we quantitatively show the ability of the DeepRAT model to quickly and dynamically adapt to abrupt changes in network dynamics, such as EDs mobility.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

doi: 10.1109/TNSE.2022.3164648

2202.13652

Genre: Research Report > New Finding (0.66)

Industry:

Telecommunications (0.46)
Energy (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Chaubey, Shivam, Verdoja, Francesco, Deka, Shankar, Kyrki, Ville

MISC: Minimal Intervention Shared Control with Guaranteed Safety under Non-Convex Constraints

Shared control combines human intention with autonomous decision-making, from low-level safety overrides to high-level task guidance, enabling systems that adapt to users while ensuring safety and performance. This enhances task effectiveness and user experience across domains such as assistive robotics, teleoperation, and autonomous driving. However, existing shared control methods, based on e.g. Model Predictive Control, Control Barrier Functions, or learning-based control, struggle with feasibility, scalability, or safety guarantees, particularly since the user input is unpredictable. To address these challenges, we propose an assistive controller framework based on Constrained Optimal Control Problem that incorporates an offline-computed Control Invariant Set, enabling online computation of control actions that ensure feasibility, strict constraint satisfaction, and minimal override of user intent. Moreover, the framework can accommodate structured class of non-convex constraints, which are common in real-world scenarios. We validate the approach through a large-scale user study with 66 participants--one of the most extensive in shared control research--using a computer game environment to assess task load, trust, and perceived control, in addition to performance. The results show consistent improvements across all these aspects without compromising safety and user intent.

artificial intelligence, constraint, machine learning, (19 more...)

2507.02438

Country: North America > United States (0.68)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > New Finding (0.54)

Industry:

Leisure & Entertainment > Games > Computer Games (0.86)
Energy (0.67)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Ingram, William A., Banerjee, Bipasha, Fox, Edward A.

When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search

Large language models (LLMs) are increasingly used to assign document relevance labels in information retrieval pipelines, especially in domains lacking human-labeled data. However, different models often disagree on borderline cases, raising concerns about how such disagreement affects downstream retrieval. This study examines labeling disagreement between two open-weight LLMs, LLaMA and Qwen, on a corpus of scholarly abstracts related to Sustainable Development Goals (SDGs) 1, 3, and 7. We isolate disagreement subsets and examine their lexical properties, rank-order behavior, and classification predictability. Our results show that model disagreement is systematic, not random: disagreement cases exhibit consistent lexical patterns, produce divergent top-ranked outputs under shared scoring functions, and are distinguishable with AUCs above 0.74 using simple classifiers. These findings suggest that LLM-based filtering introduces structured variability in document retrieval, even under controlled prompting and shared ranking logic. We propose using classification disagreement as an object of analysis in retrieval evaluation, particularly in policy-relevant or thematic search tasks.

large language model, machine learning, natural language, (17 more...)

2507.02139

Country: North America > United States > Virginia (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Energy > Energy Storage (0.68)
Electrical Industrial Apparatus (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

FOX NewsJul-3-2025, 23:33:04 GMT

Artificial intelligence fuels Big Tech partnerships with nuclear energy producers

Fox News chief political anchor Bret Baier delves into the demand for energy amid developing artificial intelligence technology on'Special Report.' There has been little change in U.S. energy consumption over the past decade. Increased efforts to make energy use more efficient have kept levels low. But over the next five years, demand for electricity to power data centers is expected to more than double. Some estimates show the facilities are expected to require as much energy in 2030 as the entire country of Japan does today.

electricity, intelligence fuel big tech partnership, partnership, (13 more...)

FOX News

Country:

Asia > Japan (0.25)
North America > United States > Pennsylvania (0.06)
North America > United States > Illinois (0.05)
(2 more...)

Industry: Energy > Power Industry > Utilities > Nuclear (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)