xie
Hustlers are cashing in on China's OpenClaw AI craze
Hustlers are cashing in on China's OpenClaw AI craze The AI tool has become the country's latest tech obsession. Feng Qingyang had always hoped to launch his own company, but he never thought this would be how--or that the day would come this fast. Feng, a 27-year-old software engineer based in Beijing, started tinkering with OpenClaw, a popular new open-source AI tool that can take over a device and autonomously complete tasks for a user, in January. He was immediately hooked, and before long he was helping other curious tech workers with less technical proficiency install the AI agent. Feng soon realized this could be a lucrative opportunity. By the end of January, he had set up a page on Xianyu, a secondhand shopping site, advertising "OpenClaw installation support."
- Asia > China > Beijing > Beijing (0.25)
- Asia > China > Guangdong Province > Shenzhen (0.06)
- North America > United States > Massachusetts (0.04)
- Asia > China > Zhejiang Province > Ningbo (0.04)
- Information Technology > Security & Privacy (0.69)
- Government (0.69)
- Information Technology > Services (0.48)
FT-MDT: Extracting Decision Trees from Medical Texts via a Novel Low-rank Adaptation Method
Li, Yuheng, Gao, Jiechao, Han, Wei, Ouyang, Wenwen, Zhu, Wei, Leong, Hui Yi
Knowledge of the medical decision process, which can be modeled as medical decision trees (MDTs), is critical to building clinical decision support systems. However, current MDT construction methods rely heavily on time-consuming and laborious manual annotation. To address this challenge, we propose PI-LoRA (Path-Integrated LoRA), a novel low-rank adaptation method for automatically extracting MDTs from clinical guidelines and textbooks. We integrate gradient path information to capture synergistic effects between different modules, enabling more effective and reliable rank allocation. This framework ensures that the most critical modules receive appropriate rank allocations while less important ones are pruned, resulting in a more efficient and accurate model for extracting medical decision trees from clinical texts. Extensive experiments on medical guideline datasets demonstrate that our PI-LoRA method significantly outperforms existing parameter-efficient fine-tuning approaches for the Text2MDT task, achieving better accuracy with substantially reduced model complexity. The proposed method achieves state-of-the-art results while maintaining a lightweight architecture, making it particularly suitable for clinical decision support systems where computational resources may be limited.
- Asia > Singapore (0.04)
- Asia > China > Hong Kong (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (8 more...)
- Health & Medicine (1.00)
- Information Technology (0.67)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.97)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.83)
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.83)
Autonomous 3D Moving Target Encirclement and Interception with Range measurement
Liu, Fen, Yuan, Shenghai, Nguyen, Thien-Minh, Su, Rong
Commercial UAVs are an emerging security threat as they are capable of carrying hazardous payloads or disrupting air traffic. To counter UAVs, we introduce an autonomous 3D target encirclement and interception strategy. Unlike traditional ground-guided systems, this strategy employs autonomous drones to track and engage non-cooperative hostile UAVs, which is effective in non-line-of-sight conditions, GPS denial, and radar jamming, where conventional detection and neutralization from ground guidance fail. Using two noisy real-time distances measured by drones, guardian drones estimate the relative position from their own to the target using observation and velocity compensation methods, based on anti-synchronization (AS) and an X$-$Y circular motion combined with vertical jitter. An encirclement control mechanism is proposed to enable UAVs to adaptively transition from encircling and protecting a target to encircling and monitoring a hostile target. Upon breaching a warning threshold, the UAVs may even employ a suicide attack to neutralize the hostile target. We validate this strategy through real-world UAV experiments and simulated analysis in MATLAB, demonstrating its effectiveness in detecting, encircling, and intercepting hostile drones. More details: https://youtu.be/5eHW56lPVto.
- Information Technology (1.00)
- Government > Military (1.00)
Tire Wear Aware Trajectory Tracking Control for Multi-axle Swerve-drive Autonomous Mobile Robots
Hu, Tianxin, Xu, Xinhang, Nguyen, Thien-Minh, Liu, Fen, Yuan, Shenghai, Xie, Lihua
Multi-axle [1] Swerve-drive Autonomous Guided Vehicle (MS-AGV) is a type of heavy-duty vehicle equipped with multiple independently controlled steering wheels. This design provides MS-AGVs with a unique combination of high load capacity [2, 3] and exceptional maneuverability [4, 5], making them highly suitable for complex industrial environments [6, 7], such as automated warehouses and port logistics [8-12]. However, effectively controlling MS-AGVs presents several challenges. These include achieving accurate kino-dynamic modeling [13], ensuring precise trajectory tracking [14], and optimizing speed for operational efficiency [15, 16]. Recent works have explored prescribed performance control under uncertainties and faults, such as [17, 18], but they do not consider tire wear, which is critical in MS-AGV applications. Furthermore, practical concerns such as minimizing tire wear, which directly impacts maintenance costs, add complexity to the problem [19]. Despite significant advancements, no existing solution [20] comprehensively addresses these issues in an integrated manner, leaving a critical gap in MS-AGV planning and control strategies. Over the past several years, researchers have dedicated substantial effort to developing advanced control strategies to address the trajectory tracking problem in MS-AGV systems [21]. The core technical difficulty lies in managing the steering wheels, as the increased number of state variables [22] and the dynamic complexity [23] of the system make it challenging to predict and control [24] its behavior effectively.
Aerial Path Online Planning for Urban Scene Updation
Tang, Mingfeng, Wang, Ningna, Xie, Ziyuan, Hu, Jianwei, Xie, Ke, Guo, Xiaohu, Huang, Hui
We present the first scene-update aerial path planning algorithm specifically designed for detecting and updating change areas in urban environments. While existing methods for large-scale 3D urban scene reconstruction focus on achieving high accuracy and completeness, they are inefficient for scenarios requiring periodic updates, as they often re-explore and reconstruct entire scenes, wasting significant time and resources on unchanged areas. To address this limitation, our method leverages prior reconstructions and change probability statistics to guide UAVs in detecting and focusing on areas likely to have changed. Our approach introduces a novel changeability heuristic to evaluate the likelihood of changes, driving the planning of two flight paths: a prior path informed by static priors and a dynamic real-time path that adapts to newly detected changes. The framework integrates surface sampling and candidate view generation strategies, ensuring efficient coverage of change areas with minimal redundancy. Extensive experiments on real-world urban datasets demonstrate that our method significantly reduces flight time and computational overhead, while maintaining high-quality updates comparable to full-scene re-exploration and reconstruction. These contributions pave the way for efficient, scalable, and adaptive UAV-based scene updates in complex urban environments.
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.06)
- Asia > China > Guangdong Province > Shenzhen (0.04)
- North America > United States > Texas (0.04)
- (5 more...)
Deep spatio-temporal point processes: Advances and new directions
Cheng, Xiuyuan, Dong, Zheng, Xie, Yao
Spatio-temporal point processes (STPPs) model discrete events distributed in time and space, with important applications in areas such as criminology, seismology, epidemiology, and social networks. Traditional models often rely on parametric kernels, limiting their ability to capture heterogeneous, nonstationary dynamics. Recent innovations integrate deep neural architectures -- either by modeling the conditional intensity function directly or by learning flexible, data-driven influence kernels, substantially broadening their expressive power. This article reviews the development of the deep influence kernel approach, which enjoys statistical explainability, since the influence kernel remains in the model to capture the spatiotemporal propagation of event influence and its impact on future events, while also possessing strong expressive power, thereby benefiting from both worlds. We explain the main components in developing deep kernel point processes, leveraging tools such as functional basis decomposition and graph neural networks to encode complex spatial or network structures, as well as estimation using both likelihood-based and likelihood-free methods, and address computational scalability for large-scale data. We also discuss the theoretical foundation of kernel identifiability. Simulated and real-data examples highlight applications to crime analysis, earthquake aftershock prediction, and sepsis prediction modeling, and we conclude by discussing promising directions for the field.
- North America > United States > California (0.14)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- Asia > Middle East > Iraq (0.04)
- (2 more...)
- Overview (1.00)
- Research Report > New Finding (0.46)
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model
Lai, Yuzhi, Yuan, Shenghai, Nassar, Youssef, Fan, Mingyu, Weber, Thomas, Rätsch, Matthias
Effective Human-Robot Interaction (HRI) is crucial for future service robots in aging societies. Existing solutions are biased toward only well-trained objects, creating a gap when dealing with new objects. Currently, HRI systems using predefined gestures or language tokens for pretrained objects pose challenges for all individuals, especially elderly ones. These challenges include difficulties in recalling commands, memorizing hand gestures, and learning new names. This paper introduces NVP-HRI, an intuitive multi-modal HRI paradigm that combines voice commands and deictic posture. NVP-HRI utilizes the Segment Anything Model (SAM) to analyze visual cues and depth data, enabling precise structural object representation. Through a pre-trained SAM network, NVP-HRI allows interaction with new objects via zero-shot prediction, even without prior knowledge. NVP-HRI also integrates with a large language model (LLM) for multimodal commands, coordinating them with object selection and scene distribution in real time for collision-free trajectory solutions. We also regulate the action sequence with the essential control syntax to reduce LLM hallucination risks. The evaluation of diverse real-world tasks using a Universal Robot showcased up to 59.2\% efficiency improvement over traditional gesture control, as illustrated in the video https://youtu.be/EbC7al2wiAc. Our code and design will be openly available at https://github.com/laiyuzhi/NVP-HRI.git.
- Europe > Switzerland > Zürich > Zürich (0.14)
- Asia > Singapore (0.04)
- North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
- (4 more...)
- Questionnaire & Opinion Survey (1.00)
- Research Report > New Finding (0.67)
- Research Report > Experimental Study (0.46)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (1.00)
QLIO: Quantized LiDAR-Inertial Odometry
Lou, Boyang, Yuan, Shenghai, Yang, Jianfei, Su, Wenju, Zhang, Yingjian, Hu, Enwen
LiDAR-Inertial Odometry (LIO) is widely used for autonomous navigation, but its deployment on Size, Weight, and Power (SWaP)-constrained platforms remains challenging due to the computational cost of processing dense point clouds. Conventional LIO frameworks rely on a single onboard processor, leading to computational bottlenecks and high memory demands, making real-time execution difficult on embedded systems. To address this, we propose QLIO, a multi-processor distributed quantized LIO framework that reduces computational load and bandwidth consumption while maintaining localization accuracy. QLIO introduces a quantized state estimation pipeline, where a co-processor pre-processes LiDAR measurements, compressing point-to-plane residuals before transmitting only essential features to the host processor. Additionally, an rQ-vector-based adaptive resampling strategy intelligently selects and compresses key observations, further reducing computational redundancy. Real-world evaluations demonstrate that QLIO achieves a 14.1% reduction in per-observation residual data while preserving localization accuracy. Furthermore, we release an open-source implementation to facilitate further research and real-world deployment. These results establish QLIO as an efficient and scalable solution for real-time autonomous systems operating under computational and bandwidth constraints.
Handle Object Navigation as Weighted Traveling Repairman Problem
Liu, Ruimeng, Xu, Xinhang, Yuan, Shenghai, Xie, Lihua
Zero-Shot Object Navigation (ZSON) requires agents to navigate to objects specified via open-ended natural language without predefined categories or prior environmental knowledge. While recent methods leverage foundation models or multi-modal maps, they often rely on 2D representations and greedy strategies or require additional training or modules with high computation load, limiting performance in complex environments and real applications. We propose WTRP-Searcher, a novel framework that formulates ZSON as a Weighted Traveling Repairman Problem (WTRP), minimizing the weighted waiting time of viewpoints. Using a Vision-Language Model (VLM), we score viewpoints based on object-description similarity, projected onto a 2D map with depth information. An open-vocabulary detector identifies targets, dynamically updating goals, while a 3D embedding feature map enhances spatial awareness and environmental recall. WTRP-Searcher outperforms existing methods, offering efficient global planning and improved performance in complex ZSON tasks. Code and more demos will be avaliable on https://github.com/lrm20011/WTRP_Searcher.
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.52)