electrical and computer engineering
Is a secure AI assistant possible?
AI agents are a risky business. Even when stuck inside the chatbox window, LLMs will make mistakes and behave badly. Once they have tools that they can use to interact with the outside world, such as web browsers and email addresses, the consequences of those mistakes become far more serious. That might explain why the first breakthrough LLM personal assistant came not from one of the major AI labs, which have to worry about reputation and liability, but from an independent software engineer, Peter Steinberger. In November of 2025, Steinberger uploaded his tool, now called OpenClaw, to GitHub, and in late January the project went viral.
- Asia > China (0.15)
- North America > Canada > Ontario > Toronto (0.14)
- North America > United States > Massachusetts (0.04)
- North America > United States > California > San Francisco County > San Francisco (0.04)
Towards Heterogeneous Quantum Federated Learning: Challenges and Solutions
Rahman, Ratun, Nguyen, Dinh C., Thomas, Christo Kurisummoottil, Saad, Walid
Quantum federated learning (QFL) combines quantum computing and federated learning to enable decentralized model training while maintaining data privacy. QFL can improve computational efficiency and scalability by taking advantage of quantum properties such as superposition and entanglement. However, existing QFL frameworks largely focus on homogeneity among quantum \textcolor{black}{clients, and they do not account} for real-world variances in quantum data distributions, encoding techniques, hardware noise levels, and computational capacity. These differences can create instability during training, slow convergence, and reduce overall model performance. In this paper, we conduct an in-depth examination of heterogeneity in QFL, classifying it into two categories: data or system heterogeneity. Then we investigate the influence of heterogeneity on training convergence and model aggregation. We critically evaluate existing mitigation solutions, highlight their limitations, and give a case study that demonstrates the viability of tackling quantum heterogeneity. Finally, we discuss potential future research areas for constructing robust and scalable heterogeneous QFL frameworks.
- North America > United States > Virginia (0.05)
- Asia > India > Karnataka > Bengaluru (0.04)
- Oceania > Australia (0.04)
- (6 more...)
- North America > United States > Texas > Travis County > Austin (0.06)
- North America > Canada (0.06)
NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion
Du, Zongyang, Chandra, Shreeram Suresh, Ulgen, Ismail Rasim, Mahapatra, Aurosweta, Salman, Ali N., Busso, Carlos, Sisman, Berrak
Everyday speech conveys far more than words, it reflects who we are, how we feel, and the circumstances surrounding our interactions. Yet, most existing speech datasets are acted, limited in scale, and fail to capture the expressive richness of real-life communication. With the rise of large neural networks, several large-scale speech corpora have emerged and been widely adopted across various speech processing tasks. However, the field of voice conversion (VC) still lacks large-scale, expressive, and real-life speech resources suitable for modeling natural prosody and emotion. To fill this gap, we release NaturalVoices (NV), the first large-scale spontaneous podcast dataset specifically designed for emotion-aware voice conversion. It comprises 5,049 hours of spontaneous podcast recordings with automatic annotations for emotion (categorical and attribute-based), speech quality, transcripts, speaker identity, and sound events. The dataset captures expressive emotional variation across thousands of speakers, diverse topics, and natural speaking styles. We also provide an open-source pipeline with modular annotation tools and flexible filtering, enabling researchers to construct customized subsets for a wide range of VC tasks. Experiments demonstrate that NaturalVoices supports the development of robust and generalizable VC models capable of producing natural, expressive speech, while revealing limitations of current architectures when applied to large-scale spontaneous data. These results suggest that NaturalVoices is both a valuable resource and a challenging benchmark for advancing the field of voice conversion. Dataset is available at: https://huggingface.co/JHU-SmileLab
- North America > United States > Maryland > Baltimore (0.04)
- Asia > Singapore (0.04)
- South America > Colombia > Meta Department > Villavicencio (0.04)
- (15 more...)
- Education (0.93)
- Information Technology > Security & Privacy (0.46)
- North America > United States > Texas > Travis County > Austin (0.06)
- North America > Canada (0.06)
Learning to Construct Knowledge through Sparse Reference Selection with Reinforcement Learning
The rapid expansion of scientific literature makes it increasingly difficult to acquire new knowledge, particularly in specialized domains where reasoning is complex, full-text access is restricted, and target references are sparse among a large set of candidates. We present a Deep Reinforcement Learning framework for sparse reference selection that emulates human knowledge construction, prioritizing which papers to read under limited time and cost. Evaluated on drug--gene relation discovery with access restricted to titles and abstracts, our approach demonstrates that both humans and machines can construct knowledge effectively from partial information.
- North America > United States > Minnesota (0.40)
- North America > United States > Florida > Broward County > Fort Lauderdale (0.04)
- North America > United States > California > Los Angeles County > Pasadena (0.04)
- (3 more...)
- Health & Medicine > Pharmaceuticals & Biotechnology (0.47)
- Leisure & Entertainment > Sports > Baseball (0.40)
Efficient Training of Large-Scale AI Models Through Federated Mixture-of-Experts: A System-Level Approach
Chen, Xiaobing, Zhang, Boyang, Zhou, Xiangwei, Sun, Mingxuan, Zhang, Shuai, Zhang, Songyang, Li, Geoffrey Ye
The integration of Federated Learning (FL) and Mixture-of-Experts (MoE) presents a compelling pathway for training more powerful, large-scale artificial intelligence models (LAMs) on decentralized data while preserving privacy. However, efficient federated training of these complex MoE-structured LAMs is hindered by significant system-level challenges, particularly in managing the interplay between heterogeneous client resources and the sophisticated coordination required for numerous specialized experts. This article highlights a critical, yet underexplored concept: the absence of robust quantitative strategies for dynamic client-expert alignment that holistically considers varying client capacities and the imperative for system-wise load balancing. Specifically, we propose a conceptual system design for intelligent client-expert alignment that incorporates dynamic fitness scoring, global expert load monitoring, and client capacity profiling. By tackling these systemic issues, we can unlock more scalable, efficient, and robust training mechanisms {with fewer communication rounds for convergence}, paving the way for the widespread deployment of large-scale federated MoE-structured LAMs in edge computing with ultra-high communication efficiency.
- North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.15)
- North America > United States > New Jersey > Essex County > Newark (0.04)
- North America > United States > Louisiana > Lafayette Parish > Lafayette (0.04)
- (8 more...)
- Education (0.68)
- Information Technology > Security & Privacy (0.47)
Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments
Elmaghraby, Ola, Mounier, Eslam, de Araujo, Paulo Ricardo Marques, Noureldin, Aboelmagd
--In Global Navigation Satellite System (GNSS)- denied environments such as indoor parking structures or dense urban canyons, achieving accurate and robust vehicle positioning remains a significant challenge. This paper proposes a cost-effective, vision-based multi-sensor navigation system that integrates monocular depth estimation, semantic filtering, and visual map registration (VMR) with 3-D digital maps. Extensive testing in real-world indoor and outdoor driving scenarios demonstrates the effectiveness of the proposed system, achieving sub-meter accuracy 92% indoors and more than 80% outdoors, with consistent horizontal positioning and heading average root mean-square errors of approximately 0.98 m and 1.25 Compared to the baselines examined, the proposed solution significantly reduced drift and improved robustness under various conditions, achieving positioning accuracy improvements of approximately 88% on average. This work highlights the potential of cost-effective monocular vision systems combined with 3D maps for scalable, GNSS-independent navigation in land vehicles. OSITIONING is a cornerstone of autonomous driving, enabling vehicles to plan, control, and make decisions [1]. While global navigation satellite system (GNSS) technologies provide high accuracy positioning capabilities in open-sky environments [2], they become unreliable or even denied in environments such as dense urban areas, tunnels, and underground parking [3]. To compensate for GNSS limitations, some approaches employ high-resolution light detection and ranging (LiDAR)-based positioning systems [4] or integrate high-grade inertial navigation system (INS) [5]. Although these solutions can provide accurate and reliable positioning, their high cost hinders their practicality for consumer-level deployment. In contrast, cameras offer a cost-effective, lightweight, and widely available sensing modality. This research is supported by grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) under grant numbers: RGPIN-2020-03900 and ALLRP-560898-20. Ola Elmaghraby and Paulo de Araujo are with the Department of Electrical and Computer Engineering, Queen's University, Kingston, ON K7L 3N6, Canada (e-mail: ola.elmaghraby.a@queensu.ca;
- North America > Canada > Ontario > Kingston (0.48)
- Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.04)
- North America > United States > Colorado > Denver County > Denver (0.04)
- (2 more...)
Dynamic Temporal Positional Encodings for Early Intrusion Detection in IoT
Panopoulos, Ioannis, Bartsioka, Maria-Lamprini A., Nikolaidis, Sokratis, Venieris, Stylianos I., Kaklamani, Dimitra I., Venieris, Iakovos S.
--The rapid expansion of the Internet of Things (IoT) has introduced significant security challenges, necessitating efficient and adaptive Intrusion Detection Systems (IDS). Traditional IDS models often overlook the temporal characteristics of network traffic, limiting their effectiveness in early threat detection. We propose a Transformer-based Early Intrusion Detection System (EIDS) that incorporates dynamic temporal positional encodings to enhance detection accuracy while maintaining computational efficiency. Additionally, we introduce a data augmentation pipeline to improve model robustness. Evaluated on the CICIoT2023 dataset, our method outperforms existing models in both accuracy and earliness. We further demonstrate its real-time feasibility on resource-constrained IoT devices, achieving low-latency inference and minimal memory footprint. The Internet of Things (IoT) enables smart devices to exchange data in real time across various domains, such as smart homes, healthcare, and industrial automation. These systems integrate the physical and digital worlds, generating diverse data types while often operating autonomously.
- Europe > Greece > Attica > Athens (0.05)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis
Chen, Liangliang, Xie, Huiru, Rohde, Jacqueline, Zhang, Ying
This research-to-practice work-in-progress (WIP) paper presents an AI-enabled smart tutor designed to provide homework assessment and feedback for students in an undergraduate circuit analysis course. We detail the tutor's design philosophy and core components, including open-ended question answering and homework feedback generation. The prompts are carefully crafted to optimize responses across different problems. The smart tutor was deployed on the Microsoft Azure platform and is currently in use in an undergraduate circuit analysis course at the School of Electrical and Computer Engineering in a large, public, research-intensive institution in the Southeastern United States. Beyond offering personalized instruction and feedback, the tutor collects student interaction data, which is summarized and shared with the course instructor. To evaluate its effectiveness, we collected student feedback, with 90.9% of responses indicating satisfaction with the tutor. Additionally, we analyze a subset of collected data on preliminary circuit analysis topics to assess tutor usage frequency for each problem and identify frequently asked questions. These insights help instructors gain real-time awareness of student difficulties, enabling more targeted classroom instruction. In future work, we will release a full analysis once the complete dataset is available after the Spring 2025 semester. We also explore the potential applications of this smart tutor across a broader range of engineering disciplines by developing improved prompts, diagram-recognition methods, and database management strategies, which remain ongoing areas of research.
- North America > United States > Oregon > Multnomah County > Portland (0.04)
- North America > United States > Oklahoma > Oklahoma County > Oklahoma City (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (4 more...)
- Education > Educational Setting (0.67)
- Education > Educational Technology (0.47)