AITopics | electrical and computer engineering

Collaborating Authors

electrical and computer engineering

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Is a secure AI assistant possible?

MIT Technology ReviewFeb-11-2026, 20:08:35 GMT

AI agents are a risky business. Even when stuck inside the chatbox window, LLMs will make mistakes and behave badly. Once they have tools that they can use to interact with the outside world, such as web browsers and email addresses, the consequences of those mistakes become far more serious. That might explain why the first breakthrough LLM personal assistant came not from one of the major AI labs, which have to worry about reputation and liability, but from an independent software engineer, Peter Steinberger. In November of 2025, Steinberger uploaded his tool, now called OpenClaw, to GitHub, and in late January the project went viral.

artificial intelligence, large language model, natural language, (20 more...)

MIT Technology Review

Country:

Asia > China (0.15)
North America > United States (0.14)
North America > Canada > Ontario > Toronto (0.14)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.90)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.83)

Add feedback

Towards Heterogeneous Quantum Federated Learning: Challenges and Solutions

Rahman, Ratun, Nguyen, Dinh C., Thomas, Christo Kurisummoottil, Saad, Walid

arXiv.org Artificial IntelligenceDec-5-2025

Quantum federated learning (QFL) combines quantum computing and federated learning to enable decentralized model training while maintaining data privacy. QFL can improve computational efficiency and scalability by taking advantage of quantum properties such as superposition and entanglement. However, existing QFL frameworks largely focus on homogeneity among quantum \textcolor{black}{clients, and they do not account} for real-world variances in quantum data distributions, encoding techniques, hardware noise levels, and computational capacity. These differences can create instability during training, slow convergence, and reduce overall model performance. In this paper, we conduct an in-depth examination of heterogeneity in QFL, classifying it into two categories: data or system heterogeneity. Then we investigate the influence of heterogeneity on training convergence and model aggregation. We critically evaluate existing mitigation solutions, highlight their limitations, and give a case study that demonstrates the viability of tackling quantum heterogeneity. Finally, we discuss potential future research areas for constructing robust and scalable heterogeneous QFL frameworks.

artificial intelligence, heterogeneity, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2511.22148

Country:

North America > United States (0.69)
Europe (0.68)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion

Du, Zongyang, Chandra, Shreeram Suresh, Ulgen, Ismail Rasim, Mahapatra, Aurosweta, Salman, Ali N., Busso, Carlos, Sisman, Berrak

arXiv.org Artificial IntelligenceNov-4-2025

Everyday speech conveys far more than words, it reflects who we are, how we feel, and the circumstances surrounding our interactions. Yet, most existing speech datasets are acted, limited in scale, and fail to capture the expressive richness of real-life communication. With the rise of large neural networks, several large-scale speech corpora have emerged and been widely adopted across various speech processing tasks. However, the field of voice conversion (VC) still lacks large-scale, expressive, and real-life speech resources suitable for modeling natural prosody and emotion. To fill this gap, we release NaturalVoices (NV), the first large-scale spontaneous podcast dataset specifically designed for emotion-aware voice conversion. It comprises 5,049 hours of spontaneous podcast recordings with automatic annotations for emotion (categorical and attribute-based), speech quality, transcripts, speaker identity, and sound events. The dataset captures expressive emotional variation across thousands of speakers, diverse topics, and natural speaking styles. We also provide an open-source pipeline with modular annotation tools and flexible filtering, enabling researchers to construct customized subsets for a wide range of VC tasks. Experiments demonstrate that NaturalVoices supports the development of robust and generalizable VC models capable of producing natural, expressive speech, while revealing limitations of current architectures when applied to large-scale spontaneous data. These results suggest that NaturalVoices is both a valuable resource and a challenging benchmark for advancing the field of voice conversion. Dataset is available at: https://huggingface.co/JHU-SmileLab

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2511.00256

Country:

North America > United States > Texas (0.28)
North America > United States > California (0.28)
Asia > Middle East > Republic of Türkiye (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Education (0.93)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Supplementary Material of ShiftAddNet

Neural Information Processing SystemsOct-2-2025, 09:02:32 GMT

Accuracy and training costs tradeoffs. Addernet: Do we really need multiplications in deep learning?

artificial intelligence, machine learning, shiftaddnet, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.17)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

Add feedback

Efficient Training of Large-Scale AI Models Through Federated Mixture-of-Experts: A System-Level Approach

Chen, Xiaobing, Zhang, Boyang, Zhou, Xiangwei, Sun, Mingxuan, Zhang, Shuai, Zhang, Songyang, Li, Geoffrey Ye

arXiv.org Artificial IntelligenceJul-9-2025

The integration of Federated Learning (FL) and Mixture-of-Experts (MoE) presents a compelling pathway for training more powerful, large-scale artificial intelligence models (LAMs) on decentralized data while preserving privacy. However, efficient federated training of these complex MoE-structured LAMs is hindered by significant system-level challenges, particularly in managing the interplay between heterogeneous client resources and the sophisticated coordination required for numerous specialized experts. This article highlights a critical, yet underexplored concept: the absence of robust quantitative strategies for dynamic client-expert alignment that holistically considers varying client capacities and the imperative for system-wise load balancing. Specifically, we propose a conceptual system design for intelligent client-expert alignment that incorporates dynamic fitness scoring, global expert load monitoring, and client capacity profiling. By tackling these systemic issues, we can unlock more scalable, efficient, and robust training mechanisms {with fewer communication rounds for convergence}, paving the way for the widespread deployment of large-scale federated MoE-structured LAMs in edge computing with ultra-high communication efficiency.

artificial intelligence, machine learning, moe-structured lam, (13 more...)

arXiv.org Artificial Intelligence

2507.05685

Country: North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.15)

Genre: Research Report (0.64)

Industry:

Education (0.68)
Information Technology > Security & Privacy (0.47)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.91)

Add feedback

Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments

Elmaghraby, Ola, Mounier, Eslam, de Araujo, Paulo Ricardo Marques, Noureldin, Aboelmagd

arXiv.org Artificial IntelligenceJun-25-2025

--In Global Navigation Satellite System (GNSS)- denied environments such as indoor parking structures or dense urban canyons, achieving accurate and robust vehicle positioning remains a significant challenge. This paper proposes a cost-effective, vision-based multi-sensor navigation system that integrates monocular depth estimation, semantic filtering, and visual map registration (VMR) with 3-D digital maps. Extensive testing in real-world indoor and outdoor driving scenarios demonstrates the effectiveness of the proposed system, achieving sub-meter accuracy 92% indoors and more than 80% outdoors, with consistent horizontal positioning and heading average root mean-square errors of approximately 0.98 m and 1.25 Compared to the baselines examined, the proposed solution significantly reduced drift and improved robustness under various conditions, achieving positioning accuracy improvements of approximately 88% on average. This work highlights the potential of cost-effective monocular vision systems combined with 3D maps for scalable, GNSS-independent navigation in land vehicles. OSITIONING is a cornerstone of autonomous driving, enabling vehicles to plan, control, and make decisions [1]. While global navigation satellite system (GNSS) technologies provide high accuracy positioning capabilities in open-sky environments [2], they become unreliable or even denied in environments such as dense urban areas, tunnels, and underground parking [3]. To compensate for GNSS limitations, some approaches employ high-resolution light detection and ranging (LiDAR)-based positioning systems [4] or integrate high-grade inertial navigation system (INS) [5]. Although these solutions can provide accurate and reliable positioning, their high cost hinders their practicality for consumer-level deployment. In contrast, cameras offer a cost-effective, lightweight, and widely available sensing modality. This research is supported by grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) under grant numbers: RGPIN-2020-03900 and ALLRP-560898-20. Ola Elmaghraby and Paulo de Araujo are with the Department of Electrical and Computer Engineering, Queen's University, Kingston, ON K7L 3N6, Canada (e-mail: ola.elmaghraby.a@queensu.ca;

artificial intelligence, machine learning, point cloud, (15 more...)

arXiv.org Artificial Intelligence

2506.19827

Country: North America > Canada > Ontario > Kingston (0.48)

Genre: Research Report (0.40)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
(2 more...)

Add feedback

Dynamic Temporal Positional Encodings for Early Intrusion Detection in IoT

Panopoulos, Ioannis, Bartsioka, Maria-Lamprini A., Nikolaidis, Sokratis, Venieris, Stylianos I., Kaklamani, Dimitra I., Venieris, Iakovos S.

arXiv.org Artificial IntelligenceJun-24-2025

--The rapid expansion of the Internet of Things (IoT) has introduced significant security challenges, necessitating efficient and adaptive Intrusion Detection Systems (IDS). Traditional IDS models often overlook the temporal characteristics of network traffic, limiting their effectiveness in early threat detection. We propose a Transformer-based Early Intrusion Detection System (EIDS) that incorporates dynamic temporal positional encodings to enhance detection accuracy while maintaining computational efficiency. Additionally, we introduce a data augmentation pipeline to improve model robustness. Evaluated on the CICIoT2023 dataset, our method outperforms existing models in both accuracy and earliness. We further demonstrate its real-time feasibility on resource-constrained IoT devices, achieving low-latency inference and minimal memory footprint. The Internet of Things (IoT) enables smart devices to exchange data in real time across various domains, such as smart homes, healthcare, and industrial automation. These systems integrate the physical and digital worlds, generating diverse data types while often operating autonomously.

machine learning, natural language, real time system, (17 more...)

arXiv.org Artificial Intelligence

2506.18114

Country: Europe > Greece (0.15)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Internet of Things (1.00)
Information Technology > Communications > Networks (1.00)
(3 more...)

Add feedback

WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis

Chen, Liangliang, Xie, Huiru, Rohde, Jacqueline, Zhang, Ying

arXiv.org Artificial IntelligenceJun-11-2025

This research-to-practice work-in-progress (WIP) paper presents an AI-enabled smart tutor designed to provide homework assessment and feedback for students in an undergraduate circuit analysis course. We detail the tutor's design philosophy and core components, including open-ended question answering and homework feedback generation. The prompts are carefully crafted to optimize responses across different problems. The smart tutor was deployed on the Microsoft Azure platform and is currently in use in an undergraduate circuit analysis course at the School of Electrical and Computer Engineering in a large, public, research-intensive institution in the Southeastern United States. Beyond offering personalized instruction and feedback, the tutor collects student interaction data, which is summarized and shared with the course instructor. To evaluate its effectiveness, we collected student feedback, with 90.9% of responses indicating satisfaction with the tutor. Additionally, we analyze a subset of collected data on preliminary circuit analysis topics to assess tutor usage frequency for each problem and identify frequently asked questions. These insights help instructors gain real-time awareness of student difficulties, enabling more targeted classroom instruction. In future work, we will release a full analysis once the complete dataset is available after the Spring 2025 semester. We also explore the potential applications of this smart tutor across a broader range of engineering disciplines by developing improved prompts, diagram-recognition methods, and database management strategies, which remain ongoing areas of research.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2506.08962

Country: North America > United States > Oklahoma (0.14)

Genre: Instructional Material > Course Syllabus & Notes (0.67)

Industry:

Education > Educational Setting (0.67)
Education > Educational Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution

Bang, Junseo, Lee, Joonhee, Lee, Kyeonghyun, Lee, Haechang, Kang, Dong Un, Chun, Se Young

arXiv.org Artificial IntelligenceJun-10-2025

Arbitrary-scale image super-resolution aims to upsample images to any desired resolution, offering greater flexibility than traditional fixed-scale super-resolution. Recent approaches in this domain utilize regression-based or generative models, but many of them are a single-stage upsampling process, which may be challenging to learn across a wide, continuous distribution of scaling factors. Progressive upsampling strategies have shown promise in mitigating this issue, yet their integration with diffusion models for flexible upscaling remains underexplored. Here, we present CasArbi, a novel self-cascaded diffusion framework for arbitrary-scale image super-resolution. CasArbi meets the varying scaling demands by breaking them down into smaller sequential factors and progressively enhancing the image resolution at each step with seamless transitions for arbitrary scales. Our novel coordinate-guided residual diffusion model allows for the learning of continuous image representations while enabling efficient diffusion sampling. Extensive experiments demonstrate that our CasArbi outperforms prior arts in both perceptual and distortion performance metrics across diverse arbitrary-scale super-resolution benchmarks.

artificial intelligence, diffusion model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2506.07813

Country:

North America > United States (0.46)
Asia > South Korea (0.31)

Genre: Research Report (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Machine learning powers new approach to detecting soil contaminants

AIHubJun-6-2025, 13:51:20 GMT

A team of researchers at Rice University and Baylor College of Medicine has developed a new strategy for identifying hazardous pollutants in soil, even ones that have never been isolated or studied in a lab. The new approach, described in a study published in Proceedings of the National Academy of Sciences, uses light-based imaging, theoretical predictions of compounds' light signatures and machine learning (ML) algorithms to detect toxic compounds like polycyclic aromatic hydrocarbons (PAHs) and their derivative compounds (PACs) in soil. A common by-product of combustion, PAHs and PACs have been linked to cancer, developmental issues and other serious health problems. Identifying pollutants in soil usually requires advanced laboratories and standard physical reference samples of the suspected contaminants. However, for many environmental pollutants that pose a public health risk, there is no experimental data available that can be used to detect them.

artificial intelligence, compound, machine learning, (15 more...)

AIHub

Country: North America > United States (0.36)

Genre: Research Report > New Finding (0.94)

Industry:

Energy > Oil & Gas (0.91)
Health & Medicine > Consumer Health (0.56)
Government > Regional Government > North America Government > United States Government (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.62)

Add feedback