Voice command


Meet the AI-powered robotic dog ready to help with emergency response

Robohub

Developed by Texas A&M University engineering students, this AI-powered robotic dog doesn't just follow commands. Designed to navigate chaos with precision, the robot could help revolutionize search-and-rescue missions, disaster response and many other emergency operations. Sandun Vitharana, an engineering technology master's student, and Sanjaya Mallikarachchi, an interdisciplinary engineering doctoral student, spearheaded the invention of the robotic dog. It can process voice commands and uses AI and camera input to perform path planning and identify objects. A roboticist would describe it as a terrestrial robot that uses a memory-driven navigation system powered by a multimodal large language model (MLLM).
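The article doesn't publish the robot's code, but the memory-driven idea can be illustrated with a toy sketch: the robot stores where its vision system last saw each object, then resolves a spoken goal to a stored position. All names here are hypothetical, and the MLLM interpretation step is reduced to simple keyword matching.

```python
from dataclasses import dataclass

@dataclass
class Observation:
    label: str         # object class reported by the vision model
    position: tuple    # (x, y) in the robot's map frame

class MemoryNavigator:
    """Toy memory-driven navigator: remembers where objects were seen,
    then resolves a spoken goal ("go to the chair") to a map position."""

    def __init__(self):
        self.memory = {}  # label -> last known position

    def observe(self, obs: Observation):
        self.memory[obs.label] = obs.position

    def resolve_goal(self, command: str):
        # In the real system an MLLM would interpret the command;
        # here we just match remembered labels against the text.
        for label, pos in self.memory.items():
            if label in command.lower():
                return pos
        return None

nav = MemoryNavigator()
nav.observe(Observation("chair", (2.0, 1.5)))
nav.observe(Observation("door", (5.0, 0.0)))
goal = nav.resolve_goal("Go to the chair")
print(goal)  # (2.0, 1.5)
```

A real implementation would feed camera frames and the command to the MLLM and hand the resolved goal to a path planner; the lookup table above only stands in for that memory layer.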


These appliances don't depend on smart speakers for voice control

PCWorld

Emerson Smart's new appliances respond to voice commands, but they don't need a smart speaker--or even a broadband connection--to pull off the trick. Smart appliances that can be controlled with voice commands are nothing new, but IAI Smart is showing a new line of Emerson Smart appliances at CES that respond to voice commands without a smart speaker in the middle. They don't rely on a broadband connection, an app, or any other infrastructure--everything is processed locally. If you're leery of the privacy and security vulnerabilities of IoT devices, this could be the answer.


I Ditched Alexa and Upgraded My Smart Home

WIRED

Here's how I cut down my family's reliance on Alexa. Until recently, my smart home setup was in chaos. After years of testing, buying, and upgrading to the latest smart home gadgets in an attempt to make my life easier, it became a bloated mess that was actually making it more complicated. My Alexa, Google Home, and Apple Home apps were awash with dead devices, duplicates, and automations that simply didn't work. My Hue Bridge, trying desperately to tie it all together, was creaking at the seams.


Multimodal Deep Learning for ATCO Command Lifecycle Modeling and Workload Prediction

Tan, Kaizhen

arXiv.org Artificial Intelligence

Air traffic controllers (ATCOs) issue high-intensity voice commands in dense airspace, where accurate workload modeling is critical for safety and efficiency. This paper proposes a multimodal deep learning framework that integrates structured data, trajectory sequences, and image features to estimate two key parameters in the ATCO command lifecycle: the time offset between a command and the resulting aircraft maneuver, and the command duration. A high-quality dataset was constructed, with maneuver points detected using sliding-window and histogram-based methods. A CNN-Transformer ensemble model was developed for accurate, generalizable, and interpretable predictions. By linking trajectories to voice commands, this work offers the first model of its kind to support intelligent command generation and provides practical value for workload assessment, staffing, and scheduling. A. Background: As global air traffic demand increases, airspace operations have become more complex and congested, presenting major challenges for air traffic control (ATC) systems. Although surveillance and communication technologies have improved, ATC performance still largely depends on human operators, particularly air traffic controllers (ATCOs), who monitor flights, assess conditions, and issue maneuver instructions to ensure safe and efficient operations. This human bottleneck has become a key constraint on ATC efficiency and safety, emphasizing the importance of quantifying task intensity and evaluating workload to support fatigue management, staff scheduling, and the development of intelligent ATC solutions. Early studies on ATCO workload modeling primarily focused on statistical methods and subjective assessments such as the NASA Task Load Index (NASA-TLX) [1].
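The paper's sliding-window maneuver-point detection can be sketched in miniature: flag trajectory indices where the mean heading changes sharply between the windows before and after that point. The window size and threshold below are illustrative assumptions, not the paper's parameters.

```python
def detect_maneuver_points(headings, window=3, threshold=10.0):
    """Flag indices where the mean heading change across a sliding
    window exceeds a threshold (degrees) -- a simplified stand-in for
    the paper's sliding-window maneuver-point detection."""
    points = []
    for i in range(window, len(headings) - window):
        before = sum(headings[i - window:i]) / window
        after = sum(headings[i:i + window]) / window
        if abs(after - before) > threshold:
            points.append(i)
    return points

# A track that flies straight, turns ~40 degrees, then holds the new heading.
track = [90, 90, 90, 90, 110, 130, 130, 130, 130]
print(detect_maneuver_points(track))  # [3, 4, 5] -- the turn segment
```

The detected maneuver point, paired with the command's transmission time, is what gives the framework its first target label: the command-to-maneuver time offset.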


These eye-popping smart lights boast built-in AI microphones

PCWorld

Smart lights that react to voice commands spoken to smart speakers are old hat, but a smart light with a built-in AI microphone? Showing off its wares at the IFA trade show in Berlin this week, Germany-based smart device manufacturer Lepro is teeing up a quartet of "AI Lighting Pro" lights that can set the mood based on your natural-language prompts--anything from "Give me an Iron Man vibe" to "Set a cyberpunk city theme." Each of the lights features a built-in microphone that captures your commands (you must say the "Hey Lepro" wake phrase first) and processes them using Lepro's new LightGPM AI engine, a large language model that's trained on "color psychology and lighting design," Lepro says. The AI then delivers an "ideal" multi-color lighting scene based on your voice prompt. We've seen plenty of smart lights with AI-powered light scene bots before; Philips Hue is integrating one into the Hue app, and Govee and Nanoleaf have their own versions.


Cog-TiPRO: Iterative Prompt Refinement with LLMs to Detect Cognitive Decline via Longitudinal Voice Assistant Commands

Qi, Kristin, Zhu, Youxiang, Summerour, Caroline, Batsis, John A., Liang, Xiaohui

arXiv.org Artificial Intelligence

Early detection of cognitive decline is crucial for enabling interventions that can slow neurodegenerative disease progression. Traditional diagnostic approaches rely on labor-intensive clinical assessments, which are impractical for frequent monitoring. Our pilot study investigates voice assistant systems (VAS) as non-invasive tools for detecting cognitive decline through longitudinal analysis of speech patterns in voice commands. Over an 18-month period, we collected voice commands from 35 older adults, with 15 participants providing daily at-home VAS interactions. To address the challenges of analyzing these short, unstructured and noisy commands, we propose Cog-TiPRO, a framework that combines (1) LLM-driven iterative prompt refinement for linguistic feature extraction, (2) HuBERT-based acoustic feature extraction, and (3) transformer-based temporal modeling. Using iTransformer, our approach achieves 73.80% accuracy and 72.67% F1-score in detecting MCI, outperforming its baseline by 27.13%. Through our LLM approach, we identify linguistic features that uniquely characterize everyday command usage patterns in individuals experiencing cognitive decline.
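To make the "linguistic features from short commands" idea concrete, here is a deliberately minimal sketch computing two classic features (mean command length and type-token ratio) over a set of voice commands. The actual Cog-TiPRO pipeline extracts far richer features via LLM prompt refinement and HuBERT acoustics; this only illustrates the feature-vector step.

```python
def command_features(commands):
    """Compute two simple linguistic features over a user's voice
    commands: mean command length (in words) and type-token ratio
    (vocabulary diversity across all commands)."""
    tokens_per_cmd = [c.lower().split() for c in commands]
    all_tokens = [t for cmd in tokens_per_cmd for t in cmd]
    mean_len = len(all_tokens) / len(commands)
    ttr = len(set(all_tokens)) / len(all_tokens)
    return {"mean_length": mean_len, "type_token_ratio": round(ttr, 3)}

feats = command_features([
    "play music",
    "play music please",
    "what is the weather today",
])
print(feats)
```

Features like these, tracked per user over the 18-month collection window, are the kind of longitudinal signal the transformer-based temporal model consumes.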


How to take photos on your phone via remote control

Popular Science

Breakthroughs, discoveries, and DIY tips sent every weekday. Our smartphones have transformed the way we take photos and videos and our relationship to these digital memories. Most of us will snap at least some pictures and clips every day with the gadget that's always close at hand. If you want to get more creative with photos on your phone, you can. Sometimes you're going to want to take a picture remotely, without your phone in your hand and your finger over the shutter button--maybe you're taking a wide shot of a large group, or you want to capture a lot of your surroundings.


Reducing Latency in LLM-Based Natural Language Commands Processing for Robot Navigation

Pollini, Diego, Guterres, Bruna V., Guerra, Rodrigo S., Grando, Ricardo B.

arXiv.org Artificial Intelligence

The integration of Large Language Models (LLMs), such as GPT, in industrial robotics enhances operational efficiency and human-robot collaboration. However, the computational complexity and size of these models often introduce latency in request and response times. This study explores the integration of the ChatGPT natural language model with the Robot Operating System 2 (ROS 2) to mitigate interaction latency and improve robotic system control within a simulated Gazebo environment. We present an architecture that integrates these technologies without requiring a middleware transport platform, detailing how a simulated mobile robot responds to text and voice commands. Experimental results demonstrate that this integration improves execution speed, usability, and accessibility of the human-robot interaction by decreasing communication latency by 7.01% on average. Such improvements facilitate smoother, real-time robot operations, which are crucial for industrial automation and precision tasks.
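One piece of such an architecture — turning the LLM's reply into a velocity command a ROS 2 node could publish as a Twist message — can be sketched without any ROS dependency. The JSON schema and field names below are assumptions for illustration, not the paper's actual interface.

```python
import json

def parse_llm_reply(reply: str):
    """Turn a (hypothetical) LLM reply carrying a JSON velocity command
    into the (linear, angular) pair a ROS 2 node would publish as a
    geometry_msgs Twist. Keeping the reply machine-parseable is one way
    to cut round trips, and thus latency, in the command loop."""
    data = json.loads(reply)
    return float(data["linear_x"]), float(data["angular_z"])

# Example reply an LLM might be prompted to produce for
# "turn left while moving forward slowly":
reply = '{"linear_x": 0.2, "angular_z": 0.5}'
print(parse_llm_reply(reply))  # (0.2, 0.5)
```

In a full rclpy node, the returned pair would populate `Twist.linear.x` and `Twist.angular.z` before publishing to the robot's velocity topic.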


Spot-On: A Mixed Reality Interface for Multi-Robot Cooperation

Engelbracht, Tim, Lukovic, Petar, Behrens, Tjark, Lascheit, Kai, Zurbrügg, René, Pollefeys, Marc, Blum, Hermann, Bauer, Zuria

arXiv.org Artificial Intelligence

Recent progress in mixed reality (MR) and robotics is enabling increasingly sophisticated forms of human-robot collaboration. Building on these developments, we introduce a novel MR framework that allows multiple quadruped robots to operate in semantically diverse environments via an MR interface. Our system supports collaborative tasks involving drawers, swing doors, and higher-level infrastructure such as light switches. A comprehensive user study verifies both the design and usability of our app, with participants giving a "good" or "very good" rating in almost all cases. Overall, our approach provides an effective and intuitive framework for MR-based multi-robot collaboration in complex, real-world scenarios.


Leviton Decora Smart Z-Wave 800 review: It's OK to say no to Wi-Fi

PCWorld

Leviton, one of the biggest electrical component manufacturers in the world, makes high-quality products and offers a comprehensive collection of Z-Wave-compatible devices in addition to this Z-Wave 800 dimmer and switch. Smart lighting controls that operate over Wi-Fi are great, because they don't require a hub; they connect directly to your router. The downside is that they must compete with all the other clients on your home network: Your computers, gaming consoles, media streamers, smart speakers, home security cameras, smart plugs, and many, many more. I live in a very small home--less than 800 square feet--but there are still more than 80 devices connected to the Eero 6 router in my Ring Alarm Pro. Given that the Eero 6's practical limit is 128 clients, there just isn't a lot of room for light switches and dimmers.