AITopics

2411.03483

Country:

North America > United States > California > Riverside County > Riverside (0.14)
Europe > Switzerland (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)

Genre: Research Report (0.64)

Industry: Food & Agriculture > Agriculture (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)

Malek, Kaveh, Moreu, Fernando

Methodology to Deploy CNN-Based Computer Vision Models on Immersive Wearable Devices

arXiv.org Artificial IntelligenceJun-28-2024

Convolutional Neural Network (CNN) models often lack the ability to incorporate human input, which can be addressed by Augmented Reality (AR) headsets. However, current AR headsets face limitations in processing power, which has prevented researchers from performing real-time, complex image recognition tasks using CNNs in AR headsets. This paper presents a method to deploy CNN models on AR headsets by training them on computers and transferring the optimized weight matrices to the headset. The approach transforms the image data and CNN layers into a one-dimensional format suitable for the AR platform. We demonstrate this method by training the LeNet-5 CNN model on the MNIST dataset using PyTorch and deploying it on a HoloLens AR headset. The results show that the model maintains an accuracy of approximately 98%, similar to its performance on a computer. This integration of CNN and AR enables real-time image processing on AR headsets, allowing for the incorporation of human input into AI models.

ar headset, cnn model, platform, (14 more...)

2407.00233

Country:

North America > United States > New Mexico (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Hardware (0.41)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Hardware (1.00)
(3 more...)

Moore, Drake, Zolotas, Mark, Padir, Taskin

Shared Affordance-awareness via Augmented Reality for Proactive Assistance in Human-robot Collaboration

arXiv.org Artificial IntelligenceDec-20-2023

Enabling humans and robots to collaborate effectively requires purposeful communication and an understanding of each other's affordances. Prior work in human-robot collaboration has incorporated knowledge of human affordances, i.e., their action possibilities in the current context, into autonomous robot decision-making. This "affordance awareness" is especially promising for service robots that need to know when and how to assist a person that cannot independently complete a task. However, robots still fall short in performing many common tasks autonomously. In this work-in-progress paper, we propose an augmented reality (AR) framework that bridges the gap in an assistive robot's capabilities by actively engaging with a human through a shared affordance-awareness representation. Leveraging the different perspectives from a human wearing an AR headset and a robot's equipped sensors, we can build a perceptual representation of the shared environment and model regions of respective agent affordances. The AR interface can also allow both agents to communicate affordances with one another, as well as prompt for assistance when attempting to perform an action outside their affordance region. This paper presents the main components of the proposed framework and discusses its potential through a domestic cleaning task experiment.

affordance, agent, robot, (13 more...)

2312.1341

Country: North America > United States (0.14)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.95)
Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.65)

arXiv.org Artificial IntelligenceApr-26-2023

Multimodal Grounding for Embodied AI via Augmented Reality Headsets for Natural Language Driven Task Planning

Wanna, Selma, Parra, Fabian, Valner, Robert, Kruusamäe, Karl, Pryor, Mitch

Recent advances in generative modeling have spurred a resurgence in the field of Embodied Artificial Intelligence (EAI). EAI systems typically deploy large language models to physical systems capable of interacting with their environment. In our exploration of EAI for industrial domains, we successfully demonstrate the feasibility of co-located, human-robot teaming. Specifically, we construct an experiment where an Augmented Reality (AR) headset mediates information exchange between an EAI agent and human operator for a variety of inspection tasks. To our knowledge the use of an AR headset for multimodal grounding and the application of EAI to industrial tasks are novel contributions within Embodied AI research. In addition, we highlight potential pitfalls in EAI's construction by providing quantitative and qualitative analysis on prompt robustness.

computational linguistic, machine learning, natural language, (19 more...)

2304.13676

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Estonia > Tartu County > Tartu (0.05)
Europe > Ireland > Leinster > County Dublin > Dublin (0.05)
(7 more...)

Genre: Research Report (0.52)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Gu, Morris, Croft, Elizabeth, Cosgun, Akansel

AR Point&Click: An Interface for Setting Robot Navigation Goals

arXiv.org Artificial IntelligenceOct-22-2022

This paper considers the problem of designating navigation goal locations for interactive mobile robots. We investigate a point-andclick interface, implemented with an Augmented Reality (AR) headset. The cameras on the AR headset are used to detect natural pointing gestures performed by the user. The selected goal is visualized through the AR headset, allowing the users to adjust the goal location if desired. We conduct a user study in which participants set consecutive navigation goals for the robot using three different interfaces: AR Point&Click, Person Following and Tablet (birdeye map view). Results show that the proposed AR Point&Click interface improved the perceived accuracy, efficiency and reduced mental load compared to the baseline tablet interface, and it performed on-par to the Person Following method. These results show that the AR Point&Click is a feasible interaction model for setting navigation goals.

artificial intelligence, human computer interaction, interface, (14 more...)

2203.15219

Country:

Oceania > Australia (0.04)
North America > Canada > British Columbia > Vancouver Island > Capital Regional District > Victoria (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.69)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)

Sekiguchi, Kouhei, Nugraha, Aditya Arie, Du, Yicheng, Bando, Yoshiaki, Fontaine, Mathieu, Yoshii, Kazuyoshi

Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments

arXiv.org Artificial IntelligenceJul-15-2022

This paper describes the practical response- and performance-aware development of online speech enhancement for an augmented reality (AR) headset that helps a user understand conversations made in real noisy echoic environments (e.g., cocktail party). One may use a state-of-the-art blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) that works well in various environments thanks to its unsupervised nature. Its heavy computational cost, however, prevents its application to real-time processing. In contrast, a supervised beamforming method that uses a deep neural network (DNN) for estimating spatial information of speech and noise readily fits real-time processing, but suffers from drastic performance degradation in mismatched conditions. Given such complementary characteristics, we propose a dual-process robust online speech enhancement method based on DNN-based beamforming with FastMNMF-guided adaptation. FastMNMF (back end) is performed in a mini-batch style and the noisy and enhanced speech pairs are used together with the original parallel training data for updating the direction-aware DNN (front end) with backpropagation at a computationally-allowable interval. This method is used with a blind dereverberation method called weighted prediction error (WPE) for transcribing the noisy reverberant speech of a speaker, which can be detected from video or selected by a user's hand gesture or eye gaze, in a streaming manner and spatially showing the transcriptions with an AR technique. Our experiment showed that the word error rate was improved by more than 10 points with the run-time adaptation using only twelve minutes of observation.

artificial intelligence, human computer interaction, machine learning, (16 more...)

2207.07296

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.15)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)
North America > United States (0.04)
(2 more...)

Genre: Research Report (0.70)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

#artificialintelligenceOct-22-2020

Augmented Reality Is Coming -- to Your Car's Windshield

Like millions of other kids around the world, Jamieson Christmas, now in his mid-forties, was transfixed the first time he saw director George Lucas' epic space opera Star Wars. "I'm a child of the '70s," he told Digital Trends. "I grew up when Star Wars was first released. George Lucas set up this vision of little robots beaming three-dimensional pictures of people. It had a really tremendous influence on me."

artificial intelligence, christmas, human computer interaction, (18 more...)

Country:

North America > United States > Kansas (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia (0.04)

Industry:

Automobiles & Trucks (1.00)
Transportation (0.98)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.52)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.49)

#artificialintelligenceOct-20-2020, 01:22:24 GMT

How to use human and artificial intelligence with digital twins

Human intelligence has been creating and maintaining complex systems since the beginnings of civilizations. In modern times, digital twins have emerged to aid operations of complex systems, as well as improve design and production. Artificial intelligence (AI) and extended reality (XR) – including augmented reality (AR) and virtual reality (VR) – have emerged as tools that can help manage operations for complex systems. Digital twins can be enhanced with AI and emerging user interface (UI) technologies like XR can improve people's abilities to manage complex systems via digital twins. Digital twins can marry human and AI to produce something far greater by creating a usable representation of complex systems. End users do not need to worry about the formulas that go into machine learning (ML), predictive modeling and artificially intelligent systems, but also can capitalize on their power as an extension of their own knowledge and abilities. Digital twins combined with AR, VR and related technologies provide a framework to overlay intelligent decision making into day-to-day operations, as shown in Figure 1. Figure 1: A digital twin can be enhanced with artificial intelligence (AI) and intelligent realities user interfaces, such as extended reality (XR), which includes augmented reality (AR) and virtual reality (VR). The operations of a physical twin can be digitized by sensors, cameras and other such devices, but those digital streams are not the only sources of data that can feed the digital twin. In addition to streaming data, accumulated historical data can inform a digital twin. Relevant data could include data not generated from the asset itself, such as weather and business cycle data. Also, computer-aided design (CAD) drawings and other documentation can help the digital twin provide context.

artificial intelligence, digital twin, machine learning, (17 more...)

Country: North America > United States > North Carolina > Wake County > Cary (0.04)

Industry: Information Technology (0.88)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

#artificialintelligenceOct-7-2020, 14:06:28 GMT

3d Scanner App

I've been testing 3D scanning hardware and apps for years and this is one of the fastest and easiest ones that I've used. It's great to see how far the technology has come in both quality and cost reduction. I bought the 2020 iPad pro specifically for the new lidar sensor and applications like this. If you already have an iPad with this sensor (or hopefully iPhone when they come out), definitely download this app and try it out - it's free! The mesh is generated quickly thanks to Apple's lidar sensor and this app shows the triangles in realtime as they are generated and refined.

artificial intelligence, primesense camera, sensor, (10 more...)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Vision (0.62)

#artificialintelligenceMar-6-2020, 00:32:52 GMT

Augmented Reality: Leading AR companies named

Augmented Reality is still developing as a technology, but is beginning to move into the mainstream. The big tech companies are scrambling to build sustainable AR ecosystems to gain early foothold in the potentially lucrative market, while specialist firms are focusing on areas like content development. In 2018, Alibaba, the Chinese ecommerce giant launched Taobao Buy, an app that aims to make online shopping more interactive. The app, accessible via Microsoft's HoloLens headsets, allows users to browse and interact for a select range of products from Alibaba's online store. Alibaba acquired Infinity AR and has also invested in Augmented Reality companies like WayRay and Magic Leap.

augmented reality, headset, smart glass, (13 more...)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Information Technology > Services > e-Commerce Services (0.55)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.99)