real world environment
Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation
Zhang, Chuye; Zhang, Xiaoxiong; Pan, Wei; Zheng, Linfang; Zhang, Wei
Robotic manipulation in unstructured environments requires systems that can generalize across diverse tasks while maintaining robust and reliable performance. We introduce GVF-TAPE, a closed-loop framework that combines generative visual foresight with task-agnostic pose estimation to enable scalable robotic manipulation. GVF-TAPE employs a generative video model to predict future RGB-D frames from a single side-view RGB image and a task description, offering visual plans that guide robot actions. A decoupled pose estimation model then extracts end-effector poses from the predicted frames, translating them into executable commands via low-level controllers. By iteratively integrating video foresight and pose estimation in a closed loop, GVF-TAPE achieves real-time, adaptive manipulation across a broad range of tasks. Extensive experiments in both simulation and real-world settings demonstrate that our approach reduces reliance on task-specific action data and generalizes effectively, providing a practical and scalable solution for intelligent robotic systems.
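The predict-estimate-execute cycle described in the abstract can be summarized in a short control sketch. The class and method names below (the foresight model, pose estimator, `estimate`, `move_to`, and the camera interface) are illustrative assumptions, not the authors' released API; this is a minimal sketch of the closed loop, not the actual implementation.

```python
# Minimal sketch of the closed-loop structure described in the abstract.
# All component interfaces here are hypothetical placeholders.

from dataclasses import dataclass
import numpy as np


@dataclass
class EndEffectorPose:
    position: np.ndarray      # (3,) xyz in the robot base frame
    orientation: np.ndarray   # (4,) quaternion
    gripper_open: bool


class GVFTAPELoop:
    def __init__(self, foresight_model, pose_estimator, controller):
        self.foresight = foresight_model      # generative video model: RGB + task -> future RGB-D
        self.pose_estimator = pose_estimator  # task-agnostic pose model: RGB-D frame -> pose
        self.controller = controller          # low-level controller executing pose targets

    def step(self, rgb_frame: np.ndarray, task: str) -> EndEffectorPose:
        # 1. Predict a short horizon of future RGB-D frames from one side view.
        predicted_frames = self.foresight.predict(rgb_frame, task)
        # 2. Extract the end-effector pose from the next predicted frame.
        target_pose = self.pose_estimator.estimate(predicted_frames[0])
        # 3. Hand the pose to the low-level controller.
        self.controller.move_to(target_pose)
        return target_pose

    def run(self, camera, task: str, max_steps: int = 50):
        # Re-plan from the latest observation at every step (closed loop).
        for _ in range(max_steps):
            rgb = camera.capture()
            self.step(rgb, task)
            if self.controller.task_done():
                break
```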
ViSTa Dataset: Do vision-language models understand sequential tasks?
Wybitul, Evžen; Gunter, Evan Ryan; Seleznyov, Mikhail; Lindner, David
Using vision-language models (VLMs) as reward models in reinforcement learning holds promise for reducing costs and improving safety. So far, VLM reward models have only been used for goal-oriented tasks, where the agent must reach a particular final outcome. We explore VLMs' potential to supervise tasks that cannot be scored by the final state alone. To this end, we introduce ViSTa, a dataset for evaluating Vision-based understanding of Sequential Tasks. ViSTa comprises over 4,000 videos with step-by-step descriptions in virtual home, Minecraft, and real-world environments. Its novel hierarchical structure -- basic single-step tasks composed into more and more complex sequential tasks -- allows a fine-grained understanding of how well VLMs can judge tasks with varying complexity. To illustrate this, we use ViSTa to evaluate state-of-the-art VLMs, including CLIP, ViCLIP, and GPT-4o. We find that, while they are all good at object recognition, they fail to understand sequential tasks, with only GPT-4o achieving non-trivial performance.
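To make the hierarchical structure concrete, here is a small sketch of how single-step tasks could compose into longer sequential tasks and how a VLM might be scored on them. The field names and the `score_video_text` method are hypothetical illustrations, not ViSTa's actual schema or evaluation API.

```python
# Illustrative sketch of the hierarchy described above: basic single-step
# tasks composed into longer sequential tasks. All names are assumptions.

from dataclasses import dataclass, field
from typing import List


@dataclass
class TaskStep:
    description: str          # e.g. "open the fridge"
    video_clip: str           # path to the clip showing this step


@dataclass
class SequentialTask:
    environment: str          # "virtual home", "Minecraft", or "real world"
    steps: List[TaskStep] = field(default_factory=list)

    @property
    def complexity(self) -> int:
        # Level 1 = single-step task; higher levels chain more steps.
        return len(self.steps)

    def full_description(self) -> str:
        return ", then ".join(step.description for step in self.steps)


def prefers_correct_order(vlm, task: SequentialTask, distractor: SequentialTask) -> bool:
    """Return True if the VLM scores the correct step order above a distractor."""
    clips = [step.video_clip for step in task.steps]
    correct = vlm.score_video_text(clips, task.full_description())
    wrong = vlm.score_video_text(clips, distractor.full_description())
    return correct > wrong
```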
Reviews: Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias
In this paper, a new dataset for robot grasping is proposed. In contrast to grasping data collected in a lab environment, the authors propose to collect data from real-world environments (homes). To collect data in the wild, they use inexpensive, low-DoF robots. To compensate for the noisy behavior of these poorly calibrated robots, they model the noise as a latent variable and learn it jointly with the grasping task. Results show that combining these ideas yields a grasping model that works well both in lab environments and in new real-world environments.
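A minimal sketch of the latent-noise idea, assuming the noise is represented as a learned per-robot embedding that the grasp predictor is conditioned on; the layer sizes and overall architecture below are illustrative assumptions, not the paper's actual model.

```python
# Sketch: model per-robot noise as a learned latent embedding and condition
# the grasp predictor on it, so noise and grasping are learned jointly.

import torch
import torch.nn as nn


class NoiseAwareGraspNet(nn.Module):
    def __init__(self, num_robots: int, noise_dim: int = 8, feat_dim: int = 128):
        super().__init__()
        # One learned latent noise vector per (poorly calibrated) robot.
        self.robot_noise = nn.Embedding(num_robots, noise_dim)
        self.backbone = nn.Sequential(            # stand-in image encoder
            nn.Conv2d(3, 16, 5, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, 5, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, feat_dim), nn.ReLU(),
        )
        # The grasp head sees image features plus the robot's noise latent.
        self.grasp_head = nn.Sequential(
            nn.Linear(feat_dim + noise_dim, 64), nn.ReLU(),
            nn.Linear(64, 1),                     # grasp success logit
        )

    def forward(self, image: torch.Tensor, robot_id: torch.Tensor) -> torch.Tensor:
        feats = self.backbone(image)              # (B, feat_dim)
        z = self.robot_noise(robot_id)            # (B, noise_dim)
        return self.grasp_head(torch.cat([feats, z], dim=-1))
```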
The One [Simple] Method AI Implementers Use For Success
Who do you blame when AI projects fail? The data? You can certainly blame solving the wrong problem with AI, or applying AI when you don't need it at all. But what happens when you have an application well suited to AI and the project still fails? Sometimes it comes down to a simple principle: don't take so long. At a recent Enterprise Data & AI event, a presenter shared that their AI projects take, on average, 18 to 24 months to go from concept to production.
The ingredients of real world robotic reinforcement learning
Robots have been useful in environments that can be carefully controlled, such as industrial settings (e.g., assembly lines). In unstructured settings like the home, however, we need robotic systems that can adapt to the diversity of the real world. Learning-based algorithms have the potential to let robots acquire complex behaviors adaptively in unstructured environments by leveraging data collected from the environment. In particular, with reinforcement learning, robots learn novel behaviors through trial-and-error interaction. This is especially important as we deploy robots in scenarios where the environment may not be known in advance.
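As a concrete illustration of learning by trial and error, the sketch below runs tabular Q-learning on a toy one-dimensional reach-the-goal task. The environment is a stand-in used only to show the interaction loop, not a real robotic setup or any particular method from the article.

```python
# Tabular Q-learning on a toy "move right to reach the goal" task:
# act, observe the reward, and update the value estimate from the outcome.

import random

N_STATES, GOAL = 5, 4          # states 0..4, goal at the right end
ACTIONS = [-1, +1]             # move left or right
q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

alpha, gamma, epsilon = 0.1, 0.9, 0.2

for episode in range(500):
    state = 0
    while state != GOAL:
        # Explore occasionally, otherwise act greedily (trial and error).
        if random.random() < epsilon:
            action = random.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        next_state = min(max(state + action, 0), N_STATES - 1)
        reward = 1.0 if next_state == GOAL else -0.01
        # Update the value estimate toward the observed outcome.
        best_next = max(q[(next_state, a)] for a in ACTIONS)
        q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
        state = next_state

# After training, the learned first action from the start state is +1 (move right).
print(max(ACTIONS, key=lambda a: q[(0, a)]))
```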
At A Glance – Embodied AI - Disruption Hub
Embodied AI is one of many terms associated with the relentless development of Artificial Intelligence. As the name suggests, it involves equipping software with a physical body and exploring how that body fits into real world environments. Embodied AI is based on embodied cognition – the idea that intelligence is as much a part of the body as the brain. By applying this logic to artificially intelligent systems, researchers hope to improve their functionality. Process automation, chatbots, advanced robotics, autonomous drive technology, and personal companions like Buddy and Jibo could all benefit from embodied intelligence.
Virtual Reality is the Next Training Ground for Artificial Intelligence
Virtual reality was imagined as a human simulation technology long before the wave of innovation that brought us the Oculus Rift and the headsets that followed. Now, rendering high-framerate, stereoscopic graphics from multiple viewpoints in virtual reality can match the speed and accuracy of robotic sensors and cameras. By modeling physics, motion, and material interactions, virtual reality is poised to become a simulation tool for training automatons - robots, drones, and diagnostic gear - before they need to perform in the real world. Recent advancements point to a potentially disruptive combination of virtual reality and artificial intelligence that will unlock a future with safe and competent intelligent machines, able to learn rapidly through self-training and intelligent, realistic simulations. Ongoing academic work in machine learning and virtual reality has been migrating to corporations and startups through open source initiatives and the movement of skilled people across academic, startup, and corporate workplaces.
Discovering Patterns of Autistic Planning
Galitsky, Boris (University of Girona) | Jarrold, William (University of California, Davis)
We analyze the patterns of autistic reasoning during planning tasks. The formalism of non-monotonic default logic is used to simulate autistic decision-making when adjusting an action to a context. Our current main finding is that, while people with autism may be able to process single default rules, they have a characteristic difficulty in cases where multiple default rules conflict. Even though default reasoning was intended to simulate the reasoning of typical human subjects, it turns out that following its operational semantics in a literal way reproduces the peculiarities of autistic behavior observed in the literature.
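The difficulty with conflicting defaults can be illustrated with a small sketch (not the paper's formalism): a naive procedure that applies default rules literally and in order handles a single default cleanly, but with two conflicting defaults it simply commits to whichever rule fires first. All rule and fact names below are illustrative.

```python
# Defaults of the form "if the prerequisite holds and the conclusion is
# consistent (not blocked), conclude it", applied literally and in order.

from dataclasses import dataclass
from typing import Set


@dataclass
class Default:
    prerequisite: str
    conclusion: str
    blocked_by: str          # the justification fails if this belief is present


def apply_defaults(facts: Set[str], defaults) -> Set[str]:
    beliefs = set(facts)
    for d in defaults:       # literal, in-order application
        if d.prerequisite in beliefs and d.blocked_by not in beliefs:
            beliefs.add(d.conclusion)
    return beliefs


# Single default: "birds normally fly" -> behaves as expected.
bird_flies = Default("bird", "flies", blocked_by="not_flies")
print(apply_defaults({"bird"}, [bird_flies]))                  # {'bird', 'flies'}

# Two conflicting defaults (a Nixon-diamond style case): the literal procedure
# keeps the conclusion of whichever default happens to be processed first.
pacifist = Default("quaker", "pacifist", blocked_by="not_pacifist")
hawk = Default("republican", "not_pacifist", blocked_by="pacifist")
print(apply_defaults({"quaker", "republican"}, [pacifist, hawk]))   # concludes 'pacifist'
print(apply_defaults({"quaker", "republican"}, [hawk, pacifist]))   # concludes 'not_pacifist'
```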