AITopics

2501.01119

Country:

Asia > China > Hong Kong (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

arXiv.org Artificial Intelligence, Feb-6-2024

FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models

Liu, Chuhao, Wang, Ke, Shi, Jieqi, Qiao, Zhijian, Shen, Shaojie

Semantic mapping based on supervised object detectors is sensitive to image distribution. In real-world environments, object detection and segmentation performance can drop sharply, preventing the use of semantic mapping in a wider domain. On the other hand, vision-language foundation models demonstrate strong zero-shot transferability across data distributions, which provides an opportunity to construct generalizable instance-aware semantic maps. Hence, this work explores how to boost instance-aware semantic mapping using object detections generated by foundation models. We propose a probabilistic label fusion method to predict closed-set semantic classes from open-set label measurements. An instance refinement module merges the over-segmented instances caused by inconsistent segmentation. We integrate all the modules into a unified semantic mapping system. Given a sequence of RGB-D input, our system incrementally reconstructs an instance-aware semantic map. We evaluate the zero-shot performance of our method on the ScanNet and SceneNN datasets. Our method achieves 40.3 mean average precision (mAP) on the ScanNet semantic instance segmentation task, significantly outperforming traditional semantic mapping methods.

detection, foundation model, semantic class

2402.04555

Country: Asia > China > Hong Kong (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial Intelligence, Jan-11-2024

Kimera2: Robust and Accurate Metric-Semantic SLAM in the Real World

Abate, Marcus, Chang, Yun, Hughes, Nathan, Carlone, Luca

In particular, we enhance Kimera-VIO, the visual-inertial odometry pipeline powering Kimera, to support better feature tracking, more efficient keyframe selection, and various input modalities (e.g., monocular, stereo, and RGB-D images, as well as wheel odometry). Additionally, Kimera-RPGO and Kimera-PGMO, Kimera's pose-graph optimization backends, are updated to support modern outlier rejection methods (specifically, Graduated Non-Convexity) for improved robustness to spurious loop closures. These new features are evaluated extensively on a variety of simulated and real robotic platforms, including drones, quadrupeds, wheeled robots, and simulated self-driving cars. We present comparisons against several state-of-the-art visual-inertial SLAM pipelines and discuss the strengths and weaknesses of the new release of Kimera.

dataset, keyframe, kimera

2401.06323

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report (0.50)

Industry:

Transportation > Ground (0.34)
Information Technology > Robotics & Automation (0.34)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

arXiv.org Artificial Intelligence, Sep-16-2022

ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM

Arun, Aditya, Hunter, William, Ayyalasomayajula, Roshan, Bharadia, Dinesh

Recent interest in autonomous navigation and exploration robots for indoor applications has spurred research into indoor Simultaneous Localization and Mapping (SLAM) robot systems. While most of these SLAM systems use visual and LiDAR sensors in tandem with an odometry sensor, these odometry sensors drift over time. To combat this drift, visual SLAM systems deploy compute- and memory-intensive search algorithms to detect "loop closures", which make the trajectory estimate globally consistent. To circumvent these resource-intensive (compute and memory) algorithms, we present ViWiD, which integrates WiFi and visual sensors in a dual-layered system. This dual-layered approach separates the tasks of local and global trajectory estimation, making ViWiD resource-efficient while achieving performance on par with or better than state-of-the-art visual SLAM. We demonstrate ViWiD's performance on four datasets, covering over 1500 m of traversed path, and show 4.3x and 4x reductions in compute and memory consumption, respectively, compared to state-of-the-art visual and LiDAR SLAM systems, with on-par SLAM performance.

artificial intelligence, cartographer, viwid

2209.08091

Country: North America > United States > California > San Diego County > San Diego (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)

#artificialintelligence, Jul-23-2020, 17:10:27 GMT

Giving Robots Human-Like Perception of Their Physical Environments

Figure caption: Kimera builds a dense 3D semantic mesh of an environment and can track humans in the environment. The figure shows a multi-frame action sequence of a human moving in the scene.

"Alexa, go to the kitchen and fetch me a snack." Wouldn't we all appreciate a little help around the house, especially if that help came in the form of a smart, adaptable, uncomplaining robot? Sure, there are the one-trick Roombas of the appliance world. But MIT engineers are envisioning robots more like home helpers, able to follow high-level, Alexa-type commands, such as "Go to the kitchen and fetch me a coffee cup."

artificial intelligence, carlone, robot

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

#artificialintelligence, Jul-16-2020, 16:19:15 GMT

Giving robots human-like perception of their physical environments

To carry out such high-level tasks, researchers believe robots will have to be able to perceive their physical environment as humans do. "In order to make any decision in the world, you need to have a mental model of the environment around you," says Luca Carlone, assistant professor of aeronautics and astronautics at MIT. "This is something so effortless for humans. But for robots it's a painfully hard problem, where it's about transforming pixel values that they see through a camera, into an understanding of the world." Now Carlone and his students have developed a representation of spatial perception for robots that is modeled after the way humans perceive and navigate the world. The new model, which they call 3D Dynamic Scene Graphs, enables a robot to quickly generate a 3D map of its surroundings that also includes objects and their semantic labels (a chair versus a table, for instance), as well as people, rooms, walls, and other structures that the robot is likely seeing in its environment.

artificial intelligence, perception, robot

Industry: Government > Military (0.30)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

#artificialintelligence, Jul-16-2020, 05:00:04 GMT

Alexa, go to the kitchen and fetch me a snack

Wouldn't we all appreciate a little help around the house, especially if that help came in the form of a smart, adaptable, uncomplaining robot? Sure, there are the one-trick Roombas of the appliance world. But MIT engineers are envisioning robots more like home helpers, able to follow high-level, Alexa-type commands, such as "Go to the kitchen and fetch me a coffee cup." To carry out such high-level tasks, researchers believe robots will have to be able to perceive their physical environment as humans do. "In order to make any decision in the world, you need to have a mental model of the environment around you," says Luca Carlone, assistant professor of aeronautics and astronautics at MIT. "This is something so effortless for humans. But for robots it's a painfully hard problem, where it's about transforming pixel values that they see through a camera, into an understanding of the world."

artificial intelligence, carlone, robot

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)

Industry: Government > Military (0.30)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

#artificialintelligence, Jul-12-2018, 00:06:27 GMT

Moving Humanity Forward

Kimera has created the world's first Artificial General Intelligence (AGI). Unlike traditional AI, which is limited to one field or set of tasks, AGI can do virtually anything across any industry. Kimera's vision is to use this technology to move humanity forward. TOKEN SALE IS NOW LIVE: https://kimera.ai/

artificial intelligence, humanity forward, social media

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.76)

#artificialintelligence, Jun-7-2018, 04:57:23 GMT

Kimera Systems ICO – Moving Humanity Forward

AGI is a highly advanced and complex technology. It required years of research and creative thinking to develop Nigel AGI. Kimera wants this technology to be owned by a multitude of individuals and wants many of you to understand it. Kimera's AGI is based on the General Theory of Intelligence, which defines intelligence from a quantum physics perspective rather than a neuroscience one. To enable continued general learning, the single algorithm focuses on learning cause and effect by observing reality through users' devices.

artificial intelligence, humanity forward, kimera

Technology: Information Technology > Artificial Intelligence (0.70)

#artificialintelligence, May-19-2018, 10:32:09 GMT

Kimera.ai

Kimera is the creator of Nigel AGI, the world's first Artificial General Intelligence.

artificial intelligence, kimera

Technology: Information Technology > Artificial Intelligence (1.00)