AITopics | Object-Oriented Architecture

Collaborating Authors

Object-Oriented Architecture

News Overviews Instructional Materials AI-Alerts Classics

Global Localization in Unstructured Environments using Semantic Object Maps Built from Various Viewpoints

Ankenbauer, Jacqueline, Lusk, Parker C., Thomas, Annika, How, Jonathan P.

arXiv.org Artificial IntelligenceOct-25-2023

We present a novel framework for global localization and guided relocalization of a vehicle in an unstructured environment. Compared to existing methods, our pipeline does not rely on cues from urban fixtures (e.g., lane markings, buildings), nor does it make assumptions that require the vehicle to be navigating on a road network. Instead, we achieve localization in both urban and non-urban environments by robustly associating and registering the vehicle's local semantic object map with a compact semantic reference map, potentially built from other viewpoints, time periods, and/or modalities. Robustness to noise, outliers, and missing objects is achieved through our graph-based data association algorithm. Further, the guided relocalization capability of our pipeline mitigates drift inherent in odometry-based localization after the initial global localization. We evaluate our pipeline on two publicly-available, real-world datasets to demonstrate its effectiveness at global localization in both non-urban and urban environments. The Katwijk Beach Planetary Rover dataset is used to show our pipeline's ability to perform accurate global localization in unstructured environments. Demonstrations on the KITTI dataset achieve an average pose error of 3.8m across all 35 localization events on Sequence 00 when localizing in a reference map created from aerial images. Compared to existing works, our pipeline is more general because it can perform global localization in unstructured environments using maps built from different viewpoints.

global localization, semantic object map, unstructured environment, (1 more...)

arXiv.org Artificial Intelligence

2303.04658

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.60)

Add feedback

Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation

Lei, Yinjie, Wang, Zixuan, Chen, Feng, Wang, Guoqing, Wang, Peng, Yang, Yang

arXiv.org Artificial IntelligenceOct-24-2023

Multi-modal 3D scene understanding has gained considerable attention due to its wide applications in many areas, such as autonomous driving and human-computer interaction. Compared to conventional single-modal 3D understanding, introducing an additional modality not only elevates the richness and precision of scene interpretation but also ensures a more robust and resilient understanding. This becomes especially crucial in varied and challenging environments where solely relying on 3D data might be inadequate. While there has been a surge in the development of multi-modal 3D methods over past three years, especially those integrating multi-camera images (3D+2D) and textual descriptions (3D+language), a comprehensive and in-depth review is notably absent. In this article, we present a systematic survey of recent progress to bridge this gap. We begin by briefly introducing a background that formally defines various 3D multi-modal tasks and summarizes their inherent challenges. After that, we present a novel taxonomy that delivers a thorough categorization of existing methods according to modalities and tasks, exploring their respective strengths and limitations. Furthermore, comparative results of recent approaches on several benchmark datasets, together with insightful analysis, are offered. Finally, we discuss the unresolved issues and provide several potential avenues for future research.

detection, information, point cloud, (14 more...)

arXiv.org Artificial Intelligence

2310.15676

Country:

Asia > China > Sichuan Province > Chengdu (0.04)
Oceania > Australia > South Australia > Adelaide (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Education (0.45)
Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.92)
(3 more...)

Add feedback

Tips for making the most of 64-bit architectures in langage design, libraries or garbage collection

Sonntag, Benoît, Colnet, Dominique

arXiv.org Artificial IntelligenceOct-24-2023

The 64-bit architectures that have become standard today offer unprecedented low-level programming possibilities. For the first time in the history of computing, the size of address registers far exceeded the physical capacity of their bus.After a brief reminder of the possibilities offered by the small size of addresses compared to the available 64 bits,we develop three concrete examples of how the vacant bits of these registers can be used.Among these examples, two of them concern the implementation of a library for a new statically typed programming language.Firstly, the implementation of multi-precision integers, with the aim of improving performance in terms of both calculation speed and RAM savings.The second example focuses on the library's handling of UTF-8 character strings.Here, the idea is to make indexing easier by ignoring the physical size of each UTF-8 characters.Finally, the third example is a possible enhancement of garbage collectors, in particular the mark \& sweep for the object marking phase.

64-bit architecture, architecture, integer, (16 more...)

arXiv.org Artificial Intelligence

2310.15632

Country:

Europe > France > Grand Est > Bas-Rhin > Strasbourg (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.04)

Genre: Research Report (0.40)

Industry: Water & Waste Management > Solid Waste Management (0.87)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.69)

Add feedback

Model of models -- Part 1

Komarovsky, Shimon

arXiv.org Artificial IntelligenceOct-24-2023

This paper proposes a new cognitive model, acting as the main component of an AGI agent. The model is introduced in its mature intelligence state, and as an extension of previous models, DENN, and especially AKREM, by including operational models (frames/classes) and will. This model's core assumption is that cognition is about operating on accumulated knowledge, with the guidance of an appropriate will. Also, we assume that the actions, part of knowledge, are learning to be aligned with will, during the evolution phase that precedes the mature intelligence state. In addition, this model is mainly based on the duality principle in every known intelligent aspect, such as exhibiting both top-down and bottom-up model learning, generalization verse specialization, and more. Furthermore, a holistic approach is advocated for AGI designing, and cognition under constraints or efficiency is proposed, in the form of reusability and simplicity. Finally, reaching this mature state is described via a cognitive evolution from infancy to adulthood, utilizing a consolidation principle. The final product of this cognitive model is a dynamic operational memory of models and instances. Lastly, some examples and preliminary ideas for the evolution phase to reach the mature state are presented.

knowledge, opération, representation, (16 more...)

arXiv.org Artificial Intelligence

2308.046

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > New York (0.04)
North America > United States > Virginia (0.04)
(4 more...)

Genre:

Workflow (0.92)
Research Report > Promising Solution (0.67)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Education (1.00)
Leisure & Entertainment (0.92)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.67)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Communications (1.00)
(17 more...)

Add feedback

Unifying Foundation Models with Quadrotor Control for Visual Tracking Beyond Object Categories

Saviolo, Alessandro, Rao, Pratyaksh, Radhakrishnan, Vivek, Xiao, Jiuhong, Loianno, Giuseppe

arXiv.org Artificial IntelligenceOct-17-2023

Visual control enables quadrotors to adaptively navigate using real-time sensory data, bridging perception with action. Yet, challenges persist, including generalization across scenarios, maintaining reliability, and ensuring real-time responsiveness. This paper introduces a perception framework grounded in foundation models for universal object detection and tracking, moving beyond specific training categories. Integral to our approach is a multi-layered tracker integrated with the foundation detector, ensuring continuous target visibility, even when faced with motion blur, abrupt light shifts, and occlusions. Complementing this, we introduce a model-free controller tailored for resilient quadrotor visual tracking. Our system operates efficiently on limited hardware, relying solely on an onboard camera and an inertial measurement unit. Through extensive validation in diverse challenging indoor and outdoor environments, we demonstrate our system's effectiveness and adaptability. In conclusion, our research represents a step forward in quadrotor visual tracking, moving from task-specific methods to more versatile and adaptable operations.

quadrotor control, unifying foundation model, visual tracking, (1 more...)

arXiv.org Artificial Intelligence

2310.04781

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.40)

Add feedback

Learn Python for less than $25 through October 15th

PCWorldOct-9-2023, 08:00:00 GMT

Python is the world's most popular programming language not just because it's relatively easy to learn, but because it's used in so many applications and it's highly scalable. If you've ever wanted to learn to code, starting with Python is a great idea. And it's an even better idea now because during Deal Days, you can get The Premium Python Programming Certification Bundle for $23.97 (reg. This bundle includes ten courses from top instructors like Joe Rahl (4.6/5-star instructor rating) and Edouard Renard (4.6/5-star rating). Starting with the absolute basics of Python, you'll learn basic coding principles, explore Object-Oriented Programming (OOP), and much more as you slowly level up your skills.

learn python, premium python programming certification bundle, python

PCWorld

Technology:

Information Technology > Software > Programming Languages (0.64)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.64)

Add feedback

Soda: An Object-Oriented Functional Language for Specifying Human-Centered Problems

Mendez, Julian Alfredo

arXiv.org Artificial IntelligenceOct-3-2023

We present Soda (Symbolic Objective Descriptive Analysis), a language that helps to treat qualities and quantities in a natural way and greatly simplifies the task of checking their correctness. We present key properties for the language motivated by the design of a descriptive language to encode complex requirements on computer systems, and we explain how these key properties must be addressed to model these requirements with simple definitions. We give an overview of a tool that helps to describe problems in an easy way that we consider more transparent and less error-prone.

programming language, requirement, soda, (15 more...)

arXiv.org Artificial Intelligence

2310.01961

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Sweden > Västerbotten County > Umeå (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre:

Overview (0.75)
Research Report (0.50)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.94)

Add feedback

Towards Robust Robot 3D Perception in Urban Environments: The UT Campus Object Dataset

Zhang, Arthur, Eranki, Chaitanya, Zhang, Christina, Park, Ji-Hwan, Hong, Raymond, Kalyani, Pranav, Kalyanaraman, Lochana, Gamare, Arsh, Bagad, Arnav, Esteva, Maria, Biswas, Joydeep

arXiv.org Artificial IntelligenceOct-1-2023

We introduce the UT Campus Object Dataset (CODa), a mobile robot egocentric perception dataset collected on the University of Texas Austin Campus. Our dataset contains 8.5 hours of multimodal sensor data: synchronized 3D point clouds and stereo RGB video from a 128-channel 3D LiDAR and two 1.25MP RGB cameras at 10 fps; RGB-D videos from an additional 0.5MP sensor at 7 fps, and a 9-DOF IMU sensor at 40 Hz. We provide 58 minutes of ground-truth annotations containing 1.3 million 3D bounding boxes with instance IDs for 53 semantic classes, 5000 frames of 3D semantic annotations for urban terrain, and pseudo-ground truth localization. We repeatedly traverse identical geographic locations for a wide range of indoor and outdoor areas, weather conditions, and times of the day. Using CODa, we empirically demonstrate that: 1) 3D object detection performance in urban settings is significantly higher when trained using CODa compared to existing datasets even when employing state-of-the-art domain adaptation approaches, 2) sensor-specific fine-tuning improves 3D object detection accuracy and 3) pretraining on CODa improves cross-dataset 3D object detection performance in urban settings compared to pretraining on AV datasets. Using our dataset and annotations, we release benchmarks for 3D object detection and 3D semantic segmentation using established metrics. In the future, the CODa benchmark will include additional tasks like unsupervised object discovery and re-identification. We publicly release CODa on the Texas Data Repository, pre-trained models, dataset development package, and interactive dataset viewer on our website at https://amrl.cs.utexas.edu/coda. We expect CODa to be a valuable dataset for research in egocentric 3D perception and planning for autonomous navigation in urban environments.

annotation, coda, dataset, (16 more...)

arXiv.org Artificial Intelligence

2309.13549

Country:

North America > United States > Texas > Travis County > Austin (0.48)
North America > United States > Michigan (0.04)
Europe > Germany (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Information Technology (0.93)
Transportation > Ground (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.47)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.35)

Add feedback

ConSOR: A Context-Aware Semantic Object Rearrangement Framework for Partially Arranged Scenes

Ramachandruni, Kartik, Zuo, Max, Chernova, Sonia

arXiv.org Artificial IntelligenceSep-30-2023

Object rearrangement is the problem of enabling a robot to identify the correct object placement in a complex environment. Prior work on object rearrangement has explored a diverse set of techniques for following user instructions to achieve some desired goal state. Logical predicates, images of the goal scene, and natural language descriptions have all been used to instruct a robot in how to arrange objects. In this work, we argue that burdening the user with specifying goal scenes is not necessary in partially-arranged environments, such as common household settings. Instead, we show that contextual cues from partially arranged scenes (i.e., the placement of some number of pre-arranged objects in the environment) provide sufficient context to enable robots to perform object rearrangement \textit{without any explicit user goal specification}. We introduce ConSOR, a Context-aware Semantic Object Rearrangement framework that utilizes contextual cues from a partially arranged initial state of the environment to complete the arrangement of new objects, without explicit goal specification from the user. We demonstrate that ConSOR strongly outperforms two baselines in generalizing to novel object arrangements and unseen object categories. The code and data can be found at https://github.com/kartikvrama/consor.

category, consor, schema, (15 more...)

arXiv.org Artificial Intelligence

2310.00371

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement

Chang, Haonan, Gao, Kai, Boyalakuntla, Kowndinya, Lee, Alex, Huang, Baichuan, Kumar, Harish Udhaya, Yu, Jinjin, Boularias, Abdeslam

arXiv.org Artificial IntelligenceSep-27-2023

We introduce a novel approach to the executable semantic object rearrangement problem. In this challenge, a robot seeks to create an actionable plan that rearranges objects within a scene according to a pattern dictated by a natural language description. Unlike existing methods such as StructFormer and StructDiffusion, which tackle the issue in two steps by first generating poses and then leveraging a task planner for action plan formulation, our method concurrently addresses pose generation and action planning. We achieve this integration using a Language-Guided Monte-Carlo Tree Search (LGMCTS). Quantitative evaluations are provided on two simulation datasets, and complemented by qualitative tests with a real robot.

executable semantic object rearrangement, language-guided monte-carlo tree search, lgmct

arXiv.org Artificial Intelligence

2309.15821

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.60)

Add feedback