AITopics | Object-Oriented Architecture

Collaborating Authors

Object-Oriented Architecture

News Overviews Instructional Materials AI-Alerts Classics

Alexa Arena: A User-Centric Interactive Platform for Embodied AI

Gao, Qiaozi, Thattai, Govind, Shakiah, Suhaila, Gao, Xiaofeng, Pansare, Shreyas, Sharma, Vasu, Sukhatme, Gaurav, Shi, Hangjie, Yang, Bofei, Zheng, Desheng, Hu, Lucy, Arumugam, Karthika, Hu, Shui, Wen, Matthew, Guthy, Dinakar, Chung, Cadence, Khanna, Rohan, Ipek, Osman, Ball, Leslie, Bland, Kate, Rocker, Heather, Rao, Yadunandana, Johnston, Michael, Ghanadan, Reza, Mandal, Arindam, Tur, Dilek Hakkani, Natarajan, Prem

arXiv.org Artificial IntelligenceJun-7-2023

We introduce Alexa Arena, a user-centric simulation platform for Embodied AI (EAI) research. Alexa Arena provides a variety of multi-room layouts and interactable objects, for the creation of human-robot interaction (HRI) missions. With user-friendly graphics and control mechanisms, Alexa Arena supports the development of gamified robotic tasks readily accessible to general human users, thus opening a new venue for high-efficiency HRI data collection and EAI system evaluation. Along with the platform, we introduce a dialog-enabled instruction-following benchmark and provide baseline results for it. We make Alexa Arena publicly available to facilitate research in building generalizable and assistive embodied agents.

large language model, natural language, object-oriented architecture, (18 more...)

arXiv.org Artificial Intelligence

2303.01586

Country: Europe > Sweden > Skåne County > Malmö (0.04)

Genre: Questionnaire & Opinion Survey (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.93)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)
(3 more...)

Add feedback

Data Poisoning Attacks Against Multimodal Encoders

Yang, Ziqing, He, Xinlei, Li, Zheng, Backes, Michael, Humbert, Mathias, Berrang, Pascal, Zhang, Yang

arXiv.org Artificial IntelligenceJun-5-2023

Recently, the newly emerged multimodal models, which leverage both visual and linguistic modalities to train powerful encoders, have gained increasing attention. However, learning from a large-scale unlabeled dataset also exposes the model to the risk of potential poisoning attacks, whereby the adversary aims to perturb the model's training data to trigger malicious behaviors in it. In contrast to previous work, only poisoning visual modality, in this work, we take the first step to studying poisoning attacks against multimodal models in both visual and linguistic modalities. Specially, we focus on answering two questions: (1) Is the linguistic modality also vulnerable to poisoning attacks? and (2) Which modality is most vulnerable? To answer the two questions, we propose three types of poisoning attacks against multimodal models. Extensive evaluations on different datasets and model architectures show that all three attacks can achieve significant attack performance while maintaining model utility in both visual and linguistic modalities. Furthermore, we observe that the poisoning effect differs between different modalities. To mitigate the attacks, we propose both pre-training and post-training defenses. We empirically show that both defenses can significantly reduce the attack performance while preserving the model's utility.

artificial intelligence, machine learning, object-oriented architecture, (17 more...)

arXiv.org Artificial Intelligence

2209.15266

Country: Europe > Switzerland > Vaud > Lausanne (0.04)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.34)

Add feedback

Comparative Analysis of Widely use Object-Oriented Languages

Farooq, Muhammad Shoaib, Khan, Taymour zaman

arXiv.org Artificial IntelligenceJun-2-2023

Every day the programming environment is not only rapidly growing but also changing and languages are constantly evolving. Learning of object-oriented paradigm is compulsory in every computer science major so the choice of language to teach object-oriented principles is very important. Due to large pool of object-oriented languages, it is difficult to choose which should be the first programming language in order to teach object-oriented principles. Many studies shown which should be the first language to tech object-oriented concepts but there is no method to compare and evaluate these languages. In this article we proposed a comprehensive framework to evaluate the widely used object-oriented languages. The languages are evaluated basis of their technical and environmental features. Furthermore, we have constructed a scoring function based on proposed evaluation framework which provides us a language's quantitative score allow us to determine which language is acceptable as first object-oriented language to teach. Moreover, we have also calculated the conformance of widely used object-oriented languages.

object-oriented architecture, programming language, python, (18 more...)

arXiv.org Artificial Intelligence

2306.01819

Country:

North America > United States > California > Madera County > Madera (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Europe > United Kingdom (0.04)
(3 more...)

Genre: Research Report (0.81)

Industry: Education > Curriculum > Subject-Specific Education (0.34)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)

Add feedback

Egocentric Planning for Scalable Embodied Task Achievement

Liu, Xiaotian, Palacios, Hector, Muise, Christian

arXiv.org Artificial IntelligenceJun-2-2023

Embodied agents face significant challenges when tasked with performing actions in diverse environments, particularly in generalizing across object types and executing suitable actions to accomplish tasks. Furthermore, agents should exhibit robustness, minimizing the execution of illegal actions. In this work, we present Egocentric Planning, an innovative approach that combines symbolic planning and Object-oriented POMDPs to solve tasks in complex environments, harnessing existing models for visual perception and natural language processing. We evaluated our approach in ALFRED, a simulated environment designed for domestic tasks, and demonstrated its high scalability, achieving an impressive 36.07% unseen success rate in the ALFRED benchmark and winning the ALFRED challenge at CVPR Embodied AI workshop. Our method requires reliable perception and the specification or learning of a symbolic description of the preconditions and effects of the agent's actions, as well as what object types reveal information about others. It is capable of naturally scaling to solve new tasks beyond ALFRED, as long as they can be solved using the available skills. This work offers a solid baseline for studying end-to-end and hybrid methods that aim to generalize to new tasks, including recent approaches relying on LLMs, but often struggle to scale to long sequences of actions or produce robust plans for novel tasks.

machine learning, natural language, object-oriented architecture, (21 more...)

arXiv.org Artificial Intelligence

2306.01295

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Ontario > Kingston (0.14)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry: Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.67)

Add feedback

1MDB suspect and Jho Low associate dies weeks after questioning

Al JazeeraMay-31-2023, 03:09:41 GMT

A suspect in the 1MDB scandal has died weeks after being deported to Malaysia to face questioning over his role in the $4.5bn fraud. Kee Kok Thiam died in hospital on Monday following a "sudden massive stroke" and was cremated on Wednesday morning, Kee's family said in a statement. "We urge all parties not to entertain any speculations on this unfortunate event and allow the family the space to grief [sic] on his passing," the statement said. News of the 56-year-old businessman's death comes hours after Al Jazeera reported that the Malaysian Anti-Corruption Commission (MACC) had confirmed the whereabouts of fugitive Malaysian financier Jho Taek Low – the alleged mastermind of the 1MDB scandal – in Macau based on its questioning of Kee. The MACC said that Kee, who was deported from Macau earlier this month, revealed he had met with Low and other 1MDB fugitives in the Chinese territory and that Low had instructed him "not to return to Malaysia as a witness in the 1MDB case".

jho low associate die week, kee, malaysia, (4 more...)

Al Jazeera

Country:

Asia > Malaysia (0.91)
Asia > Macao (0.52)
Oceania > Australia (0.08)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)

Add feedback

LOWA: Localize Objects in the Wild with Attributes

Guo, Xiaoyuan, Chen, Kezhen, Rao, Jinmeng, Zhang, Yawen, Sun, Baochen, Yang, Jie

arXiv.org Artificial IntelligenceMay-31-2023

We present LOWA, a novel method for localizing objects with attributes effectively in the wild. It aims to address the insufficiency of current open-vocabulary object detectors, which are limited by the lack of instance-level attribute classification and rare class names. To train LOWA, we propose a hybrid vision-language training strategy to learn object detection and recognition with class names as well as attribute information. With LOWA, users can not only detect objects with class names, but also able to localize objects by attributes. LOWA is built on top of a two-tower vision-language architecture and consists of a standard vision transformer as the image encoder and a similar transformer as the text encoder. To learn the alignment between visual and text inputs at the instance level, we train LOWA with three training steps: object-level training, attribute-aware learning, and free-text joint training of objects and attributes. This hybrid training strategy first ensures correct object detection, then incorporates instance-level attribute information, and finally balances the object class and attribute sensitivity. We evaluate our model performance of attribute classification and attribute localization on the Open-Vocabulary Attribute Detection (OVAD) benchmark and the Visual Attributes in the Wild (VAW) dataset, and experiments indicate strong zero-shot performance. Ablation studies additionally demonstrate the effectiveness of each training step of our approach.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2305.20047

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.71)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)

Add feedback

Generating Visual Spatial Description via Holistic 3D Scene Understanding

Zhao, Yu, Fei, Hao, Ji, Wei, Wei, Jianguo, Zhang, Meishan, Zhang, Min, Chua, Tat-Seng

arXiv.org Artificial IntelligenceMay-25-2023

Visual spatial description (VSD) aims to generate texts that describe the spatial relations of the given objects within images. Existing VSD work merely models the 2D geometrical vision features, thus inevitably falling prey to the problem of skewed spatial understanding of target objects. In this work, we investigate the incorporation of 3D scene features for VSD. With an external 3D scene extractor, we obtain the 3D objects and scene features for input images, based on which we construct a target object-centered 3D spatial scene graph (Go3D-S2G), such that we model the spatial semantics of target objects within the holistic 3D scenes. Besides, we propose a scene subgraph selecting mechanism, sampling topologically-diverse subgraphs from Go3D-S2G, where the diverse local structure features are navigated to yield spatially-diversified text generation. Experimental results on two VSD datasets demonstrate that our framework outperforms the baselines significantly, especially improving on the cases with complex visual spatial relations. Meanwhile, our method can produce more spatially-diversified generation. Code is available at https://github.com/zhaoyucs/VSD.

machine learning, natural language, object-oriented architecture, (19 more...)

arXiv.org Artificial Intelligence

2305.11768

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
Asia > Singapore > Central Region > Singapore (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Transportation > Ground > Rail (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.35)

Add feedback

Grounding and Distinguishing Conceptual Vocabulary Through Similarity Learning in Embodied Simulations

Ghaffari, Sadaf, Krishnaswamy, Nikhil

arXiv.org Artificial IntelligenceMay-23-2023

We present a novel method for using agent experiences gathered through an embodied simulation to ground contextualized word vectors to object representations. We use similarity learning to make comparisons between different object types based on their properties when interacted with, and to extract common features pertaining to the objects' behavior. We then use an affine transformation to calculate a projection matrix that transforms contextualized word vectors from different transformer-based language models into this learned space, and evaluate whether new test instances of transformed token vectors identify the correct concept in the object embedding space. Our results expose properties of the embedding spaces of four different transformer models and show that grounding object token vectors is usually more helpful to grounding verb and attribute token vectors than the reverse, which reflects earlier conclusions in the analogical reasoning and psycholinguistic literature.

information, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2305.13668

Country:

North America > United States > Colorado > Larimer County > Fort Collins (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
Asia > China > Hong Kong (0.04)

Genre:

Research Report > Promising Solution (0.34)
Research Report > New Finding (0.34)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.88)

Add feedback

Real-time Simultaneous Multi-Object 3D Shape Reconstruction, 6DoF Pose Estimation and Dense Grasp Prediction

Agrawal, Shubham, Chavan-Dafle, Nikhil, Kasahara, Isaac, Engin, Selim, Huh, Jinwook, Isler, Volkan

arXiv.org Artificial IntelligenceMay-16-2023

Abstract-- Robotic manipulation systems operating in complex environments rely on perception systems which provide information about the geometry (pose and 3D shape) of the objects in the scene along with other semantic information such as object labels. This information is then used for choosing the feasible grasps on relevant objects. In this paper, we present a novel method to provide this geometric and semantic information of all objects in the scene as well as feasible grasps on those objects simultaneously. The main advantage of our method is its speed as it avoids sequential perception and grasp planning steps. With detailed quantitative analysis we show that our method delivers competitive performance compared to the state-of-the-art dedicated methods for object shape, pose, and grasp predictions, while providing fast inference at 30 frames per second speed.

machine learning, object-oriented architecture, prediction, (21 more...)

arXiv.org Artificial Intelligence

2305.0951

Country: North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.47)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.42)

Add feedback

Sequence-Agnostic Multi-Object Navigation

Gireesh, Nandiraju, Agrawal, Ayush, Datta, Ahana, Banerjee, Snehasis, Sridharan, Mohan, Bhowmick, Brojeshwar, Krishna, Madhava

arXiv.org Artificial IntelligenceMay-10-2023

The Multi-Object Navigation (MultiON) task requires a robot to localize an instance (each) of multiple object classes. It is a fundamental task for an assistive robot in a home or a factory. Existing methods for MultiON have viewed this as a direct extension of Object Navigation (ON), the task of localising an instance of one object class, and are pre-sequenced, i.e., the sequence in which the object classes are to be explored is provided in advance. This is a strong limitation in practical applications characterized by dynamic changes. This paper describes a deep reinforcement learning framework for sequence-agnostic MultiON based on an actor-critic architecture and a suitable reward specification. Our framework leverages past experiences and seeks to reward progress toward individual as well as multiple target object classes. We use photo-realistic scenes from the Gibson benchmark dataset in the AI Habitat 3D simulation environment to experimentally show that our method performs better than a pre-sequenced approach and a state of the art ON method extended to MultiON.

machine learning, object-oriented architecture, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2305.06178

Country:

Europe > United Kingdom > England > West Midlands > Birmingham (0.04)
Asia > India > Telangana > Hyderabad (0.04)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

Add feedback