Goto

Collaborating Authors

 Object-Oriented Architecture


Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning

Neural Information Processing Systems

Object-oriented representations in reinforcement learning have shown promise in transfer learning, with previous research introducing a propositional object-oriented framework that has provably efficient learning bounds with respect to sample complexity. However, this framework has limitations in terms of the classes of tasks it can efficiently learn. In this paper we introduce a novel deictic object-oriented framework that has provably efficient learning bounds and can solve a broader range of tasks. Additionally, we show that this framework is capable of zero-shot transfer of transition dynamics across tasks and demonstrate this empirically for the Taxi and Sokoban domains. Papers published at the Neural Information Processing Systems Conference.


Hierarchical Object Representation for Open-Ended Object Category Learning and Recognition

Neural Information Processing Systems

Most robots lack the ability to learn new objects from past experiences. To migrate a robot to a new environment one must often completely re-generate the knowledge- base that it is running with. Since in open-ended domains the set of categories to be learned is not predefined, it is not feasible to assume that one can pre-program all object categories required by robots. Therefore, autonomous robots must have the ability to continuously execute learning and recognition in a concurrent and interleaved fashion. This paper proposes an open-ended 3D object recognition system which concurrently learns both the object categories and the statistical features for encoding objects. In particular, we propose an extension of Latent Dirichlet Allocation to learn structural semantic features (i.e.


Deep Symmetry Networks

Neural Information Processing Systems

The chief difficulty in object recognition is that objects' classes are obscured by a large number of extraneous sources of variability, such as pose and part deformation. These sources of variation can be represented by symmetry groups, sets of composable transformations that preserve object identity. Convolutional neural networks (convnets) achieve a degree of translational invariance by computing feature maps over the translation group, but cannot handle other groups. As a result, these groups' effects have to be approximated by small translations, which often requires augmenting datasets and leads to high sample complexity. In this paper, we introduce deep symmetry networks (symnets), a generalization of convnets that forms feature maps over arbitrary symmetry groups.


Partially-Supervised Image Captioning

Neural Information Processing Systems

Image captioning models are becoming increasingly successful at describing the content of images in restricted domains. However, if these models are to function in the wild --- for example, as assistants for people with impaired vision --- a much larger number and variety of visual concepts must be understood. To address this problem, we teach image captioning models new visual concepts from labeled images and object detection datasets. Since image labels and object classes can be interpreted as partial captions, we formulate this problem as learning from partially-specified sequence data. We then propose a novel algorithm for training sequence models, such as recurrent neural networks, on partially-specified sequences which we represent using finite state automata.


3D Object Proposals for Accurate Object Class Detection

Neural Information Processing Systems

The goal of this paper is to generate high-quality 3D object proposals in the context of autonomous driving. Our method exploits stereo imagery to place proposals in the form of 3D bounding boxes. We formulate the problem as minimizing an energy function encoding object size priors, ground plane as well as several depth informed features that reason about free space, point cloud densities and distance to the ground. Our experiments show significant performance gains over existing RGB and RGB-D object proposal methods on the challenging KITTI benchmark. Combined with convolutional neural net (CNN) scoring, our approach outperforms all existing results on all three KITTI object classes.


Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation

Neural Information Processing Systems

Holistic 3D indoor scene understanding refers to jointly recovering the i) object bounding boxes, ii) room layout, and iii) camera pose, all in 3D. The existing methods either are ineffective or only tackle the problem partially. In this paper, we propose an end-to-end model that simultaneously solves all three tasks in real-time given only a single RGB image. The essence of the proposed method is to improve the prediction by i) parametrizing the targets (e.g., 3D boxes) instead of directly estimating the targets, and ii) cooperative training across different modules in contrast to training these modules individually. Specifically, we parametrize the 3D object bounding boxes by the predictions from several modules, i.e., 3D camera pose and object attributes.


[FREE]Learn Python and Artificial Intelligence (AI) Coding Tools - Tricksinfo

#artificialintelligence

Python is a very popular multi-paradigm programming language. Object-oriented programming and structured programming are fully supported in this, and many of its features support functional programming and aspect-oriented programming – for that matter. This easy-to-understand course aims to teach everyone the basics of Python Language, learning outcomes, benefits of learning Python, advantages of Python etc. You will learn where to use Python Language and know about who would actually use it in their daily office lives. You will also learn the comparative parameters of python with other programming languages, in the world – with a highlight on popularity and frameworks of Python.


5 Ways Julia Is Better Than Python

#artificialintelligence

Julia is a multi-paradigm, primarily functional programming language that was created for machine-learning and statistical programming. Python is another multi-paradigm programming language that is used for machine-learning, though generally Python is considered to be object-oriented. Julia, on the other hand, is more based on the functional paradigm. Though Julia certainly isn't as popular as Python, there are some huge benefits to using Julia for Data Science that make it a better choice in a lot of situations that Python. It's hard to talk about Julia without talking about speed.


Python (OOP) : Master Python OOP From Scratch with Projects.

#artificialintelligence

Python Programming Basics and Python Object Oriented Programming Guide for Python Programmers & Python Coders in a simple and easy way with Examples, quizzes, Resources & Python Projects to master Python from zero to hero. First Python Project (CCG) to master what you will learn. Second Python Project X-O Game Classes to master what you will learn. How to use python IDLE. How to use Jupyter notebook(I python).


SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition

arXiv.org Machine Learning

The ability to decompose complex multi-object scenes into meaningful abstractions like objects is fundamental to achieve higher-level cognition. Previous approaches for unsupervised object-oriented scene representation learning are either based on spatial-attention or scene-mixture approaches and limited in scalability which is a main obstacle towards modeling real-world scenes. In this paper, we propose a generative latent variable model, called SPACE, that provides a unified probabilistic modeling framework that combines the best of spatial-attention and scene-mixture approaches. SPACE can explicitly provide factorized object representations for foreground objects while also decomposing background segments of complex morphology. Previous models are good at either of these, but not both. SPACE also resolves the scalability problems of previous methods by incorporating parallel spatial-attention and thus is applicable to scenes with a large number of objects without performance degradations. We show through experiments on Atari and 3D-Rooms that SPACE achieves the above properties consistently in comparison to SPAIR, IODINE, and GENESIS. Results of our experiments can be found on our project website: https://sites.google.com/view/space-project-page