AITopics | Representation Of Examples

Representation Of Examples

Fast, smooth and adaptive regression in metric spaces

Neural Information Processing Systems, Feb-15-2020, 02:27:12 GMT

It was recently shown that certain nonparametric regressors can escape the curse of dimensionality in the sense that their convergence rates adapt to the intrinsic dimension of data (\cite{BL:65, SK:77}). We prove some stronger results in more general settings. In particular, we consider a regressor which, by combining aspects of both tree-based regression and kernel regression, operates on a general metric space, yields a smooth function, and evaluates in time $O(\log n)$. We derive a tight convergence rate of the form $n^{-2/(2+d)}$ where $d$ is the Assouad dimension of the input space. Papers published at the Neural Information Processing Systems Conference.

adaptive regression, dimension, metric space, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.68)

Robust Nonparametric Regression with Metric-Space Valued Output

Hein, Matthias

Neural Information Processing Systems, Feb-15-2020, 01:56:23 GMT

Motivated by recent developments in manifold-valued regression, we propose a family of nonparametric kernel-smoothing estimators with metric-space valued output, including a robust median-type estimator and the classical Fréchet mean. Depending on the choice of the output space and the chosen metric, the estimator reduces to partially well-known procedures for multi-class classification, multivariate regression in Euclidean space, regression with manifold-valued output and even some cases of structured output learning. In this paper we focus on the case of regression with manifold-valued input and output. We show pointwise and Bayes consistency for all estimators in the family for the case of manifold-valued output and illustrate the robustness properties of the estimator with experiments.
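As a rough illustration of the kind of estimator described above, here is a minimal sketch that weights training outputs by a kernel on the input distance and then picks the output minimizing the weighted sum of output distances raised to a power (2 for a Fréchet-mean flavor, 1 for a median flavor). The function name, the Gaussian kernel, the bandwidth `h`, and the restriction of the minimization to the observed outputs are all assumptions made for this example, not details taken from the paper.

```python
import math

def kernel_frechet_estimate(x, xs, ys, d_in, d_out, h=1.0, power=2):
    """Kernel-weighted Frechet-type estimate at query x (hypothetical helper).

    d_in and d_out are distance functions on the input and output spaces.
    The minimizer is searched only among the observed outputs ys, which is
    a simplifying assumption that avoids optimizing over the output space.
    """
    # Gaussian kernel weights based on input-space distance (an assumed choice).
    weights = [math.exp(-(d_in(x, xi) / h) ** 2) for xi in xs]

    def cost(q):
        # Weighted sum of output distances raised to `power`.
        return sum(w * d_out(q, yi) ** power for w, yi in zip(weights, ys))

    return min(ys, key=cost)
```

With real-valued outputs and `d_out` set to the absolute difference, `power=1` behaves like a weighted median and is less sensitive to a single outlying response than `power=2`.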

estimator, metric-space, robust nonparametric regression

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.67)

Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells

Mai, Gengchen, Janowicz, Krzysztof, Yan, Bo, Zhu, Rui, Cai, Ling, Lao, Ni

arXiv.org Artificial Intelligence, Feb-15-2020

Unsupervised text encoding models have recently fueled substantial progress in NLP. The key idea is to use neural networks to convert words in texts to vector space representations based on word positions in a sentence and their contexts, which are suitable for end-to-end training of downstream tasks. We see a strikingly similar situation in spatial analysis, which focuses on incorporating both absolute positions and spatial contexts of geographic objects such as POIs into models. A general-purpose representation model for space is valuable for a multitude of tasks. However, no such general model exists to date beyond simply applying discretization or feed-forward nets to coordinates, and little effort has been put into jointly modeling distributions with vastly different characteristics, which commonly emerge from GIS data. Meanwhile, Nobel Prize-winning neuroscience research shows that grid cells in mammals provide a multi-scale periodic representation that functions as a metric for location encoding and is critical for recognizing places and for path integration. Therefore, we propose a representation learning model called Space2Vec to encode the absolute positions and spatial relationships of places. We conduct experiments on two real-world geographic datasets for two different tasks: 1) predicting types of POIs given their positions and context, 2) image classification leveraging their geo-locations. Results show that because of its multi-scale representations, Space2Vec outperforms well-established ML approaches such as RBF kernels, multi-layer feed-forward nets, and tile embedding approaches for location modeling and image classification tasks. Detailed analysis shows that each baseline handles distributions well at one scale at most and performs poorly at other scales. In contrast, Space2Vec's multi-scale representation can handle distributions at different scales.
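To make the idea of a multi-scale periodic location encoding concrete, the following is a small sketch that maps a 2-D coordinate to sines and cosines at geometrically spaced wavelengths. It is only in the spirit of the abstract: the function name, the default wavelengths, and the per-axis layout are assumptions for illustration, and it is not the Space2Vec model itself.

```python
import math

def multiscale_encode(x, y, num_scales=4, min_wavelength=1.0, max_wavelength=1000.0):
    """Encode a 2-D location with periodic features at several scales.

    All parameter names and defaults are hypothetical. Returns a flat list
    of length 4 * num_scales: (sin, cos) for each axis at each scale.
    """
    features = []
    for s in range(num_scales):
        # Wavelengths grow geometrically from min_wavelength to max_wavelength.
        frac = s / max(num_scales - 1, 1)
        wavelength = min_wavelength * (max_wavelength / min_wavelength) ** frac
        for coord in (x, y):
            features.append(math.sin(coord / wavelength))
            features.append(math.cos(coord / wavelength))
    return features
```

Short wavelengths separate nearby points, while long wavelengths vary slowly and capture coarse position, which is the intuition behind handling distributions at different scales.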

mac aodha, poi type, representation, (15 more...)

arXiv.org Artificial Intelligence

2003.00824

Country:

North America > United States > California > Santa Barbara County > Santa Barbara (0.14)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.34)

Learning to Prune in Metric and Non-Metric Spaces

Boytsov, Leonid, Naidan, Bilegsaikhan

Neural Information Processing Systems, Feb-14-2020, 17:26:44 GMT

Our focus is on approximate nearest neighbor retrieval in metric and non-metric spaces. We employ a VP-tree and explore two simple yet effective learning-to-prune approaches: density estimation through sampling and "stretching" of the triangle inequality. Both methods are evaluated using data sets with metric (Euclidean) and non-metric (KL-divergence and Itakura-Saito) distance functions. Conditions on spaces where the VP-tree is applicable are discussed. The VP-tree with a learned pruner is compared against the recently proposed state-of-the-art approaches: the bbtree, the multi-probe locality sensitive hashing (LSH), and permutation methods.
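The following is a minimal sketch of a VP-tree with a pruning rule that can be "stretched" by a factor `alpha`. With `alpha=1.0` the rule is the ordinary triangle-inequality test and the search is exact in a metric space; larger values prune more aggressively and make the search approximate. The single scalar `alpha`, the function names, and the tuple-based tree layout are assumptions for this example; the abstract does not give the exact form of the learned pruner.

```python
import math
import random

def build(points, dist, rng):
    """Build a VP-tree as nested tuples (pivot, radius, inside, outside)."""
    points = list(points)
    if not points:
        return None
    pivot = points.pop(rng.randrange(len(points)))
    if not points:
        return (pivot, 0.0, None, None)
    dists = sorted(dist(pivot, p) for p in points)
    radius = dists[len(dists) // 2]  # median distance to the pivot
    inside = [p for p in points if dist(pivot, p) <= radius]
    outside = [p for p in points if dist(pivot, p) > radius]
    return (pivot, radius, build(inside, dist, rng), build(outside, dist, rng))

def nearest(node, q, dist, alpha=1.0, best=(float("inf"), None)):
    """Return (distance, point) of the nearest neighbor found for query q.

    alpha is a hypothetical stretch factor applied to the pruning test.
    """
    if node is None:
        return best
    pivot, radius, inside, outside = node
    d = dist(q, pivot)
    if d < best[0]:
        best = (d, pivot)
    # Visit the side of the partition that contains the query first.
    first, second = (inside, outside) if d <= radius else (outside, inside)
    best = nearest(first, q, dist, alpha, best)
    # Visit the other side only if the (stretched) gap to the partition
    # boundary does not already exceed the best distance found so far.
    if alpha * abs(d - radius) <= best[0]:
        best = nearest(second, q, dist, alpha, best)
    return best
```

For example, `nearest(build(points, math.dist, random.Random(0)), query, math.dist)` runs an exact search over Euclidean tuples, and passing `alpha=2.0` trades accuracy for fewer visited nodes.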

learning, metric and non-metric space, prune, (1 more...)

Neural Information Processing Systems

Genre: Research Report > Promising Solution (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.67)

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Qi, Charles Ruizhongtai, Yi, Li, Su, Hao, Guibas, Leonidas J.

Neural Information Processing Systems, Feb-14-2020, 16:45:53 GMT

Few prior works study deep learning on point sets. PointNet is a pioneer in this direction. However, by design PointNet does not capture local structures induced by the metric space points live in, limiting its ability to recognize fine-grained patterns and generalizability to complex scenes. In this work, we introduce a hierarchical neural network that applies PointNet recursively on a nested partitioning of the input point set. By exploiting metric space distances, our network is able to learn local features with increasing contextual scales.

deep hierarchical feature learning, metric space, pointnet, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.65)

Linear Relaxations for Finding Diverse Elements in Metric Spaces

Bhaskara, Aditya, Ghadiri, Mehrdad, Mirrokni, Vahab, Svensson, Ola

Neural Information Processing Systems, Feb-14-2020, 15:27:46 GMT

Choosing a diverse subset of a large collection of points in a metric space is a fundamental problem, with applications in feature selection, recommender systems, web search, data summarization, etc. Various notions of diversity have been proposed, tailored to different applications. The general algorithmic goal is to find a subset of points that maximizes diversity, while obeying a cardinality (or more generally, matroid) constraint. The goal of this paper is to develop a novel linear programming (LP) framework that allows us to design approximation algorithms for such problems. We study an objective known as sum-min diversity, which is known to be effective in many applications, and give the first constant factor approximation algorithm. Our LP framework allows us to easily incorporate additional constraints, as well as secondary objectives.
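For readers unfamiliar with the objective, here is a small sketch of sum-min diversity as it is commonly defined: the sum, over each chosen point, of its distance to the nearest other chosen point. The exhaustive search below is only a reference for tiny inputs under a cardinality constraint; it is not the LP-based approximation algorithm from the paper, and the function names are made up for this example.

```python
from itertools import combinations

def sum_min_diversity(subset, dist):
    """Sum over points in subset of the distance to the nearest other point."""
    if len(subset) < 2:
        return 0.0
    return sum(
        min(dist(u, v) for j, v in enumerate(subset) if j != i)
        for i, u in enumerate(subset)
    )

def best_subset_bruteforce(points, k, dist):
    """Try every size-k subset and keep the most diverse (exponential time)."""
    return max(combinations(points, k), key=lambda s: sum_min_diversity(s, dist))
```

The brute-force search grows combinatorially with the number of points, which is why approximation algorithms such as the one the abstract describes are of interest.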

algorithm, linear relaxation, objective, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.65)

Improved Error Bounds for Tree Representations of Metric Spaces

Chowdhury, Samir, Mémoli, Facundo, Smith, Zane T.

Neural Information Processing Systems, Feb-14-2020, 12:27:46 GMT

Estimating optimal phylogenetic trees or hierarchical clustering trees from metric data is an important problem in evolutionary biology and data analysis. Intuitively, the goodness-of-fit of a metric space to a tree depends on its inherent treeness, as well as other metric properties such as intrinsic dimension. Existing algorithms for embedding metric spaces into tree metrics provide distortion bounds depending on cardinality. Because cardinality is a simple property of any set, we argue that such bounds do not fully capture the rich structure endowed by the metric. We consider an embedding of a metric space into a tree proposed by Gromov.

improved error bound, metric space, tree representation, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (1.00)

Active Nearest-Neighbor Learning in Metric Spaces

Kontorovich, Aryeh, Sabato, Sivan, Urner, Ruth

Neural Information Processing Systems, Feb-14-2020, 07:00:06 GMT

We propose a pool-based non-parametric active learning algorithm for general metric spaces, called MArgin Regularized Metric Active Nearest Neighbor (MARMANN), which outputs a nearest-neighbor classifier. We give prediction error guarantees that depend on the noisy-margin properties of the input sample, and are competitive with those obtained by previously proposed passive learners. We prove that the label complexity of MARMANN is significantly lower than that of any passive learner with similar error guarantees. Our algorithm is based on a generalized sample compression scheme and a new label-efficient active model-selection procedure.

active nearest-neighbor learning, error guarantee, metric space, (3 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.78)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.69)

The exponentially weighted average forecaster in geodesic spaces of non-positive curvature

Paris, Quentin

arXiv.org Machine Learning, Feb-3-2020

The problem of prediction with expert advice [Cesa-Bianchi and Lugosi, 2006] is a by now standard model of online learning. Traditionally studied for outcomes taking values in a vector space, less seems to be known when the outcome space is a more general metric space. This paper partly addresses the problem by focusing on the case of NPC spaces, i.e., geodesic metric spaces with non-positive curvature in the sense of Alexandrov. The class of NPC spaces includes many metric spaces of particular interest in the data sciences. Apart from Hilbert spaces, interesting examples are hyperbolic spaces [Nickel and Kiela, 2017], the space of real symmetric positive-definite matrices with Log-Euclidean [Arsigny et al., 2007] or Log-Cholesky [Lin, 2019] Riemannian metrics and more generally all complete and simply connected Riemannian manifolds with non-positive sectional curvature.
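As background for the setting above, here is a minimal sketch of the classical exponentially weighted average forecaster for real-valued outcomes, i.e., the vector-space case that the paper generalizes. The geodesic (NPC-space) version is not shown. The function name, the learning rate `eta`, and the default squared loss are assumptions for this example.

```python
import math

def ewa_forecast(expert_preds, outcomes, eta=1.0, loss=lambda p, y: (p - y) ** 2):
    """Run the classical exponentially weighted average forecaster.

    expert_preds[t][i] is expert i's prediction at round t, and outcomes[t]
    is the value revealed after the forecast. Returns the list of forecasts.
    """
    n = len(expert_preds[0])
    cum_loss = [0.0] * n
    forecasts = []
    for preds, y in zip(expert_preds, outcomes):
        # Weights decay exponentially in cumulative loss; subtracting the
        # minimum keeps the exponentials numerically stable.
        m = min(cum_loss)
        w = [math.exp(-eta * (L - m)) for L in cum_loss]
        total = sum(w)
        forecasts.append(sum(wi * p for wi, p in zip(w, preds)) / total)
        # Update each expert's cumulative loss once the outcome is revealed.
        cum_loss = [L + loss(p, y) for L, p in zip(cum_loss, preds)]
    return forecasts
```

In a vector space the forecast is a weighted average of expert predictions; in a general metric space an average is not available directly, which is where a barycenter-type construction would be needed.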

barycenter, forecaster, npc space, (14 more...)

arXiv.org Machine Learning

2002.00852

Country:

North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.76)

Simple and Effective Graph Autoencoders with One-Hop Linear Models

Salha, Guillaume, Hennequin, Romain, Vazirgiannis, Michalis

arXiv.org Machine Learning, Jan-21-2020

Graph autoencoders (AE) and variational autoencoders (VAE) recently emerged as powerful node embedding methods, with promising performances on challenging tasks such as link prediction and node clustering. Graph AE, VAE and most of their extensions rely on graph convolutional network (GCN) encoders to learn vector space representations of nodes. In this paper, we propose to replace the GCN encoder with a significantly simpler linear model w.r.t. the direct neighborhood (one-hop) adjacency matrix of the graph. For the two aforementioned tasks, we show that this approach consistently reaches competitive performances w.r.t. GCN-based models for numerous real-world graphs, including all benchmark datasets commonly used to evaluate graph AE and VAE. We question the relevance of repeatedly using these datasets to compare complex graph AE and VAE. We also emphasize the effectiveness of the proposed encoding scheme, which appears as a simpler and faster alternative to GCN encoders for many real-world applications.
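A one-hop linear encoder of the kind described can be sketched in a few lines: node embeddings are a single linear map applied to a normalized adjacency matrix, and links are scored with a sigmoid of inner products. The symmetric normalization with self-loops, the inner-product decoder, and all function names are assumptions for this illustration; the weight matrix `W` would be learned in practice and is simply passed in here.

```python
import math

def normalize_adjacency(A):
    """Return D^(-1/2) (A + I) D^(-1/2) for a dense adjacency matrix A (assumed form)."""
    n = len(A)
    A_loop = [[A[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in A_loop]
    return [[A_loop[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def matmul(X, Y):
    """Plain-Python dense matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def linear_encode(A, W):
    """One-hop linear encoder: embeddings Z = A_norm @ W, with no hidden layers."""
    return matmul(normalize_adjacency(A), W)

def decode(Z):
    """Inner-product decoder: link score sigmoid(z_i . z_j) for each node pair."""
    def sigmoid(t):
        return 1.0 / (1.0 + math.exp(-t))
    return [[sigmoid(sum(a * b for a, b in zip(zi, zj))) for zj in Z] for zi in Z]
```

Because the encoder is a single matrix product, each node's embedding depends only on itself and its direct neighbors, in contrast to a multi-layer GCN that aggregates over larger neighborhoods.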

graph, graph ae, kipf and welling, (15 more...)

arXiv.org Machine Learning

2001.07614

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Greece > Attica > Athens (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.34)
