AITopics | Mania, Horia

Collaborating Authors

Mania, Horia

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Time varying regression with hidden linear dynamics

Jadbabaie, Ali, Mania, Horia, Shah, Devavrat, Sra, Suvrit

arXiv.org Machine LearningDec-29-2021

The distribution of labels given the covariates changes over time in a variety of applications of regression. Some example domains where such problems arise include economics, marketing, fashion, and supply chain optimization, where market properties evolve over time. Motivated by such problems, we revisit a model for time-varying linear regression that assumes the unknown parameters evolve according to a linear dynamical system. One way to account for distribution change in linear regression is to assume that the unknown model parameters change slowly with time [2, 15, 37]. While this assumption simplifies the problem and makes it tractable, it misses on exploiting additional structure available and it also fails to model periodicity (e.g., due to seasonality) present in some problems. As an alternative, we are interested in a dynamic model previously studied by Chow [7], Carraro [5], and Shumway et al. [26].

artificial intelligence, banking & finance, machine learning, (19 more...)

arXiv.org Machine Learning

2112.14862

Country: North America > United States > Massachusetts (0.14)

Genre: Research Report (0.82)

Industry: Banking & Finance > Economy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Why do classifier accuracies show linear trends under distribution shift?

Mania, Horia, Sra, Suvrit

arXiv.org Machine LearningDec-31-2020

Several recent studies observed that when classification models are evaluated on two different data distributions, the models' accuracies on one distribution are approximately a linear function of their accuracies on another distribution. We offer an explanation for these observations based on two assumptions that can be assessed empirically: (1) certain events have similar probabilities under the two distributions; (2) the probability that a lower accuracy model correctly classifies a data point sampled from one distribution when a higher accuracy model classifies it incorrectly is small.

artificial intelligence, machine learning, probability, (17 more...)

arXiv.org Machine Learning

2012.15483

Country: North America > United States > Massachusetts (0.14)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Bandit Learning in Decentralized Matching Markets

Liu, Lydia T., Ruan, Feng, Mania, Horia, Jordan, Michael I.

arXiv.org Machine LearningDec-14-2020

A fundamental question at the intersection of learning theory and game theory is as follows: how should individually rational agents act when they have to learn about the consequences of their actions in the same uncertain environment? The emerging research area of multiplayer learning-- which explores this basic question and is motivated by a broad range of modern applications, from modeling competition among firms [Mansour et al., 2018, Aridor et al., 2019] to implementing protocols for wireless networks [Liu and Zhao, 2010, Cesa-Bianchi et al., 2016, Shahrampour et al., 2017]--has been of increasing interest. A particularly salient application is the online marketplace, which can often be modeled as a two-sided matching market with uncertainty. Examples include online labor markets (Upwork and TaskRabbit for freelancing, Handy for housecleaning), online crowdsourcing platforms (Amazon Mechanical Turk), and peer-to-peer platforms (Airbnb). The multi-armed bandit is a core learning problem in which a player is faced with a choice among K actions, each of which is associated with a reward distribution, and the goal is to learn which action has the highest reward, doing so as quickly as possible so as to be able to reap rewards even while the learning process is underway. To introduce a game-theoretic aspect into the bandit problem, it is natural to place the problem into the context of a two-way matching market, where the choices faced by the players are identified with the entities on the other side of the market, and where the need to realize a matching imposes economic constraints and incentives. Such a blend of bandit learning with two-sided matching markets was introduced by Liu et al. [2020], who formulated a problem in which the players and the arms form the two sides of the market, and each side has preferences over the other side.

algorithm 1, crowdsourcing, social media, (23 more...)

arXiv.org Machine Learning

2012.07348

Country:

Asia > Middle East > Jordan (0.14)
North America > United States > California (0.14)

Genre: Research Report (0.50)

Industry:

Banking & Finance (0.66)
Education (0.48)
Leisure & Entertainment > Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.87)
Information Technology > Communications > Social Media > Crowdsourcing (0.54)

Add feedback

Active Learning for Nonlinear System Identification with Guarantees

Mania, Horia, Jordan, Michael I., Recht, Benjamin

arXiv.org Machine LearningJun-18-2020

While the identification of nonlinear dynamical systems is a fundamental building block of model-based reinforcement learning and feedback control, its sample complexity is only understood for systems that either have discrete states and actions or for systems that can be identified from data generated by i.i.d. random inputs. Nonetheless, many interesting dynamical systems have continuous states and actions and can only be identified through a judicious choice of inputs. Motivated by practical settings, we study a class of nonlinear dynamical systems whose state transitions depend linearly on a known feature embedding of state-action pairs. To estimate such systems in finite time identification methods must explore all directions in feature space. We propose an active learning approach that achieves this by repeating three steps: trajectory planning, trajectory tracking, and re-estimation of the system from all available data. We show that our method estimates nonlinear dynamical systems at a parametric rate, similar to the statistical rate of standard linear regression.

artificial intelligence, identification, machine learning, (17 more...)

arXiv.org Machine Learning

2006.10277

Country: North America > United States > California (0.14)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Model Similarity Mitigates Test Set Overuse

Mania, Horia, Miller, John, Schmidt, Ludwig, Hardt, Moritz, Recht, Benjamin

Neural Information Processing SystemsMar-19-2020, 00:46:30 GMT

Excessive reuse of test data has become commonplace in today's machine learning workflows. Popular benchmarks, competitions, industrial scale tuning, among other applications, all involve test data reuse beyond guidance by statistical confidence bounds. Nonetheless, recent replication studies give evidence that popular benchmarks continue to support progress despite years of extensive reuse. We proffer a new explanation for the apparent longevity of test data: Many proposed models are similar in their predictions and we prove that this similarity mitigates overfitting. Specifically, we show empirically that models proposed for the ImageNet ILSVRC benchmark agree in their predictions well beyond what we can conclude from their accuracy levels alone.

artificial intelligence, machine learning, model similarity mitigate test, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.91)

Add feedback

Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator

Dean, Sarah, Mania, Horia, Matni, Nikolai, Recht, Benjamin, Tu, Stephen

Neural Information Processing SystemsFeb-14-2020, 14:12:15 GMT

We consider adaptive control of the Linear Quadratic Regulator (LQR), where an unknown linear system is controlled subject to quadratic costs. Leveraging recent developments in the estimation of linear systems and in robust controller synthesis, we present the first provably polynomial time algorithm that achieves sub-linear regret on this problem. We further study the interplay between regret minimization and parameter estimation by proving a lower bound on the expected regret in terms of the exploration schedule used by any algorithm. Finally, we conduct a numerical study comparing our robust adaptive algorithm to other methods from the adaptive LQR literature, and demonstrate the flexibility of our proposed method by extending it to a demand forecasting problem subject to state constraints. Papers published at the Neural Information Processing Systems Conference.

artificial intelligence, linear quadratic regulator, machine learning, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Competing Bandits in Matching Markets

Liu, Lydia T., Mania, Horia, Jordan, Michael I.

arXiv.org Machine LearningJun-12-2019

Stable matching, a classical model for two-sided markets, has long been studied with little consideration for how each side's preferences are learned. With the advent of massive online markets powered by data-driven matching platforms, it has become necessary to better understand the interplay between learning and market objectives. We propose a statistical learning model in which one side of the market does not have a priori knowledge about its preferences for the other side and is required to learn these from stochastic rewards. Our model extends the standard multi-armed bandits framework to multiple players, with the added feature that arms have preferences over players. We study both centralized and decentralized approaches to this problem and show surprising exploration-exploitation trade-offs compared to the single player multi-armed bandits setting.

agent, big data, upstream oil & gas, (21 more...)

arXiv.org Machine Learning

1906.05363

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.64)

Industry:

Education (0.46)
Energy > Oil & Gas > Upstream (0.34)
Health & Medicine (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.96)
Information Technology > Data Science > Data Mining > Big Data (0.87)

Add feedback

Model Similarity Mitigates Test Set Overuse

Mania, Horia, Miller, John, Schmidt, Ludwig, Hardt, Moritz, Recht, Benjamin

arXiv.org Machine LearningMay-29-2019

artificial intelligence, neural network, similarity, (19 more...)

arXiv.org Machine Learning

1905.1258

Country: North America > United States > California (0.14)

Genre: Research Report (0.85)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Certainty Equivalent Control of LQR is Efficient

Mania, Horia, Tu, Stephen, Recht, Benjamin

arXiv.org Machine LearningFeb-20-2019

One of the most straightforward methods for controlling a dynamical system with unknown transitions isbased on the certainty equivalence principle: a model of the system is fit by observing its time evolution, and a control policy is then designed by treating the fitted model as the truth [8]. Despite the simplicity of this method, it is challenging to guarantee its efficiency because small modeling errors may propagate to large, undesirable behaviors on long time horizons. As a result, most work on controlling systems with unknown dynamics has explicitly incorporated robustness against model uncertainty [11, 12, 20, 25, 35, 36]. In this work, we show that for the standard baseline of controlling an unknown linear dynamical system with a quadratic objective function, known as the Linear Quadratic Regulator (LQR), certainty equivalent control synthesis achieves better cost than prior methods that account for model uncertainty. In the case of offline control, where one collects some data and then designs a fixed control policy to be run on an infinite time horizon, we show that the gap between the performance of the certainty equivalent controller and the optimal control policy scales quadratically with the error in the model parameters.

artificial intelligence, controller, optimization problem, (15 more...)

arXiv.org Machine Learning

1902.07826

Country: North America > United States > California (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator

Dean, Sarah, Mania, Horia, Matni, Nikolai, Recht, Benjamin, Tu, Stephen

Neural Information Processing SystemsDec-31-2018

We consider adaptive control of the Linear Quadratic Regulator (LQR), where an unknown linear system is controlled subject to quadratic costs. Leveraging recent developments in the estimation of linear systems and in robust controller synthesis, we present the first provably polynomial time algorithm that provides high probability guarantees of sub-linear regret on this problem. We further study the interplay between regret minimization and parameter estimation by proving a lower bound on the expected regret in terms of the exploration schedule used by any algorithm. Finally, we conduct a numerical study comparing our robust adaptive algorithm to other methods from the adaptive LQR literature, and demonstrate the flexibility of our proposed method by extending it to a demand forecasting problem subject to state constraints.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.35)

Add feedback