Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems
Ibrahimi, Morteza, Javanmard, Adel, Roy, Benjamin V.
We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, an asymptotic regret bound of $\tilde{O}(\sqrt{T})$ was shown for $T \gg p$, where $p$ is the dimension of the state space. In this work we consider the case where the matrices describing the dynamics of the LQ system are sparse and their dimensions are large. We present an adaptive control scheme that, for $p \gg 1$ and $T \gg \mathrm{polylog}(p)$, achieves a regret bound of $\tilde{O}(p \sqrt{T})$. In particular, our algorithm has an average cost of $(1+\epsilon)$ times the optimum cost after $T = \mathrm{polylog}(p)\, O(1/\epsilon^2)$. This is in contrast to previous work on dense dynamics, where the algorithm requires $\Omega(p)$ samples before it can estimate the unknown dynamics with any significant accuracy. We believe our result has prominent applications in the emerging area of computational advertising, in particular targeted online advertising and advertising in social networks.
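To make the sparsity idea concrete, the following is a minimal sketch of certainty-equivalent control built on an L1-regularized (Lasso) estimate of sparse dynamics. The estimator, penalty level, and certainty-equivalent controller here are illustrative assumptions; the paper's actual scheme and its regret analysis are not reproduced.

```python
# Hedged sketch: L1-regularized regression for sparse LQ dynamics, followed by
# a certainty-equivalent LQR controller. Illustrative assumptions only.
import numpy as np
from sklearn.linear_model import Lasso
from scipy.linalg import solve_discrete_are

def estimate_sparse_dynamics(X, U, X_next, alpha=0.1):
    """L1-regularized least squares estimate of [A B] from transitions
    x_{t+1} = A x_t + B u_t + w_t, exploiting sparsity of the true dynamics."""
    Z = np.hstack([X, U])                      # regressors: states and controls
    coef = np.zeros((X_next.shape[1], Z.shape[1]))
    for i in range(X_next.shape[1]):           # one sparse regression per state coordinate
        model = Lasso(alpha=alpha, fit_intercept=False)
        model.fit(Z, X_next[:, i])
        coef[i] = model.coef_
    p = X.shape[1]
    return coef[:, :p], coef[:, p:]            # estimated A, B

def lqr_gain(A, B, Q, R):
    """Certainty-equivalent LQR gain from the estimated dynamics; u = -K x."""
    P = solve_discrete_are(A, B, Q, R)
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
```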
Directed Regression
Kao, Yi-hao, Roy, Benjamin V., Yan, Xiang
When used to guide decisions, linear regression analysis typically involves estimation of regression coefficients via ordinary least squares and their subsequent use to make decisions. When there are multiple response variables and features do not perfectly capture their relationships, it is beneficial to account for the decision objective when computing regression coefficients. Empirical optimization does so but sacrifices performance when features are well-chosen or training data are insufficient. We propose directed regression, an efficient algorithm that combines merits of ordinary least squares and empirical optimization. We demonstrate through a computational study that directed regression can generate significant performance gains over either alternative. We also develop a theory that motivates the algorithm.
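As a rough illustration of the combination idea, here is a hedged sketch in which coefficients interpolate between an OLS fit and a decision-driven (empirical optimization) fit, with the mixing weight chosen by decision loss on held-out data. The interpolation rule and the `empirical_opt_fit` and `decision_loss` arguments are stand-in assumptions, not necessarily the paper's exact algorithm.

```python
# Hedged illustration of combining a prediction-driven fit (OLS) with a
# decision-driven fit (empirical optimization). Simplified stand-in only.
import numpy as np

def ols_fit(Phi, Y):
    """Ordinary least squares: minimize prediction error ||Phi r - Y||_F^2."""
    return np.linalg.lstsq(Phi, Y, rcond=None)[0]

def directed_fit(Phi, Y, empirical_opt_fit, decision_loss, lambdas, Phi_val, Y_val):
    """Interpolate between OLS and empirical-optimization coefficients,
    choosing the mixing weight by decision loss on held-out data."""
    r_ols = ols_fit(Phi, Y)
    r_eo = empirical_opt_fit(Phi, Y)        # user-supplied decision-driven fit
    best = None
    for lam in lambdas:
        r = (1.0 - lam) * r_ols + lam * r_eo
        loss = decision_loss(r, Phi_val, Y_val)
        if best is None or loss < best[0]:
            best = (loss, r)
    return best[1]
```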
Consensus Propagation
Roy, Benjamin V., Moallemi, Ciamac C.
We propose consensus propagation, an asynchronous distributed protocol for averaging numbers across a network. We establish convergence, characterize the convergence rate for regular graphs, and demonstrate that the protocol exhibits better scaling properties than pairwise averaging, an alternative that has received much recent attention. Consensus propagation can be viewed as a special case of belief propagation, and our results contribute to the belief propagation literature. In particular, beyond singly-connected graphs, there are very few classes of relevant problems for which belief propagation is known to converge.
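The following is a synchronous sketch of consensus-propagation-style message passing for averaging node observations over an undirected graph, based on our reading of the protocol for unweighted edges with a parameter beta; the paper's asynchronous formulation and exact parameterization may differ.

```python
# Hedged synchronous sketch of consensus-propagation-style message passing.
# Each directed edge (i, j) carries a precision-like message K and a
# mean-like message mu; node estimates approach the network average.
import numpy as np

def consensus_propagation(adj, y, beta=10.0, iters=100):
    n = len(y)
    K = {(i, j): 0.0 for i in range(n) for j in adj[i]}   # precision-like messages
    mu = {(i, j): 0.0 for i in range(n) for j in adj[i]}  # mean-like messages
    for _ in range(iters):
        K_new, mu_new = {}, {}
        for i in range(n):
            for j in adj[i]:
                others = [u for u in adj[i] if u != j]
                k_sum = 1.0 + sum(K[(u, i)] for u in others)
                m_sum = y[i] + sum(K[(u, i)] * mu[(u, i)] for u in others)
                K_new[(i, j)] = k_sum / (1.0 + k_sum / beta)
                mu_new[(i, j)] = m_sum / k_sum
        K, mu = K_new, mu_new
    # each node combines its own observation with incoming messages
    return np.array([
        (y[i] + sum(K[(u, i)] * mu[(u, i)] for u in adj[i]))
        / (1.0 + sum(K[(u, i)] for u in adj[i]))
        for i in range(n)
    ])

# example: averaging over a ring of 6 nodes; estimates approach mean(y)
# as beta and the number of iterations grow
adj = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
print(consensus_propagation(adj, np.arange(6.0)))
```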
TD(0) Leads to Better Policies than Approximate Value Iteration
Roy, Benjamin V.
We consider approximate value iteration with a parameterized approximator in which the state space is partitioned and the optimal cost-to-go function over each partition is approximated by a constant. We establish performance loss bounds for policies derived from approximations associated with fixed points. These bounds identify benefits to having projection weights equal to the invariant distribution of the resulting policy. Such projection weighting leads to the same fixed points as TD(0). Our analysis also leads to the first performance loss bound for approximate value iteration with an average cost objective.
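To make the approximation architecture concrete, here is a minimal sketch of TD(0) with a piecewise-constant (state-aggregation) approximator; because states are sampled on-policy, updates are implicitly weighted by the invariant distribution of the evaluated policy. The toy problem data are assumptions, not from the paper.

```python
# Hedged sketch: TD(0) with state aggregation (one learned constant per
# partition), updated from transitions sampled under the evaluated policy.
import numpy as np

def td0_aggregation(sample_transition, partition_of, n_partitions,
                    gamma=0.95, steps=10000, lr=0.05, x0=0):
    theta = np.zeros(n_partitions)          # one value per partition
    x = x0
    for _ in range(steps):
        x_next, cost = sample_transition(x)
        i, j = partition_of(x), partition_of(x_next)
        td_error = cost + gamma * theta[j] - theta[i]
        theta[i] += lr * td_error           # on-policy sampling weights updates
        x = x_next                          # by the invariant distribution
    return theta

# toy usage: a 10-state random walk with cost equal to the state index,
# aggregated into 2 partitions
rng = np.random.default_rng(0)
step = lambda x: (max(0, min(9, x + rng.choice([-1, 1]))), float(x))
print(td0_aggregation(step, lambda x: x // 5, 2))
```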
Solitaire: Man Versus Machine
Yan, Xiang, Diaconis, Persi, Rusmevichientong, Paat, Roy, Benjamin V.
A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees
Farias, Daniela D., Roy, Benjamin V.
We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of pre-selected basis functions. The algorithm carries out a form of cost shaping and minimizes a version of Bellman error. We establish an error bound that scales gracefully with the number of states without imposing the (strong) Lyapunov condition required by its counterpart in [6]. We propose a path-following method that automates selection of important algorithm parameters which represent counterparts to the "state-relevance weights" studied in [6].
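The paper's algorithm solves a cost-shaping linear program for the optimality form of the Bellman error; as a simpler companion illustration, here is a fixed-policy, weighted least-squares version of the average-cost Bellman residual over a linear architecture, included only to make the minimized quantity concrete. The setup is an assumption, not the paper's LP.

```python
# Hedged illustration: average-cost Bellman residual for a FIXED policy,
# minimized in least squares over basis weights. Not the paper's cost-shaping LP.
import numpy as np

def fixed_policy_bellman_residual_fit(P_mu, g_mu, Phi, weights):
    """Minimize sum_x w(x) * (lam + (Phi r)(x) - g_mu(x) - (P_mu Phi r)(x))^2
    over (lam, r), where Phi r approximates the differential value function."""
    n, k = Phi.shape
    # residual = A @ [lam, r] - g_mu  with  A = [1, (I - P_mu) Phi]
    A = np.hstack([np.ones((n, 1)), (np.eye(n) - P_mu) @ Phi])
    W = np.sqrt(weights)[:, None]
    sol, *_ = np.linalg.lstsq(W * A, W[:, 0] * g_mu, rcond=None)
    return sol[0], sol[1:]            # estimated average cost and basis weights
```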
Distributed Optimization in Adaptive Networks
Moallemi, Ciamac C., Roy, Benjamin V.
We develop a protocol for optimizing dynamic behavior of a network of simple electronic components, such as a sensor network, an ad hoc network of mobile devices, or a network of communication switches. This protocol requires only local communication and simple computations which are distributed among devices. The protocol is scalable to large networks. As a motivating example, we discuss a problem involving optimization of power consumption, delay, and buffer overflow in a sensor network. Our approach builds on policy gradient methods for optimization of Markov decision processes. The protocol can be viewed as an extension of policy gradient methods to a context involving a team of agents optimizing aggregate performance through asynchronous distributed communication and computation. We establish that the dynamics of the protocol approximate the solution to an ordinary differential equation that follows the gradient of the performance objective.
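Below is a minimal single-agent sketch of the policy-gradient building block (a GPOMDP-style eligibility-trace estimator for average reward); the paper's contribution, a distributed asynchronous team version of such updates with only local communication, is not shown. The function arguments are placeholders.

```python
# Hedged sketch of the single-agent policy-gradient building block: a
# likelihood-ratio estimator with an eligibility trace, for average reward.
import numpy as np

def policy_gradient_step(theta, env_step, log_policy_grad, sample_action,
                         x0, horizon=1000, trace_decay=0.99, lr=1e-3):
    """One gradient ascent step on average reward using a likelihood-ratio
    estimator with eligibility trace z (GPOMDP-style)."""
    x = x0
    z = np.zeros_like(theta)            # eligibility trace
    grad = np.zeros_like(theta)
    for t in range(1, horizon + 1):
        a = sample_action(theta, x)
        x_next, reward = env_step(x, a)
        z = trace_decay * z + log_policy_grad(theta, x, a)
        grad += (reward * z - grad) / t  # running average of reward * trace
        x = x_next
    return theta + lr * grad
```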
Approximate Linear Programming for Average-Cost Dynamic Programming
Roy, Benjamin V., Farias, Daniela D.
This paper extends our earlier analysis of approximate linear programming as an approach to approximating the cost-to-go function in a discounted-cost dynamic program [6]. In this paper, we consider the average-cost criterion and a version of approximate linear programming that generates approximations to the optimal average cost and differential cost function. We demonstrate that a naive version of approximate linear programming prioritizes approximation of the optimal average cost and that this may not be well-aligned with the objective of deriving a policy with low average cost; for that, the algorithm should instead aim at producing a good approximation of the differential cost function. We propose a two-phase variant of approximate linear programming that allows for external control of the relative accuracy of the approximation of the differential cost function over different portions of the state space via state-relevance weights. Performance bounds suggest that the new algorithm is compatible with the objective of optimizing performance and provide guidance on appropriate choices for state-relevance weights.
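For context, here is a hedged sketch of the discounted-cost approximate LP that this paper extends: maximize state-relevance-weighted values over basis weights, subject to Bellman inequality constraints. The average-cost two-phase variant described above is not reproduced; the problem data are toy assumptions.

```python
# Hedged sketch of the discounted-cost approximate LP:
#   maximize c^T Phi r  subject to  Phi r <= T(Phi r)  componentwise.
import numpy as np
from scipy.optimize import linprog

def approximate_lp(P, g, Phi, c, alpha=0.95):
    """P[a]: (n, n) transition matrix, g[a]: (n,) cost vector per action,
    Phi: (n, k) basis, c: (n,) state-relevance weights."""
    n, k = Phi.shape
    # maximize c^T Phi r  <=>  minimize -(Phi^T c)^T r
    objective = -(Phi.T @ c)
    # constraints: (Phi r)(x) <= g_a(x) + alpha * (P_a Phi r)(x) for all x, a
    A_ub = np.vstack([(np.eye(n) - alpha * P[a]) @ Phi for a in range(len(P))])
    b_ub = np.concatenate([g[a] for a in range(len(P))])
    res = linprog(objective, A_ub=A_ub, b_ub=b_ub, bounds=[(None, None)] * k)
    return res.x  # weights r; Phi @ r approximates the optimal cost-to-go
```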