Collaborating Authors

Littman, M. L.


Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions

arXiv.org Artificial Intelligence

Auctions are becoming an increasingly popular method for transacting business, especially over the Internet. This article presents a general approach to building autonomous bidding agents to bid in multiple simultaneous auctions for interacting goods. A core component of our approach learns a model of the empirical price dynamics based on past data and uses the model to analytically calculate, to the greatest extent possible, optimal bids. We introduce a new and general boosting-based algorithm for conditional density estimation problems of this kind, i.e., supervised learning problems in which the goal is to estimate the entire conditional distribution of the real-valued label. This approach is fully implemented as ATTac-2001, a top-scoring agent in the second Trading Agent Competition (TAC-01). We present experiments demonstrating the effectiveness of our boosting-based price predictor relative to several reasonable alternatives.
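
To make the decision-theoretic step concrete, here is a minimal Python sketch of choosing a bid against a learned price density. Everything in it is an illustrative assumption: the name best_bid, the single-good setting, and the rule that a bid wins whenever the closing price is at or below it and the winner pays the closing price. ATTac-2001's actual calculation covers multiple simultaneous, interacting auctions, so this is a sketch of the idea, not the agent's algorithm.

    import numpy as np

    def best_bid(price_samples, value, candidate_bids):
        # Expected profit of bid b under the sampled price density:
        # E[(value - price) * 1{price <= b}], assuming the winner pays
        # the closing price.  Pick the bid that maximizes it.
        best, best_profit = None, float("-inf")
        for b in candidate_bids:
            profit = np.mean(np.where(price_samples <= b,
                                      value - price_samples, 0.0))
            if profit > best_profit:
                best, best_profit = b, profit
        return best, best_profit

    # Toy usage: predicted prices centred near 100; the good is worth 120 to us.
    rng = np.random.default_rng(0)
    samples = rng.normal(100.0, 15.0, size=10_000)
    print(best_bid(samples, value=120.0, candidate_bids=range(50, 151, 5)))

Under this payment rule the sketch recovers the familiar second-price intuition: the best bid converges toward the bidder's value, since raising the bid only adds winning events with positive profit until the bid reaches the value.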


ATTac-2000: An Adaptive Autonomous Bidding Agent

arXiv.org Artificial Intelligence

The First Trading Agent Competition (TAC) was held from June 22nd to July 8th, 2000. TAC was designed to create a benchmark problem in the complex domain of e-marketplaces and to motivate researchers to apply unique approaches to a common task. This article describes ATTac-2000, the first-place finisher in TAC. ATTac-2000 uses a principled bidding strategy that includes several elements of adaptivity. In addition to its success at the competition, isolated empirical results are presented that indicate the robustness and effectiveness of ATTac-2000's adaptive strategy.



The Computational Complexity of Probabilistic Planning

Journal of Artificial Intelligence Research

We examine the computational complexity of testing and finding small plans in probabilistic planning domains with both flat and propositional representations. The complexity of plan evaluation and existence varies with the plan type sought; we examine totally ordered plans, acyclic plans, and looping plans, as well as partially ordered plans under three natural definitions of plan value. We show that problems of interest are complete for a variety of complexity classes: PL, P, NP, co-NP, PP, NP^PP, co-NP^PP, and PSPACE. In the process of proving that certain planning problems are complete for NP^PP, we introduce a new basic NP^PP-complete problem, E-MAJSAT, which generalizes the standard Boolean satisfiability problem to computations involving probabilistic quantities; our results suggest that the development of good heuristics for E-MAJSAT could be important for the creation of efficient algorithms for a wide variety of problems.
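
As a concrete illustration of E-MAJSAT, the brute-force Python sketch below asks whether some setting of the "choice" (exists) variables makes a Boolean formula true for more than a threshold fraction of uniformly random "chance" variable settings. The function name, the callable-formula encoding, and the explicit theta parameter are illustrative assumptions; the paper's decision problem uses a majority threshold, and of course no brute-force method is efficient for an NP^PP-complete problem.

    from itertools import product

    def e_majsat(formula, n_choice, n_chance, theta=0.5):
        # Is there a setting of the n_choice "exists" variables such that,
        # with the n_chance variables drawn uniformly at random, the formula
        # holds with probability strictly greater than theta?
        # formula: callable taking (choice_bits, chance_bits) -> bool.
        # Exponential in both variable counts -- illustration only.
        for xs in product((False, True), repeat=n_choice):
            sat = sum(formula(xs, ys)
                      for ys in product((False, True), repeat=n_chance))
            if sat / 2 ** n_chance > theta:
                return True, xs
        return False, None

    # Hypothetical formula: (x1 OR y1) AND (NOT x1 OR y2).  Either choice of
    # x1 leaves the formula true for exactly half of the chance settings, so
    # the answer is yes at theta = 0.4 but no at the majority threshold 0.5.
    f = lambda x, y: (x[0] or y[0]) and ((not x[0]) or y[1])
    print(e_majsat(f, n_choice=1, n_chance=2, theta=0.4))  # (True, (False,))
    print(e_majsat(f, n_choice=1, n_chance=2))             # (False, None)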


Reinforcement Learning: A Survey

Journal of Artificial Intelligence Research

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word "reinforcement." The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.
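
To ground two of the surveyed themes, the exploration/exploitation trade-off and learning from delayed reinforcement, here is a minimal tabular Q-learning sketch on a toy corridor environment. The Chain class, the gym-style reset/step interface, and all hyperparameter values are illustrative assumptions, not anything taken from the survey.

    import random

    class Chain:
        # Toy 5-state corridor: start at state 0, +1 reward only on reaching
        # state 4.  The reward is delayed until the end of the episode, which
        # is exactly the credit-assignment problem the survey discusses.
        actions = (0, 1)  # 0 = step left, 1 = step right

        def reset(self):
            self.s = 0
            return self.s

        def step(self, a):
            self.s = max(0, self.s + (1 if a == 1 else -1))
            done = self.s == 4
            return self.s, (1.0 if done else 0.0), done

    def q_learning(env, episodes=500, alpha=0.1, gamma=0.95, epsilon=0.1):
        Q = {}

        def q(s, a):
            return Q.get((s, a), 0.0)

        def greedy(s):
            # Break ties randomly so the untrained agent still moves.
            best = max(q(s, a) for a in env.actions)
            return random.choice([a for a in env.actions if q(s, a) == best])

        for _ in range(episodes):
            s, done = env.reset(), False
            for _ in range(1000):  # step cap, in case an episode stalls
                # Exploration/exploitation: sometimes act randomly instead
                # of taking the currently best-looking action.
                a = random.choice(env.actions) if random.random() < epsilon else greedy(s)
                s2, r, done = env.step(a)
                # Bootstrapped target propagates the delayed terminal reward
                # backwards through the state space, one update at a time.
                best_next = 0.0 if done else max(q(s2, a2) for a2 in env.actions)
                Q[(s, a)] = q(s, a) + alpha * (r + gamma * best_next - q(s, a))
                s = s2
                if done:
                    break
        return Q

    Q = q_learning(Chain())
    print([round(max(Q.get((s, a), 0.0) for a in (0, 1)), 2) for s in range(4)])

After training, the printed state values decay geometrically with distance from the goal (roughly 0.86, 0.90, 0.95, 1.0), showing the delayed reward propagating back along the corridor.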


Interactions between learning and evolution

Classics

In Langton, C., Taylor, C., Farmer, J. D., and Rasmussen, S. (Eds.), Artificial Life II, pp. 487–509. Addison-Wesley.