AITopics

doi: 10.1613/jair.591

AI Access Foundation

10230

Country:

North America > United States (0.14)
Oceania > Nauru > Aiwo Constituency > Aiwo District (0.04)
Asia > Middle East > UAE (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.89)

Practically Coordinating

Durfee, Edmund H.

To coordinate, intelligent agents might need to know something about themselves, about each other, about how others view themselves and others, about how others think others view themselves and others, and so on. Taken to an extreme, the amount of knowledge an agent might possess to coordinate its interactions with others might outstrip the agent's limited reasoning capacity (its available time, memory, and so on). Much of the work in studying and building multiagent systems has thus been devoted to developing practical techniques for achieving coordination, typically by limiting the knowledge available to, or necessary for, agents. This article categorizes techniques for keeping agents suitably ignorant so that they can practically coordinate and gives a selective survey of examples of these techniques for illustration.

agent, coordination, knowledge, (16 more...)

Country:

North America > United States > California > San Mateo County > Menlo Park (0.05)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)

Gomez-Perez, Asuncion, Benjamins, V. Richard

Applications of Ontologies and Problem-Solving Methods

The Workshop on Applications of Ontologies and Problem-Solving Methods (PSMs), held in conjunction with the Thirteenth Biennial European Conference on Artificial Intelligence (ECAI-98), was held on 24 to 25 August 1998. Twenty-six people participated, and 16 papers were presented. Participants included scientists and practitioners from both the ontology and PSM communities. The first day was devoted to paper presentations and discussions. The second (half) day, a joint session was held with two other workshops: (1) Building, Maintaining, and Using Organizational Memories and (2) Intelligent Information Integration. The reason for the joint session was that in all three workshops, ontologies play a prominent role, and the goal was to bring together researchers working on related issues in different communities. The workshop ended with a discussion about the added value of a combined ontologies-PSM workshop compared to separate workshops.

application, ontology, university, (16 more...)

Country:

Europe > Spain > Galicia > Madrid (0.05)
South America > Paraguay > Asunción > Asunción (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.05)
(15 more...)

Genre: Instructional Material (0.47)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Kargupta, Hillol, Chan, Philip

The Distributed Data-Mining Worksho

Victor Lesser (University of Massachusetts at Amherst) gave an invited talk on distributed interpretation and its of Hong Kong Polytechnic University, possible implication in DDM. Mining, brought interested researchers (Brigham Young University) and Salvatore The paper sessions ended with two and practitioners together and created Stolfo (Columbia University) working paper presentations by Billy an environment for crystallizing the presented the effects of class distribution Wallace and Juan Botia, Marcedes Garijo, fast-growing field of DDM. The concluding session was the panel Lawrence Hall, Nitesh Chawla, and 40 participants attended the workshop. Stolfo, George Cybenko Kevin W. Bowyer (all of University of The workshop had 13 presentations, Stolfo stressed suggested different techniques for Cybenko of Dartmouth University. Organizers sincerely hope that the session.

artificial intelligence, data mining, university, (15 more...)

Country:

North America > United States > Massachusetts (0.25)
Asia > China > Hong Kong (0.25)
North America > United States > Illinois (0.16)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)

Kott, Alexander, Saks, Victor, Mercer, Albert

A New Technique Enables Dynamic Replanning and Rescheduling of Aeromedical Evacuation

We describe an application of a dynamic replanning technique in a highly dynamic and complex domain: the military aeromedical evacuation of patients to medical treatment facilities. U.S. Transportation Command (USTRANSCOM) is the U.S. Department of Defense (DoD) agency responsible for evacuating patients during wartime and peace. Doctrinally, patients requiring extended treatment must be evacuated by air to a suitable medical treatment facility. The Persian Gulf War was the first significant armed conflict in which this concept was put to a serious test. The results were far from satisfactory -- about 60 percent of the patients ended up at the wrong destinations. In early 1993, the DoD tasked USTRANSCOM to consolidate the command and control of medical regulation and aeromedical evacuation operations. The ensuing analysis led to TRAC2ES (TRANSCOM regulating and command and control evacuation system), a decision support system for planning and scheduling medical evacuation operations. Probably the most challenging aspect of the problem has to do with the dynamics of a domain in which requirements and constraints continuously change over time. Continuous dynamic replanning is a key capability of TRAC2ES. This article describes the application and the AI approach we took in providing this capability.

artificial intelligence, planning & scheduling, trac 2, (14 more...)

Country:

North America > United States > Louisiana (0.46)
Asia > Middle East > Saudi Arabia > Arabian Gulf (0.24)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Friedman, N., Halpern, J. Y.

Modeling Belief in Dynamic Systems, Part II: Revision and Update

Journal of Artificial Intelligence ResearchMar-1-1999

The study of belief change has been an active area in philosophy and AI. In recent years two special cases of belief change, belief revision and belief update, have been studied in detail. In a companion paper (Friedman & Halpern, 1997), we introduce a new framework to model belief change. This framework combines temporal and epistemic modalities with a notion of plausibility, allowing us to examine the change of beliefs over time. In this paper, we show how belief revision and belief update can be captured in our framework. This allows us to compare the assumptions made by each method, and to better understand the principles underlying them. In particular, it shows that Katsuno and Mendelzon's notion of belief update (Katsuno & Mendelzon, 1991a) depends on several strong assumptions that may limit its applicability in artificial intelligence. Finally, our analysis allow us to identify a notion of minimal change that underlies a broad range of belief change operations including revision and update.

bel null, nullnull, nullnullnull, (16 more...)

doi: 10.1613/jair.506

AI Access Foundation

10224

Country: Asia > Middle East > Israel (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (1.00)

The Asymptotic Convergence-Rate of Q-learning

Szepesvári, Csaba

Q-Iearning is a popular reinforcement learning (RL) algorithm whose convergence is well demonstrated in the literature (Jaakkola et al., 1994; Tsitsiklis, 1994; Littman and Szepesvari, 1996; Szepesvari and Littman, 1996). Our aim in this paper is to provide an upper bound for the convergence rate of (lookup-table based) Q-Iearning algorithms. Although, this upper bound is not strict, computer experiments (to be presented elsewhere) and the form of the lemma underlying the proof indicate that the obtained upper bound can be made strict by a slightly more complicated definition for R. Our results extend to learning on aggregated states (see (Singh et al., 1995» and other related algorithms which admit a certain form of asynchronous stochastic approximation (see (Szepesv iri and Littman, 1996». Present address: Associative Computing, Inc., Budapest, Konkoly Thege M. u. 29-33, HUNGARY-1121 The Asymptotic Convergence-Rate of Q-leaming

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.30)
Europe > Hungary > Budapest > Budapest (0.24)
Europe > Hungary > Csongrád-Csanád County > Szeged (0.05)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

The Asymptotic Convergence-Rate of Q-learning

Szepesvári, Csaba

Q-Iearning is a popular reinforcement learning (RL) algorithm whose convergence is well demonstrated in the literature (Jaakkola et al., 1994; Tsitsiklis, 1994; Littman and Szepesvari, 1996; Szepesvari and Littman, 1996). Our aim in this paper is to provide an upper bound for the convergence rate of (lookup-table based) Q-Iearning algorithms. Although, this upper bound is not strict, computer experiments (to be presented elsewhere) and the form of the lemma underlying the proof indicate that the obtained upper bound can be made strict by a slightly more complicated definition for R. Our results extend to learning on aggregated states (see (Singh et al., 1995» and other related algorithms which admit a certain form of asynchronous stochastic approximation (see (Szepesv iri and Littman, 1996». Present address: Associative Computing, Inc., Budapest, Konkoly Thege M. u. 29-33, HUNGARY-1121 The Asymptotic Convergence-Rate of Q-leaming

algorithm, asymptotic convergence-rate, convergence rate, (13 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.30)
Europe > Hungary > Budapest > Budapest (0.24)
Europe > Hungary > Csongrád-Csanád County > Szeged (0.05)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Hofmann, Reimar, Tresp, Volker

Nonlinear Markov Networks for Continuous Variables

We address the problem oflearning structure in nonlinear Markov networks with continuous variables. This can be viewed as non-Gaussian multidimensional density estimation exploiting certain conditional independencies in the variables. Markov networks are a graphical way of describing conditional independencies well suited to model relationships which do not exhibit a natural causal ordering. We use neural network structures to model the quantitative relationships between variables. The main focus in this paper will be on learning the structure for the purpose of gaining insight into the underlying process. Using two data sets we show that interesting structures can be found using our approach. Inference will be briefly addressed.

boston housing data, markov boundary, markov network, (12 more...)

Country:

Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)
Asia > Japan (0.04)

Industry: Banking & Finance (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Monaco, Jeffrey F., Ward, David G., Barto, Andrew G.

Automated Aircraft Recovery via Reinforcement Learning: Initial Experiments

An emerging use of reinforcement learning (RL) is to approximate optimal policies for large-scale control problems through extensive simulated control experience. Described here are initial experiments directed toward the development of an automated recovery system (ARS) for high-agility aircraft. An ARS is an outer-loop flight control system designed to bring the aircraft from a range of initial states to straight, level, and non-inverted flight in minimum time while satisfying constraints such as maintaining altitude and accelerations within acceptable limits. Here we describe the problem and present initial results involving only single-axis (pitch) recoveries. Through extensive simulated control experience using a medium-fidelity simulation of an F-16, the RL system approximated an optimal policy for longitudinal-stick inputs to produce near-minimum-time transitions to straight and level flight in unconstrained cases, as well as while meeting a pilot-station acceleration constraint. 2 AIRCRAFT MODEL

aircraft, initial condition, rl system, (13 more...)