Benchmarks, Test Beds, Controlled Experimentation, and the Design of Agent Architectures

Hanks, Steve, Pollack, Martha E., Cohen, Paul R.

Dec-15-1993–AI Magazine

Benchmarks, test beds, and controlled experimentation are becoming more common. We discuss the issues this methodology raises as they relate to research on agent design. We survey existing test beds for agents and argue for appropriate caution in their use. We end with a debate on the proper role of experimental methodology in the design and validation of planning agents.

artificial intelligence, experimentation, performance indicator
