Benchmarks, Test Beds, Controlled Experimentation, and the Design of Agent Architectures
Hanks, Steve, Pollack, Martha E., Cohen, Paul R.
The methodological underpinnings of AI are slowly changing. Benchmarks, test beds, and controlled experimentation are becoming more common. Although we are optimistic that this change can solidify the science of AI, we also recognize a set of difficult issues concerning the appropriate use of this methodology. We discuss these issues as they relate to research on agent design. We survey existing test beds for agents and argue for appropriate caution in their use. We end with a debate on the proper role of experimental methodology in the design and validation of planning agents.
- Country:
- Oceania > Australia (0.04)
- North America > United States
- New York (0.04)
- Texas (0.04)
- Pennsylvania (0.04)
- Michigan (0.04)
- District of Columbia > Washington (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California
- San Mateo County > Menlo Park (0.04)
- Santa Clara County
- Palo Alto (0.04)
- Mountain View (0.04)
- Los Altos (0.04)
- Asia
- Middle East > Israel (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Instructional Material (0.67)
- Industry:
- Technology: