LEGOEval: An Open-Source Toolkit for Dialogue System Evaluation via Crowdsourcing
Li, Yu, Arnold, Josh, Yan, Feifan, Shi, Weiyan, Yu, Zhou
–arXiv.org Artificial Intelligence
We present LEGOEval, an open-source toolkit that enables researchers to easily evaluate dialogue systems in a few lines of code using the online crowdsource platform, Amazon Mechanical Turk. Compared to existing toolkits, LEGOEval features a flexible task design by providing a Python API that maps to commonly used React.js interface components. Researchers can personalize their evaluation procedures easily with our built-in pages as if playing with LEGO blocks. Thus, LEGOEval provides a fast, consistent method for reproducing human evaluation results. Besides the flexible task design, LEGOEval also offers an easy API to review collected data.
arXiv.org Artificial Intelligence
May-5-2021
- Country:
- Oceania > Australia
- North America
- Canada (0.04)
- United States
- Pennsylvania (0.04)
- Texas > Travis County
- Austin (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Yolo County
- Davis (0.04)
- Europe > Czechia
- Prague (0.04)
- Genre:
- Research Report (0.50)
- Technology: