LEGOEval: An Open-Source Toolkit for Dialogue System Evaluation via Crowdsourcing

Li, Yu, Arnold, Josh, Yan, Feifan, Shi, Weiyan, Yu, Zhou

May-5-2021–arXiv.org Artificial Intelligence

We present LEGOEval, an open-source toolkit that enables researchers to easily evaluate dialogue systems in a few lines of code using the online crowdsource platform, Amazon Mechanical Turk. Compared to existing toolkits, LEGOEval features a flexible task design by providing a Python API that maps to commonly used React.js interface components. Researchers can personalize their evaluation procedures easily with our built-in pages as if playing with LEGO blocks. Thus, LEGOEval provides a fast, consistent method for reproducing human evaluation results. Besides the flexible task design, LEGOEval also offers an easy API to review collected data.

evaluation, evaluation task, legoeval, (14 more...)

arXiv.org Artificial Intelligence

May-5-2021

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - Canada (0.04)
  - United States
    - Pennsylvania (0.04)
    - Texas > Travis County
      - Austin (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California > Yolo County
      - Davis (0.04)
- Europe > Czechia
  - Prague (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology
  - Artificial Intelligence (1.00)
  - Communications > Social Media
    - Crowdsourcing (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found