Table 2: self-playvsinteractiveeval

Feb-15-2026, 08:07:16 GMT–Neural Information Processing Systems

Reward Quality Fluency Diversity Contingen. Reward exploitation in RL16 is a known problem and an active area of17 research (Amodei et al., 2016). Additionally,wehave21 run further experiments and provide strong empirical evidence that our proposed metrics are not easily exploitable.22 Primary (evaluation) and secondary (EI) contributions [R2, R3]: The main contribution of this work is an30 evaluation methodology that captures higher level human conversation concepts.

evaluation, interactive evaluation, table 2, (1 more...)

Neural Information Processing Systems

Feb-15-2026, 08:07:16 GMT

Conferences PDF

Add feedback

Duplicate Docs Excel Report

Title
fc9812127bf09c7bd29ad6723c683fb5-AuthorFeedback.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found