COLING 2022 Highlights

Recent metrics for natural language generation rely on pre-trained language models; examples include BERTScore, BLEURT, and COMET. These metrics correlate well with human judgments on standard benchmarks, but it is unclear how they perform on styles and domains that aren't well represented in their training data. In other words, are these metrics robust? The authors found that BERTScore isn't robust to character-level perturbations such as typos.
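
To make the robustness question concrete, here is a minimal sketch of a character-level perturbation probe, using the open-source `bert_score` package. The `perturb_chars` helper, the swap-based perturbation, and the 10% rate are illustrative assumptions, not the paper's exact setup.

```python
# Sketch of a character-level robustness probe for BERTScore.
# Assumes `pip install bert-score`; the perturbation scheme below
# (random adjacent-character swaps) is a hypothetical stand-in for
# whatever perturbations the paper actually uses.
import random

from bert_score import score

def perturb_chars(text: str, rate: float = 0.05, seed: int = 0) -> str:
    """Swap adjacent characters with probability `rate` (typo-like noise)."""
    rng = random.Random(seed)
    chars = list(text)
    for i in range(len(chars) - 1):
        if rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)

references = ["The quick brown fox jumps over the lazy dog."]
clean = ["The quick brown fox jumps over the lazy dog."]
noisy = [perturb_chars(clean[0], rate=0.10)]

# Score clean and perturbed candidates against the same reference.
_, _, f1_clean = score(clean, references, lang="en", verbose=False)
_, _, f1_noisy = score(noisy, references, lang="en", verbose=False)

print(f"clean F1:     {f1_clean.mean().item():.4f}")
print(f"perturbed F1: {f1_noisy.mean().item():.4f}")
```

A robust metric should degrade gracefully under this kind of typo-level noise; a sharp drop in F1 at small perturbation rates is the kind of brittleness the authors report for BERTScore.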
