Non-repeatable experiments and non-reproducible results: The reproducibility crisis in human evaluation in NLP

A Belz, C Thomson, E Reiter, S Mille Findings of the Association for Computational Linguistics: ACL 2023, 3676-3687, 2023

Abstract

None