HANNA Benchmark

HANNA is a large dataset of Human-ANnotated NArratives for the evaluation of automatic story generation (ASG). It was introduced in the paper “Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation”, accepted at COLING 2022.

The repository is accessible here.