Skip to content

Latest commit

 

History

History
16 lines (13 loc) · 605 Bytes

chaganty2018.md

File metadata and controls

16 lines (13 loc) · 605 Bytes

Chaganty 2018

This dataset contains quality judgments for several different summarization systems on the CNN/DailyMail dataset. The data was published in The price of debiasing automatic metrics in natural language evaluation.

sacrerouge setup-dataset chaganty2018 \
    <output-dir>

The output files are the following:

  • documents.jsonl: The CNN/DailyMail documents
  • summaries.jsonl: The system summaries
  • metrics.jsonl: The corresponding manual evaluation metrics for the system summaries

Notes

006588 appears twice for ml+rl.