stats.fr.md

  • lastUpdate: 2022-01-07
  • lang: fr
  • total number of reigns: 50
  • total number of episodes: 1007
  • ratio of fully processed episodes: 40 %
  • total number of defects in episodes: 14648
  • total number of extracted keywords: 904
  • total number of published web pages: 1962
  • estimated end date: 2025-07-08

Stats about the French version of the Shahnahme

Fully processed episode:

  • Any episode that has been reviewed manually and is guaranteed to be an exact copy of the original document.

Defect:

  • A defect is any part of the text that is invalid in spelling or grammatical construction. Defects arise because OCR (Optical Character Recognition) is less accurate than a human reader.

Keyword:

  • A keyword is the name of a character, a place, or an entity, or any other word of interest, such as résurrection, quatorze, or immortel.

2021-09-26

  • ratio of fully processed episodes: 38 %
  • total number of defects in episodes: 14842
  • total number of extracted keywords: 818
  • total number of published web pages: 1945
  • estimated end date: 2025-02-15

2021-09-08

  • ratio of fully processed episodes: 38 %
  • total number of defects in episodes: 15579
  • total number of extracted keywords: 818
  • total number of published web pages: 1945
  • estimated end date: 2025-01-12

2021-08-19

  • ratio of fully processed episodes: 37 %
  • total number of defects in episodes: 16440
  • total number of extracted keywords: 795
  • total number of published web pages: 1854
  • estimated end date: 2025-01-12

2021-08-13

  • ratio of fully processed episodes: 36 %
  • total number of defects in episodes: 16916
  • total number of extracted keywords: 789
  • total number of published web pages: 1848
  • estimated end date: 2025-02-02

2021-08-12

  • ratio of fully processed episodes: 36 %
  • total number of defects in episodes: 16931
  • total number of extracted keywords: 786
  • total number of published web pages: 1845
  • estimated end date: 2025-01-26

2021-08-09

  • ratio of fully processed episodes: 36 %
  • total number of defects in episodes: 17163
  • total number of extracted keywords: 786
  • total number of published web pages: 1845
  • estimated end date: 2025-01-26
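
The defect counts in the snapshots above can be compared to see how quickly defects are being removed. The following is a minimal sketch in Python, assuming the values are copied by hand from this page; the names `snapshots` and `defects_removed_per_day` are hypothetical and not part of the project. It is not the method used to compute the estimated end date, which this page does not describe.

```python
from datetime import date

# Defect counts copied by hand from the snapshots on this page
# (snapshot date -> total number of defects in episodes).
snapshots = {
    date(2021, 8, 9): 17163,
    date(2021, 8, 12): 16931,
    date(2021, 8, 13): 16916,
    date(2021, 8, 19): 16440,
    date(2021, 9, 8): 15579,
    date(2021, 9, 26): 14842,
    date(2022, 1, 7): 14648,
}


def defects_removed_per_day(history):
    """Average number of defects removed per day between the
    oldest and the newest snapshot in `history`."""
    first, last = min(history), max(history)
    elapsed_days = (last - first).days
    removed = history[first] - history[last]
    return removed / elapsed_days


rate = defects_removed_per_day(snapshots)
print(f"{rate:.1f} defects removed per day on average")
```

Only the oldest and newest snapshots enter the average, so the result hides the slowdown visible between 2021-09-26 and 2022-01-07.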