From 69ffc7ba4572e4ad51a361a290f60cd0b97ca9d7 Mon Sep 17 00:00:00 2001
From: Helena
Date: Sun, 5 Apr 2020 14:03:59 +0200
Subject: [PATCH] Initial public commit

---
 README.md     |   2 +
 about.html    | 153 ++++++++++++++++++++
 citation.html |  26 ++++
 credits.html  |  50 +++++++
 discover.php  | 209 +++++++++++++++++++++++++++
 index.php     | 387 ++++++++++++++++++++++++++++++++++++++++++++++++++
 result.php    | 171 ++++++++++++++++++++++
 rhyme.php     | 123 ++++++++++++++++
 showpoem.php  |  81 +++++++++++
 9 files changed, 1202 insertions(+)
 create mode 100644 README.md
 create mode 100644 about.html
 create mode 100644 citation.html
 create mode 100644 credits.html
 create mode 100644 discover.php
 create mode 100644 index.php
 create mode 100644 result.php
 create mode 100644 rhyme.php
 create mode 100644 showpoem.php

diff --git a/README.md b/README.md
new file mode 100644
index 0000000..94a771b
--- /dev/null
+++ b/README.md
@@ -0,0 +1,2 @@
+# DISCOver
+DISCOver: an interface to explore the DISCO corpus
diff --git a/about.html b/about.html
new file mode 100644
index 0000000..6ad0be1
--- /dev/null
+++ b/about.html
@@ -0,0 +1,153 @@
+ About (the DISCOurse)
+ DISCO
+

About this corpus

+

Corpus description

+

Our corpus currently offers a total of 4087 sonnets in Spanish: 2676 from the 19th century, 323 from the 18th century and 1088 from the so-called Spanish Golden Age (15th to 17th centuries). There are a total of 1204 authors (from both Spain and Latin America). The corpus is intended to provide a wide sample, inspired by distant reading approaches (Moretti, 2005). The raw texts were in most cases extracted from the Biblioteca Virtual Miguel de Cervantes (1999), with some 18th-century texts coming from Wikisource. Table 1 in the section Data distribution below summarizes these data.

+

The corpus is available in plain-text and TEI formats; XML-TEI P5 was used given this standard’s benefits in terms of reuse, storage, and retrieval. Author metadata (year and place of birth and death, and gender) were extracted or inferred from unstructured content in the sources and placed in the TEI header, or in a metadata table in the case of the plain-text version. For both TEI and plain-text formats, two versions of the texts are available: one collecting all sonnets by an author in a single file, the other encoding a single sonnet per file. For corpus preparation, we closely followed the TEI guidelines and RIDE’s criteria for Digital Text Collections (Henny-Krahmer and Neuber, 2017).
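As an illustration of how author metadata stored in a TEI header might be read back, here is a minimal sketch using Python's standard `xml.etree.ElementTree`. The sample document, the element layout inside the header (`person`, `persName`, `sex`, `birth`, `death`) and the function name `author_metadata` are assumptions made for this example; the actual DISCO files may be organized differently.

```python
import xml.etree.ElementTree as ET

# The TEI P5 namespace; the prefix "tei" is only a local alias.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical, heavily simplified file: NOT copied from the DISCO corpus.
SAMPLE_TEI = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <profileDesc>
      <particDesc>
        <listPerson>
          <person>
            <persName>Sample Author</persName>
            <sex value="F"/>
            <birth when="1800"><placeName>Madrid</placeName></birth>
            <death when="1870"><placeName>Sevilla</placeName></death>
          </person>
        </listPerson>
      </particDesc>
    </profileDesc>
  </teiHeader>
  <text><body><lg type="sonnet"/></body></text>
</TEI>"""


def author_metadata(tei_xml):
    """Return a dict with the author fields found in the (assumed) header layout."""
    root = ET.fromstring(tei_xml)
    header = root.find("tei:teiHeader", TEI_NS)
    person = header.find(".//tei:person", TEI_NS) if header is not None else None
    if person is None:
        return {}

    def text_of(path):
        node = person.find(path, TEI_NS)
        return node.text.strip() if node is not None and node.text else None

    def attr_of(path, attr):
        node = person.find(path, TEI_NS)
        return node.get(attr) if node is not None else None

    return {
        "name": text_of("tei:persName"),
        "gender": attr_of("tei:sex", "value"),
        "birth_year": attr_of("tei:birth", "when"),
        "birth_place": text_of("tei:birth/tei:placeName"),
        "death_year": attr_of("tei:death", "when"),
        "death_place": text_of("tei:death/tei:placeName"),
    }


meta = author_metadata(SAMPLE_TEI)
print(meta)
```

The same dictionary could be written out as one row of the metadata table that accompanies the plain-text version.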

+

Additionally, authors have been assigned VIAF identifiers and described using RDFa attributes. This gives the corpus an entry point to the Linked Open Data cloud, enhancing its findability. The corpus is available as a GitHub repository and archived in Zenodo, following good practices for data use, reuse, and conservation.

Data distribution

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Table 1: Corpus data distribution per period, author gender and primary continent of literary activity

Period                     Nbr of Sonnets   Nbr of Authors               Tokens
19th                       2676             685    Female    48          252,518
                                                   Male      637
                                                   America   334
                                                   Europe    348 (+3)
18th                       323              42     Female    1           29,006
                                                   Male      41
                                                   America   6
                                                   Europe    36
15th-17th (Golden Age)     1088             477    Female    31          99,779
                                                   Male      446
                                                   America   12
                                                   Europe    458 (+7)
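The figures in Table 1 can be cross-checked with a few lines of arithmetic. The sketch below transcribes the table into a Python dictionary and verifies that the per-period counts add up. The field names are illustrative, and reading "(+3)" and "(+7)" as authors counted outside the America/Europe split is an assumption, not something the table states.

```python
# Figures transcribed from Table 1; field names are illustrative only.
# "other_origin" holds the "(+N)" values, ASSUMED to be authors that fall
# outside the America/Europe breakdown.
TABLE_1 = {
    "19th": {"sonnets": 2676, "authors": 685, "female": 48, "male": 637,
             "america": 334, "europe": 348, "other_origin": 3},
    "18th": {"sonnets": 323, "authors": 42, "female": 1, "male": 41,
             "america": 6, "europe": 36, "other_origin": 0},
    "15th-17th": {"sonnets": 1088, "authors": 477, "female": 31, "male": 446,
                  "america": 12, "europe": 458, "other_origin": 7},
}


def check_period(row):
    """True when the gender and origin breakdowns both sum to the author count."""
    by_gender = row["female"] + row["male"]
    by_origin = row["america"] + row["europe"] + row["other_origin"]
    return by_gender == row["authors"] and by_origin == row["authors"]


total_sonnets = sum(row["sonnets"] for row in TABLE_1.values())
total_authors = sum(row["authors"] for row in TABLE_1.values())
all_consistent = all(check_period(row) for row in TABLE_1.values())

print(total_sonnets, total_authors, all_consistent)
```

Under that reading, the period rows sum to the 4087 sonnets and 1204 authors given in the corpus description.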
+

Bibliography

+

Biblioteca Virtual Miguel de Cervantes. 1999. Biblioteca Virtual Miguel de Cervantes. http://www.cervantesvirtual.com.

+

Henny-Krahmer, Ulrike, and Frederike Neuber. 2017. “Criteria for Reviewing Digital Text Collections, Version 1.0.” A Review Journal for Digital Editions and Resources, no. 6. https://www.i-d-e.de/publikationen/weitereschriften/criteria-text-collections-version-1-0.

+

Moretti, Franco. 2005. Graphs, Maps, Trees: Abstract Models for a Literary History. Verso.

+
+
+

Cálamo currante

+

If you set out to write a sonnet,
+ do as I do, which, in faith, is not much;
+ after the third line the fourth will come...
+ Why, it is child's play! Look: a quatrain!

+ +

Then make another like it, for I promise you
+ that if this is giving birth, it is an easy labor;
+ that makes six lines, and now I thread on the seventh;
+ one more, and that makes eight, and on to the first tercet.

+ +

All it takes is for the ninth line to join the dance
+ and the tenth to be drawn into the ring.
+ Is there a rhyme for "baile" and "fraile"? There is.

+ +

Well then, this is now a piece of cake,
+ and behold Periquillo made a friar,
+ and behold the little sonnet finished.

+

Francisco de Osuna

+
+
\ No newline at end of file
diff --git a/citation.html b/citation.html
new file mode 100644
index 0000000..b281e29
--- /dev/null
+++ b/citation.html
@@ -0,0 +1,26 @@
+ About DISCO

Credits

+

This interface visualizes and analyses the data available at:

+

Ruiz Fabo, Pablo, Helena Bermúdez Sabel, Clara Martínez Cantón, and José Calvo Tello. 2017. Diachronic Spanish Sonnet Corpus (DISCO). Madrid: UNED. https://github.com/pruizf/disco.

+

That dataset was enhanced, and rhyme annotation was added using the tool RhymeTagger, developed by Petr Plecháč (Ústav pro českou literaturu AV ČR, v. v. i.).

diff --git a/credits.html b/credits.html
new file mode 100644
index 0000000..6a75766
--- /dev/null
+++ b/credits.html
@@ -0,0 +1,50 @@
+ Credits
+ DISCO
+

Credits

+

How to cite

+

Bermúdez Sabel, Helena, Clara Martínez Cantón, and Pablo Ruiz Fabo. 2019. DISCOver: an interface to explore the DISCO corpus. http://prf1.org/disco/

Dataset

+

This interface visualizes and analyses the data available at:

+
Ruiz Fabo, Pablo, Helena Bermúdez Sabel, Clara Martínez Cantón, and José Calvo Tello. 2017. Diachronic Spanish Sonnet Corpus (DISCO). Madrid: UNED. https://github.com/pruizf/disco.
+

This dataset was enhanced, and rhyme annotation was added using the tool RhymeTagger, developed by Petr Plecháč (Ústav pro českou literaturu AV ČR, v. v. i.).

+

The rhyme database (including the query and visualization resources) replicates Gunstick, the rhyme database and related tools developed by the Versologie research group.

+

The interface was developed thanks to a Josef Dobrovský Fellowship, funded by the Akademie věd České republiky (year 2018).

+
diff --git a/discover.php b/discover.php
new file mode 100644
index 0000000..fbe5c87
--- /dev/null
+++ b/discover.php
@@ -0,0 +1,209 @@
+ DISCOver trends
+ + + + + +

DISCOver trends

+ +
+
+ + + +
\ No newline at end of file
diff --git a/index.php b/index.php
new file mode 100644
index 0000000..f41290b
--- /dev/null
+++ b/index.php
@@ -0,0 +1,387 @@
+ corpus DISCO
+ + + + +
+  Gender: +
+ + +
+ Origin: +
+ + +
+ Century: +
+ + + +

+ + + + + + +

+ + +
+ +
+ + + + + + + + + + + +
+ + +
+
+
+
+ +
+ + + + + +
+ +
+
diff --git a/result.php b/result.php
new file mode 100644
index 0000000..5714f41
--- /dev/null
+++ b/result.php
@@ -0,0 +1,171 @@
+Rhyme DISCOrdance
+ + + + + +
+ + +
+ + +

+ + +
+ + # R.TOKEN: | + # R.TYPE: + +
+ + +
+ + +
+ + +

This resource replicates Gunstick, the rhyme database and related tools developed by the Versologie research group.

+ +
+ + + +
+ + + +
+ +
+ + + +
+
+ + +
+
+
+ + +
+ + + + + + + + + +
Rhyme    Line 1 (call)    Line 2 (echo)    Author    Work    Century
+
+ +
+ +
+
+
+ + + + + + + +
diff --git a/rhyme.php b/rhyme.php
new file mode 100644
index 0000000..29546d7
--- /dev/null
+++ b/rhyme.php
@@ -0,0 +1,123 @@
+DISCOver rhyme
+ + + + +
+ + +
+ + +

Rhyme database

+

This resource replicates Gunstick, the rhyme database and related tools developed by the Versologie research group.

+
+ + +
+ + + +
+ +
+
+ + + + + + + + + + + + + + +
Rhyme word: +

Century: From: + To: +
Author:
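The search form above takes a rhyme word, a century range and an author. A minimal sketch of how such a query could filter rhyme pairs is given below; it is not the interface's actual PHP implementation, whose code is not visible here. The `RhymePair` record, the `search` function and all sample rows are hypothetical, with fields loosely following the result columns (rhyme, call line, echo line, author, work, century).

```python
import string
from dataclasses import dataclass


@dataclass(frozen=True)
class RhymePair:
    rhyme: str    # shared rhyme ending
    call: str     # first line of the pair
    echo: str     # second line of the pair
    author: str
    work: str
    century: int


def last_word(line):
    """Lower-cased final word of a line, with trailing ASCII punctuation removed."""
    words = line.split()
    return words[-1].strip(string.punctuation).lower() if words else ""


def search(pairs, rhyme_word=None, century_from=None, century_to=None, author=None):
    """Keep the pairs that satisfy every filter that was actually supplied."""
    hits = []
    for pair in pairs:
        if rhyme_word is not None:
            wanted = rhyme_word.lower()
            if wanted not in (last_word(pair.call), last_word(pair.echo)):
                continue
        if century_from is not None and pair.century < century_from:
            continue
        if century_to is not None and pair.century > century_to:
            continue
        if author is not None and author.lower() not in pair.author.lower():
            continue
        hits.append(pair)
    return hits


# Made-up placeholder rows, not taken from the corpus.
SAMPLE = [
    RhymePair("or", "placeholder line about amor", "placeholder line about dolor,",
              "Author A", "Work 1", 17),
    RhymePair("ento", "placeholder line about viento", "placeholder line about tormento",
              "Author B", "Work 2", 19),
    RhymePair("or", "placeholder line about flor", "placeholder line about amor.",
              "Author B", "Work 3", 19),
]

by_word = search(SAMPLE, rhyme_word="amor")
by_range = search(SAMPLE, century_from=18, century_to=19)
by_author = search(SAMPLE, rhyme_word="amor", author="author b")
print(len(by_word), len(by_range), len(by_author))
```

Treating every filter as optional mirrors a form in which any field may be left empty; a database-backed version would presumably express the same conditions in its query language.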
+ + + + + +
+
+ +
+ + + +
diff --git a/showpoem.php b/showpoem.php
new file mode 100644
index 0000000..4f09ee2
--- /dev/null
+++ b/showpoem.php
@@ -0,0 +1,81 @@
+ DISCO - text edition
+ +
+

Prosodic elements

+ + + +
+ + $val){ + + //echo the contents + } + + + + + ?> + +
+ + + + +