You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Nov 5, 2019. It is now read-only.
Visualization to display "archival coverage," starting with epa.gov. This takes a list of urls and associated archiving information, and turns that into a tree of url paths with associated coverage information.
8
+
**Coverage** is a project for visualizing the status of digital data archiving efforts across various data repositories run by different initiatives. Its current scope covers data within the epa.gov top-level domain.
9
9
10
-
The output is cached in `cache.json`, because this is a large file, we provide incremental pieces of the cached tree as a web server. To dynamically calculate coverage completion to can work with the `cache.json` file.
10
+
This code repo provides the JSON back-end: [`https://api.archivers.co/coverage`](https://api.archivers.co/coverage)
11
11
12
-
## Current Coverage Sources
12
+
The [`datatogether/webapp` repo](https://github.com/datatogether/webapp) provides the visual front-end: [`https://archivers.co/coverage`](https://archivers.co/coverage)
13
13
14
-
Actual source datasets can be found in the `/repositories` directory. It currently includes the following:
15
14
16
-
* Archivers 2
17
-
* archivers.space
18
-
* EDGI Nomination Tool Uncrawlables
19
-
* The Internet Archive
20
-
* Project Svalbard json-ld crawl
15
+
## Current Data Repositories
16
+
17
+
Actual source datasets can be found in each [`/repositories/*` directory](/repositories). It currently includes the following:
Requests for new data repositories are tracked under the [`data-repository`](https://github.com/datatogether/coverage/labels/data-repository) issue label.
26
+
27
+
28
+
## How It Works
29
+
30
+
It takes a list of urls and associated archiving information, and turns that into a tree of url paths with associated coverage information.
31
+
32
+
The output is cached in [`cache.json`](cache.json). Because this is a large file, we provide incremental pieces of the cached tree as a web server. To dynamically calculate coverage completion to can work with the `cache.json` file.
33
+
21
34
22
35
## License & Copyright
23
36
@@ -32,18 +45,21 @@ PARTICULAR PURPOSE.
32
45
33
46
See the [`LICENSE`](./LICENSE) file for details.
34
47
48
+
35
49
## Getting Involved
36
50
37
51
We would love involvement from more people! If you notice any errors or would like to submit changes, please see our [Contributing Guidelines](./github/CONTRIBUTING.md).
38
52
39
53
We use GitHub issues for [tracking bugs and feature requests](./issues) and Pull Requests (PRs) for [submitting changes](./pulls)
40
54
55
+
41
56
## Installation
42
57
43
58
The easiest way to get going is to use [docker-compose](https://docs.docker.com/compose/install/). Once you have that:
0 commit comments