Commit 151d6a1

Update README.md: enhance combining item/thesis export for comparison with DSpace docs.
1 parent 78dbf1d commit 151d6a1

1 file changed: +18 -2 lines changed

README.md

Lines changed: 18 additions & 2 deletions
@@ -159,7 +159,23 @@ where the key (e.g., `name`) is the output CSV column label, the columns determi
 The steps to set up a validation run.
 
 1. Use `./jupiter_output_scripts/juptiter_collection_metadata_to_CSV.rb` to export Jupiter metadata as CSV
+
 2. Use `./combine_jupiter_item_and_thesis.py` to combine the Jupiter Item and Thesis CSVs into a single CSV. This eases comparison with Scholaris, which uses a single type to store both, with optional fields holding thesis-related metadata.
+
+``` bash
+# Combine Item and Thesis metadata
+./venv/bin/python src/combine_jupiter_item_and_thesis.py \
+    --input_item ~/Downloads/era_export/jupiter_item_2025-03-31_16-26-48.csv \
+    --input_thesis ~/Downloads/era_export/jupiter_thesis_2025-03-31_16-26-28.csv \
+    --output /tmp/jupiter_combined_item_thesis.csv
+
+# Combine Item and Thesis ActiveStorage
+./venv/bin/python src/combine_jupiter_item_and_thesis.py \
+    --input_item ~/Downloads/era_export/jupiter_item_activestorage_2025-03-31_16-26-48.csv \
+    --input_thesis ~/Downloads/era_export/jupiter_thesis_activestorage_2025-03-31_16-26-28.csv \
+    --output /tmp/jupiter_combined_activestorage.csv
+```
+
 3. Use `./dspace_api_exports.py` to export DSpace metadata as CSV
 
 ```bash
@@ -214,14 +230,14 @@ The steps to set up a validation run.
 
 # Item audit results
 venv/bin/python src/compare_csv.py \
-    --input_jupiter ~/Downloads/era_export/jupiter_items_2025-03-06_12-08-01.csv \
+    --input_jupiter ~/Downloads/era_export/jupiter_combined_item_thesis.csv \
     --input_dspace ~/Downloads/scholaris_items.csv \
     --output /tmp/migration_audit_items_$(date +%Y-%m-%d_%H:%M:%S).csv \
     --type items
 
 # Bitstream audit results
 venv/bin/python src/compare_csv.py \
-    --input_jupiter ~/Downloads/era_export/jupiter_items_2025-03-06_12-08-01.csv \
+    --input_jupiter ~/Downloads/era_export/jupiter_combined_activestorage.csv \
     --input_dspace ~/Downloads/scholaris_bitstreams.csv \
     --output /tmp/migration_audit_bitstreams_$(date +%Y-%m-%d_%H:%M:%S).csv \
     --type bitstreams
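
The README change describes combining the Jupiter Item and Thesis CSVs into one file so they can be compared against Scholaris, which keeps both in a single type with optional thesis fields. The diff does not show how `combine_jupiter_item_and_thesis.py` does this, so the following is only a minimal sketch of one plausible approach using Python's standard `csv` module. The function name `combine_item_and_thesis` and the idea of taking the union of the two headers are assumptions for illustration, not the script's actual implementation.

```python
import csv


def combine_item_and_thesis(item_csv, thesis_csv, output_csv):
    """Hypothetical helper: merge two CSV exports into a single CSV.

    The output header is the union of both input headers, so columns
    that exist only in the Thesis export become optional (blank) cells
    on Item rows, and vice versa.
    """
    rows = []
    fieldnames = []
    for path in (item_csv, thesis_csv):
        with open(path, newline="", encoding="utf-8") as handle:
            reader = csv.DictReader(handle)
            # Keep columns in first-seen order; Item columns come first.
            for name in reader.fieldnames or []:
                if name not in fieldnames:
                    fieldnames.append(name)
            rows.extend(reader)

    with open(output_csv, "w", newline="", encoding="utf-8") as handle:
        # restval="" writes a blank cell where a row lacks a column.
        writer = csv.DictWriter(handle, fieldnames=fieldnames, restval="")
        writer.writeheader()
        writer.writerows(rows)
    return fieldnames
```

Called as `combine_item_and_thesis("item.csv", "thesis.csv", "combined.csv")`, the sketch writes every Item row followed by every Thesis row. It does not rename or reconcile columns that mean the same thing under different labels; whether the real script does so is not stated in this commit.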
