Generate the data for the PSEO Expansion Map
Disclaimer: The data generated here should not be used for analysis. It's cobbling together a few things to tell a story about the growth of the PSEO partnership.
The web app is nice but sometimes you just need an image for a presentation. gifcap.dev works well for recording the web map to a gif file.
This project uses conda.
# install deps
conda env create --file=environment.yaml
# activate
conda activate pseo-partners-map
# create the output file
./generate.py -r -o viz_data.csv
# create the output file and the intermediate sqlite database
# add the -f/--force flag to replace an existing intermediate database
./generate.py -r -d ./pseo.db -o viz_data.csv
This process uses all public PSEO earnings vintages (link) to identify when an institution joined the partnership. The grad totals per institution are from the IPEDS counts attached to the earnings files.
The initial release was not used as it's in a different format and doesn't contain gradcounts
The pseo data uses the opeid while the school locations use a unit id. Some crosswalks are available here which are stored in the data/
directory. No effort has gone into syncing data vintages or doing cleanup on 3rd party sources.
61 institutions couldn't be crosswalked from OPEID <-> UNITID
The institution names and locations are taken from NCES and saved as a csv to the data/
directory in this repo. These locations use a UNITID
which requires the crosswalk above.
2 institutions were given an unitid that has no entry in the location data