Building Data Utilities Workflow
Given a list of addresses, this workflow will automatically go through the following steps:
- Normalize each address - address consists of (at least) street, city, state. Could contain postal_code and country.
- Geocode each address via Amazon Location Services to a lat/long coordinate
- Download the Microsoft Building Footprints for all areas encompassed by the geocoded coordinates
- Find the footprint that intersects (or is closest to) each geocoded coordinate
- Generate the UBID for each footprint
- Export the resulting data as csv and GeoJSON
-
Optionally create a Virtualenv Environment
-
Dependencies are managed through Poetry, install with
pip install poetry -
Install dependencies with
poetry install -
Create a
.envfile in the root with your Amazon Location Services API key in the following format. You will also need to specify the Amazon base url. If none is specified, the following will be used: https://places.geo.us-east-2.api.aws/v2. For NREL gateway, you will also need to specify an APP ID.AMAZON_API_KEY=XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX AMAZON_BASE_URL=https://places.geo.us-east-2.api.aws/v2 AMAZON_APP_ID=XXXXXXXXX
Note that if the env keys for AMAZON_API_KEY, AMAZON_BASE_URL, and AMAZON_APP_ID exist in your profile, then it use that over the .env file.
For NREL users using the rate-limited key, use the following as the AMAZON_BASE_URL: https://developer.nrel.gov/api/tada/amazon-location-service/places/v2
Due to the nature of this application, we are passing IntendedUse=Storage to the Amazon Location Services API. This results in a slightly higher rate per transaction, but allows us to store the results.
-
Create a
locations.jsonfile in the root containing a list of addresses to process in the format:[ { "street": "100 W 14th Ave Pkwy", "city": "Denver", "state": "CO" }, { "street": "200 E Colfax Ave", "city": "Denver", "state": "CO" }, { "street": "320 W Colfax Ave", "city": "Denver", "state": "CO" } ]
- Run the workflow with
python main.pyorpoetry run python main.py - The results will be saved to
./data/covered-buildings.csvand./data/covered-buildings.geojson. Example of these files are in thetests/datadirectory as well.
- This workflow is optimized to be self-updating, and only downloads quadkeys and quadkey dataset-links if they haven't previously been downloaded or if an update is available
- Possible next steps:
- Cache geocoding results (if allowed) to avoid API limit penalties when re-running
- Allow other geocoders like Google, without persisting the geocoding results
- Add distance from geocoded result to footprint boundary,
proximity_to_geocoding_coord(intersections would be 0)
When using this tool with the Amazon Location Services geocoding API (or any other geocoder) always confirm that the terms of service allow for using and storing geocoding results. For Amazon Location Services, passing the IntendedUse=Storage parameter to each query allows the user to store the geocoding results.