Guardian Connector Scripts Hub

This repository contains scripts, flows and apps to help communities guard and manage their land.

The code is intended to run on Windmill, a platform that can turn scripts into workflows and UIs. It empowers semi-technical users to access, edit and schedule code to run on a given interval without being overwhelmed by the usual barriers to entry (git, IDE, local environments, secrets managements, etc).

🌱 Read a blog post about Windmill is being used in this repository for supporting Indigenous communities

Available scripts, flows, and apps

Some of the tools available in the Guardian Connector Scripts Hub are:

Connector scripts to ingest data from data collection or annotation tools such as KoboToolbox, ODK, CoMapeo, ArcGIS, Global Forest Watch, Timelapse, and Locus Map, and store this data (tabular and media attachments) in a data lake.
A flow to download and store GeoJSON and GeoTIFF change detection alerts, post these to a CoMapeo Archive Server API, and send a message to WhatsApp recipients via Twilio.
Scripts to export data from a database into a specific format (e.g., GeoJSON).
An app to import and transform datasets from a variety of file formats and sources into a PostgreSQL database.

A Windmill Workspace populated with some of the tools in this repository.

User Roles

In Windmill, there are a number of roles that can be assigned to users, each with different permissions and responsibilities.

Windmill has an official documentation page that covers the core concepts of roles and permissions in Windmill.

For a Windmill workspace, you will want at least one admin user to manage the workspace and configurations. However, you can also set up operator users with a more limited set of permissions to execute scripts and monitor their progress.

By default, Windmill operators are able to run scripts, set schedules, and create resources and variables. We recommend turning off the ability to create resources and variables for operators, and only allowing them to execute scripts and monitor their progress.

To do this, when logged in as an admin you can navigate to the Workspace Settings page, and under the Users tab, you can modify the Operator Settings. Disable all settings except Runs and Schedules:

Lastly, for operator users to have access to all of the Folders containing the workspace scripts, flows, and apps (e.g. the subdirectories in the f/ directory), you will need to add a group to which the operator user belongs to all of the folders containing these like export and connectors. For example, you could add the default g/all group to all of the folders containing these, as all users are members of this group by default. You can do this in the Folders & Groups page in the Windmill web app:

Deploying the code to Windmill workspaces

Install the Windmill CLI, and set it up to talk to your deployed Workspace.

Then push the code in this Git repo to your workspace:

wmill sync push --skip-variables

Folders named ./tests/ are excluded by wmill.yml from syncing to Windmill — because otherwise Windmill tries to make the tests (as with ALL python files) into bona-fide Windmill scripts.

This repo also provides a shell script to batch push changes to a number of workspaces at once. To use this, set the WORKSPACES environment variable with a list of workspace names. You can do this directly in the command line:

   WORKSPACES=gc-windmill,gc-testing-server bin/push.sh

Alternatively, use a subshell to load a WORKSPACES variable from a .env file without affecting your current shell environment:

   (set -a; source .env; set +a; bin/push.sh)

Adding custom resource types

At times, this repository may contain one or more custom resource types that are used by a number of the scripts in this repository. However, we commit to adding all of our resource types to Windmill Hub.

There is no way to sync custom resource types to a Windmill workspace, so you will need to manually add them. The easiest way to do this is to paste them in to the JSON editor when creating a new resource type.

Development

Developing scripts

In Windmill, scripts can be written in Python, TypeScript, Go, and a number of other languages. Flows and apps can be built through the Windmill UI.

For information on developing scripts, see the Windmill Scripts quickstart.

You may develop within Windmill's code editor, or locally. Developing locally has the advantage of being able to run tests.

For users of Visual Studio Code, there is an extension that allows you to code locally, while testing your code in your Windmill workspace. See Windmill's local development documentation for more information.

Organization of the code

The f/ directory is designated for storing code in a workspace folder, and will be used when synchronizing the contents of this repository with a server.

Within the f/ directory, we store code in directories that represent a specific set of tasks. For example, the f/connectors/ directory contains scripts for data ETL and pipelining tasks.

Note that Windmill also designates a u/ directory for storing code per user on a workspace. We are not using this convention in this repository. See Windmill's local development guide for more information on these directories and how they are synchronized with a server.

Coding Standards

To maintain consistency across the repository, please follow these conventions when developing scripts:

Language and Dependencies

While Windmill supports multiple languages (Python, TypeScript, Go, etc.), for ETL-related workflows, use Python to maintain consistency across the repository.
Use Python 3.11 when setting dependencies in «name».script.lock and requirements-test.txt files, as it is the latest stable version we use for our Windmill instances and in CircleCI.

Script Metadata

Add a clear summary for each script in the «name».script.yaml file.
Add descriptions for all parameters in the «name».script.yaml file. These descriptions surface in the Windmill UI and help users understand how to configure scripts.

Testing Requirements

Always include end-to-end tests that call the script's main function.
For scripts that interact with external servers (e.g. APIs), define a mock server in tests/conftest.py with sample responses in tests/assets/server_responses.py.
Place all other test assets (sample files, fixtures, etc.) in tests/assets/.

Code Organization

Place reusable code in an appropriate module within common_logic/ rather than duplicating code across scripts.
Always place the main function first in each script file, above helper functions.
Use consistent import ordering and structure.

Documentation

Always add docstrings to functions and modules:
- Use NumPy format for Python docstrings
- Use JSDoc for JavaScript/TypeScript
Add information about your script to the folder's README.md, including:
- What the script does and when to use it
- Where to find required parameters (access tokens, hash IDs, API endpoints, etc.); screenshots can be helpful and are welcome
- Any special setup or configuration requirements
- Examples of usage where appropriate

Filing Issues and Pull Requests

For issues about specific scripts, flows, or apps, prefix your issue title with [«name»] in order to enhance searchability. You are also encouraged to add a tag for the specific folder where the code is located, like connectors or common_logic.

When in Doubt

Refer to existing scripts in the repository as examples of these conventions in practice.
Ask the maintainers of this repository for guidance.

Syncing Changes

If you developed on the server, sync your remote changes into Git version control once done:

wmill sync pull --skip-variables  # optionally add --raw to clobber your local repo
# TODO: git add, commit, etc

Tests

Running tests with the `tox` test runner

Make sure you have Python installed locally or create a virtual environment for your test runner.

Install the following dependencies:

pip install 'tox>=4.15.0,<5' 'tox-docker~=5.0'

Then you can run all tests using tox:

tox

If you are using homebrew you might have better luck using pipx or uvx to manage the virtual environment for the test runner:

# Install test runner and run test in one go
uvx --with tox-docker tox

The tox-docker test runner dependency is only required to run the alerts test suite; others can run with only tox.

Individual Test Suites

Each script in gc-scripts-hub defines its own virtual env according to the «scriptname».script.lock file that Windmill creates. This means that to test, you'll need to have Windmill create it, either by running the script being tested at least once in Windmill itself, or using the CLI:

wmill script generate-metadata

For more about how Windmill chooses the package dependencies to go in these metadata/lock files, read https://www.windmill.dev/docs/advanced/imports#imports-in-python

It also means that tox creates different environments for each "folder" of scripts. To run tests for only one folder, specify the folder as an -e «environment» CLI arg:

tox -e alerts

Note that the versions of package dependencies must be the same for scripts across a tox environment, or you will get an error about conflicting dependencies.

Running Windmill

You may get a free cloud workspace, or self-host.

We recommend self-hosting using CapRover and its Windmill one-click-app.

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
.circleci		.circleci
.github		.github
bin		bin
docs		docs
f		f
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTING_EXAMPLES.md		CONTRIBUTING_EXAMPLES.md
CONTRIBUTING_LLM_GUIDELINES.md		CONTRIBUTING_LLM_GUIDELINES.md
LICENSE		LICENSE
README.md		README.md
tox.ini		tox.ini
wmill.yaml		wmill.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Guardian Connector Scripts Hub

Available scripts, flows, and apps

User Roles

Deploying the code to Windmill workspaces

Adding custom resource types

Development

Developing scripts

Organization of the code

Coding Standards

Language and Dependencies

Script Metadata

Testing Requirements

Code Organization

Documentation

Filing Issues and Pull Requests

When in Doubt

Syncing Changes

Tests

Running tests with the `tox` test runner

Individual Test Suites

Running Windmill

About

Uh oh!

Uh oh!

Contributors 4

Languages

License

ConservationMetrics/gc-scripts-hub

Folders and files

Latest commit

History

Repository files navigation

Guardian Connector Scripts Hub

Available scripts, flows, and apps

User Roles

Deploying the code to Windmill workspaces

Adding custom resource types

Development

Developing scripts

Organization of the code

Coding Standards

Language and Dependencies

Script Metadata

Testing Requirements

Code Organization

Documentation

Filing Issues and Pull Requests

When in Doubt

Syncing Changes

Tests

Running tests with the tox test runner

Individual Test Suites

Running Windmill

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors 4

Languages

Running tests with the `tox` test runner