Skip to content

Commit cdf517f

Browse files
Bill Metangmotdunning
Bill Metangmo
authored andcommitted
Added liminal as data pipeline orchestrator for Airflow
1 parent 1fda528 commit cdf517f

File tree

4 files changed

+73
-0
lines changed

4 files changed

+73
-0
lines changed

pipeline/install.md

+21
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
2+
## Data Pipelines
3+
4+
### Prerequesites
5+
6+
* [liminal](https://github.com/apache/incubator-liminal)
7+
* a local kubernetes cluster
8+
9+
### Getting started
10+
11+
``` bash
12+
kubectl apply -f storage.yml
13+
14+
liminal build
15+
liminal deploy --clean
16+
# in case of issue see https://github.com/helm/charts/issues/23589
17+
liminal start
18+
```
19+
20+
Then go to http://localhost:8080 and launch the job.
21+

pipeline/liminal.yml

+38
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
---
2+
name: GettingStartedPipeline
3+
volumes:
4+
- volume: gettingstartedvol
5+
claim_name: gettingstartedvol-pvc
6+
local:
7+
path: .
8+
images:
9+
- image: mmrs-python
10+
type: python
11+
source: .
12+
pipelines:
13+
- pipeline: getting_started_pipeline
14+
owner: Bosco Albert Baracus
15+
start_date: 1970-01-01
16+
timeout_minutes: 10
17+
schedule: 0 * 1 * *
18+
default_arg_loaded: check
19+
default_array_loaded: [2, 3, 4]
20+
default_object_loaded:
21+
key1: val1
22+
key2: val2
23+
metrics:
24+
namespace: TestNamespace
25+
backends: [ ]
26+
tasks:
27+
- task: mmrs_download
28+
type: python
29+
description: download mmrs data
30+
image: mmrs-python
31+
env_vars:
32+
env1: "a"
33+
env2: "b"
34+
mounts:
35+
- mount: mymount
36+
volume: gettingstartedvol
37+
path: /mnt/gettingstartedvol
38+
cmd: python -u test_mrms_download.py

pipeline/requirements.txt

+2
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
pyarrow
2+
pandas

pipeline/storage.yml

+12
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
apiVersion: v1
2+
kind: PersistentVolumeClaim
3+
metadata:
4+
name: gettingstartedvol-pvc
5+
spec:
6+
accessModes:
7+
- ReadWriteOnce
8+
volumeMode: Filesystem
9+
resources:
10+
requests:
11+
storage: 5Gi
12+
storageClassName: hostpath

0 commit comments

Comments
 (0)