a kata to process a parquet dataset on k8s cluster using Volcano with a custom queue
steps:
- create the pvc and copy the dataset to it
- create configmap with script and its reuquirements
- create the dataset queue
- apply the job
kubectl apply -f ./manifests/pvc.yaml
then
bash copy.sh
kubectl create configmap extract-script --from-file=tasks/extract/extract_script.py --from-file=tasks/extract/requirements.txt
kubectl apply -f ./manifest/volcano/queue-dataset.yaml
kubectl apply -f ./manifest/volcano/job-extract-mnist.yaml