This project aims to ease some of the Oozie pains you might run into. When you have close to a thousand coordinators running and something goes wrong on the cluster, the last thing you want to do is trigger retries manually.
It may, however, serve better as an example of how to use the Oozie REST service for whatever needs you have.
For now it isn't particularly modular or generic, and it might change drastically.
- Look at each running coordinator and its actions and decide to rerun them based on some criteria.
- Look at constantly failing coordinators and after some time decide to suspend them.
- Look at suspended coordinators and if they have been suspended for x days kill them.
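
The three rules above can be sketched as a small decision policy. This is only an illustration and not the code this project actually uses: the class name `ZombiePolicy`, the `Decision` enum, and both thresholds are hypothetical, and the "some criteria" for a rerun is reduced here to "at least one recent failure". The request paths follow the job-management calls of the Oozie Web Services API (`action=coord-rerun`, `action=suspend`, `action=kill`) as I understand them; check them against the documentation for your Oozie version before relying on them.

```java
import java.util.Locale;

/** Hypothetical sketch of the rerun/suspend/kill rules; names and thresholds are assumptions. */
public class ZombiePolicy {

    public enum Decision { RERUN_ACTION, SUSPEND, KILL, NONE }

    private final int maxConsecutiveFailures; // assumed threshold for "constantly failing"
    private final int maxSuspendedDays;       // the "x days" from the list above

    public ZombiePolicy(int maxConsecutiveFailures, int maxSuspendedDays) {
        this.maxConsecutiveFailures = maxConsecutiveFailures;
        this.maxSuspendedDays = maxSuspendedDays;
    }

    /** Decides what to do with one coordinator based on its status and history. */
    public Decision decide(String coordStatus, int consecutiveFailures, int suspendedDays) {
        String status = coordStatus.toUpperCase(Locale.ROOT);
        if ("SUSPENDED".equals(status)) {
            // Suspended long enough: kill it, otherwise leave it alone.
            return suspendedDays >= maxSuspendedDays ? Decision.KILL : Decision.NONE;
        }
        if ("RUNNING".equals(status)) {
            if (consecutiveFailures >= maxConsecutiveFailures) {
                return Decision.SUSPEND;      // failing constantly: stop retrying
            }
            if (consecutiveFailures > 0) {
                return Decision.RERUN_ACTION; // failed recently: try again
            }
        }
        return Decision.NONE;
    }

    /** Builds the path and query string for the PUT request that carries out a decision. */
    public static String managementPath(String jobId, Decision decision, int actionNumber) {
        String base = "/oozie/v1/job/" + jobId;
        switch (decision) {
            case RERUN_ACTION:
                return base + "?action=coord-rerun&type=action&scope=" + actionNumber;
            case SUSPEND:
                return base + "?action=suspend";
            case KILL:
                return base + "?action=kill";
            default:
                throw new IllegalArgumentException("No request needed for " + decision);
        }
    }
}
```

Keeping the decision separate from the HTTP call means the rules can be tested without a cluster, and the thresholds can later be read from configuration instead of being hard-coded.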
Requirements: Java 1.7+ and Maven 3.2+.
mvn package spring-boot:repackage
You now have a jar in the target directory that you can run directly:
java -jar target/OozieZombie-<version>.jar
nohup java -Xmx64m -jar OozieZombie-<version>.jar &
If you want to change the configuration on the machine you run this on, create a file called application.properties and add your properties there.