Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WOW Scheduling strategy #13

Merged
merged 458 commits into from
Mar 13, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
458 commits
Select commit Hold shift + click to select a range
495f9bc
Updated versions
Lehmann-Fabian May 4, 2022
5a413a3
FilePath getter + store triple
Lehmann-Fabian May 4, 2022
c8c4917
sortTaskInputs
Lehmann-Fabian May 4, 2022
1bfad9b
Use AlignmentWrapper
Lehmann-Fabian May 4, 2022
8126223
Location and not String as key in alignmentmap
Lehmann-Fabian May 4, 2022
d961263
Fixed Bug: Format error in ftp.py
Lehmann-Fabian May 4, 2022
bd321d6
Added cost functions
Lehmann-Fabian May 4, 2022
055e87a
Added Greedy Alignment
Lehmann-Fabian May 4, 2022
dd218ea
Use Greedy
Lehmann-Fabian May 4, 2022
ce66777
CanSchedule method for node
Lehmann-Fabian May 4, 2022
bae3c06
Reuse getAvailable nodes for validSchedulerPlan
Lehmann-Fabian May 4, 2022
396f07e
Prefer current node for files
Lehmann-Fabian May 4, 2022
421a011
Fixed problem current node was not recognized.
Lehmann-Fabian May 4, 2022
c2fb7c4
Refactored SchedulerWithDaemonSet
Lehmann-Fabian May 4, 2022
2350dfa
Trace location aware scheduling
Lehmann-Fabian May 5, 2022
9307b14
Cost function configurable
Lehmann-Fabian May 5, 2022
9f78db5
Fixed potential bug
Lehmann-Fabian May 5, 2022
609b843
use computeIfAbsent
Lehmann-Fabian May 5, 2022
feaac48
Refactoring
Lehmann-Fabian May 5, 2022
8967d92
Logging instead of printStackTrace
Lehmann-Fabian May 5, 2022
f4ca274
Search for best alignment
Lehmann-Fabian May 6, 2022
b9eff4b
Refactored code
Lehmann-Fabian May 6, 2022
865f263
Refactored ftp.py script v1
Lehmann-Fabian May 6, 2022
4e107c4
Refactored ftp.py script v2
Lehmann-Fabian May 6, 2022
be3cfc7
Refactored ftp.py script V3
Lehmann-Fabian May 9, 2022
1eb0324
Download in parallel
Lehmann-Fabian May 10, 2022
cc6fe73
Reordered imports
Lehmann-Fabian May 10, 2022
777721e
Do not always request daemonset
Lehmann-Fabian May 10, 2022
d709427
Log when step was finished
Lehmann-Fabian May 10, 2022
e7dcf18
Trace init runtimes
Lehmann-Fabian May 10, 2022
d33ec8f
Sort downloads by size desc
Lehmann-Fabian May 10, 2022
664192b
Trace throughput
Lehmann-Fabian May 10, 2022
1de8c87
Don't log config
Lehmann-Fabian May 10, 2022
db729e3
exitIfFileWasNotFound
Lehmann-Fabian May 10, 2022
d7a6689
copying to Node handling more efficient
Lehmann-Fabian May 11, 2022
57d5219
Consider currently copied files in the scheduling decision
Lehmann-Fabian May 11, 2022
0c8ea7a
Code style
Lehmann-Fabian May 11, 2022
1fb67e9
Create alignment for all unscheduled tasks in separate method
Lehmann-Fabian May 11, 2022
dbb8741
Consider files fetched by tasks in the same batch
Lehmann-Fabian May 11, 2022
6c2fc76
Nullcheck
Lehmann-Fabian May 20, 2022
d9dc2ac
Log place in queue correctly
Lehmann-Fabian May 20, 2022
b3e0d97
transfer workDir via TaskConfig
Lehmann-Fabian May 20, 2022
1895fd1
unscheduledTasks as Queue, re-add tasks if best node is not available…
Lehmann-Fabian May 24, 2022
ca00eb3
RandomAlignment filter files on Node
Lehmann-Fabian May 24, 2022
5abdc4b
Trace only if tracing is activated
Lehmann-Fabian May 30, 2022
d313238
Remove nodes without enough resources before scheduling
Lehmann-Fabian May 31, 2022
4d1adcd
Enable coverage reports in SonarQube (#5)
srnbckr Jun 1, 2022
4cbea1f
Rewrote writeInitConfig method
Lehmann-Fabian Jun 7, 2022
52fccfd
Merge remote-tracking branch 'origin/master'
Lehmann-Fabian Jun 7, 2022
d6aba71
Receive OutLabel
Lehmann-Fabian Jun 14, 2022
3bb9593
Enable FTP Writes
Lehmann-Fabian Jun 16, 2022
eb448c2
Consider OutLabels for scheduling
Lehmann-Fabian Jun 20, 2022
623867f
Remove unnecessary double casting
Lehmann-Fabian Jun 20, 2022
54a981d
Moved OutLabel classes
Lehmann-Fabian Jun 21, 2022
4ed0adb
Changed package
Lehmann-Fabian Jun 21, 2022
8c3531b
Fix Bug stalemate overwrote node with output label
Lehmann-Fabian Jun 21, 2022
bfedd71
Refactor OutLabel holder. Cache results.
Lehmann-Fabian Jun 21, 2022
f4c172e
Adjusted ftp config to allow uploads from Nextflow
Lehmann-Fabian Jun 21, 2022
4ebf685
Fix do not access empty list in TaskData.calc
Lehmann-Fabian Jun 22, 2022
5e4a029
Code Style: brackets after if statement
Lehmann-Fabian Jun 22, 2022
ef519f3
Use root account to avoid access problems
Lehmann-Fabian Jun 29, 2022
02a8192
Work with new infiles format
Lehmann-Fabian Jul 1, 2022
5d66174
Work with new data format
Lehmann-Fabian Jul 22, 2022
aee9e7a
Trace more
Lehmann-Fabian Jul 22, 2022
9e808db
Sync addToCopyingToNode
Lehmann-Fabian Jul 22, 2022
4d45078
Fix wrong byte number on node
Lehmann-Fabian Jul 22, 2022
cac19e7
Changed log level
Lehmann-Fabian Jul 22, 2022
d27ae8e
Deal with init failures
Lehmann-Fabian Aug 26, 2022
dc7a416
Make ftp script to use python3 default
Lehmann-Fabian Aug 26, 2022
f3c66dd
Create an empty script if nothing to copy
Lehmann-Fabian Aug 26, 2022
0ecc4cb
Don't fail when .command.outfiles does not exist
Lehmann-Fabian Sep 12, 2022
5e047dc
Check if symlink is already correct and do not delete in this case.
Lehmann-Fabian Sep 12, 2022
401e2d1
Do not log "Wait for %d threads to finish" continuously
Lehmann-Fabian Sep 12, 2022
60301cf
formatting
Lehmann-Fabian Sep 12, 2022
5099861
New stalemate function
Lehmann-Fabian Sep 12, 2022
0733170
Method declaration not commited
Lehmann-Fabian Sep 22, 2022
30d03fe
Added Swagger UI
Lehmann-Fabian Sep 22, 2022
4d55def
Method to delete edges and vertices from DAG
Lehmann-Fabian Sep 29, 2022
1489cae
Adjust Scheduler to API Paper
Lehmann-Fabian Sep 29, 2022
042dc0d
Added comment and normalized indent
Lehmann-Fabian Oct 5, 2022
9d84b1e
Return for null label directly
Lehmann-Fabian Oct 5, 2022
7e628b5
Sanitize wrong DNS
Lehmann-Fabian Oct 5, 2022
beaa040
Sanitize wrong DNS 2
Lehmann-Fabian Oct 5, 2022
c78b395
Do not check for still valid nodes twice
Lehmann-Fabian Oct 7, 2022
59cab43
Do not check for still valid nodes twice comment
Lehmann-Fabian Oct 7, 2022
7a97324
Only start X copy tasks per node
Lehmann-Fabian Oct 7, 2022
ab0ac32
Submit maxCopyTasksPerNode to the scheduler
Lehmann-Fabian Oct 7, 2022
1002fce
Calculate Task Rank
Lehmann-Fabian Oct 25, 2022
e95779a
Added different prioritization and assignment strategies
Lehmann-Fabian Nov 1, 2022
d1e7351
changed visibility
Lehmann-Fabian Nov 1, 2022
d6bac29
Use Id to number tasks
Lehmann-Fabian Nov 28, 2022
ecd3698
New Rank strategies
Lehmann-Fabian Nov 28, 2022
e424850
Sychronize tryToScheduleBatch
Lehmann-Fabian Nov 28, 2022
da7c12d
Inefficient map iteration
Lehmann-Fabian Nov 28, 2022
a56a778
Rank by size callable about REST
Lehmann-Fabian Dec 16, 2022
dedc0cd
added REST documentation
Lehmann-Fabian Dec 16, 2022
22b2d3c
Readded imports
Lehmann-Fabian Dec 16, 2022
40c0509
Update Fabric8 version
Lehmann-Fabian Dec 19, 2022
b6d560c
Removed unused initContainer
Lehmann-Fabian Dec 19, 2022
e825df8
Fix problem: modified pod is two times added
Lehmann-Fabian Dec 19, 2022
16c19e8
Fix problem: OutOfBound if pod has no init container
Lehmann-Fabian Dec 19, 2022
77ed172
First version of separate copy tasks (LAv2)
Lehmann-Fabian Dec 23, 2022
8ae21a8
First version of separate copy tasks (LAv2)
Lehmann-Fabian Dec 23, 2022
96903c8
First version of separate copy tasks (LAv2)
Lehmann-Fabian Dec 23, 2022
cb0344c
Added optimalAlignment
Lehmann-Fabian Jan 3, 2023
1cc1d84
Removed unused imports
Lehmann-Fabian Jan 3, 2023
596a9d3
Code improvements
Lehmann-Fabian Jan 3, 2023
0a50067
Refactoring: extracted copy strategy, package for la2 extensions
Lehmann-Fabian Jan 3, 2023
dc37414
added a space
Lehmann-Fabian Jan 3, 2023
8d7bb03
synchronized toString
Lehmann-Fabian Jan 3, 2023
c005be2
Store the copy tasks and consider them
Lehmann-Fabian Jan 3, 2023
c7f9b0e
Fix problem that Node's hash changed
Lehmann-Fabian Jan 10, 2023
6d56b2d
Path into variable
Lehmann-Fabian Jan 10, 2023
9a53c03
added comment
Lehmann-Fabian Jan 10, 2023
323d5ee
Check affinities and that the node has generally enough resources
Lehmann-Fabian Jan 10, 2023
b20131d
Make Requirements clonable
Lehmann-Fabian Jan 10, 2023
57c2c3b
SortedList + test
Lehmann-Fabian Jan 10, 2023
a768aeb
added GetMinSizeInBytes to get more realistic prediction
Lehmann-Fabian Jan 10, 2023
93b3ea6
Calculate the missing data for TaskInputs
Lehmann-Fabian Jan 10, 2023
f1896da
added second scheduling phase
Lehmann-Fabian Jan 10, 2023
66dd040
removed unused imports
Lehmann-Fabian Jan 10, 2023
6b226d6
Use Lombok for constructors
Lehmann-Fabian Jan 10, 2023
3bd2c1f
Removed empty lines
Lehmann-Fabian Jan 10, 2023
86b3f58
Removed unused method
Lehmann-Fabian Jan 10, 2023
0d1f678
Weight the individual and the overall cost
Lehmann-Fabian Jan 10, 2023
ba90baa
Weight the individual and the overall cost moved into GreedyAlignment
Lehmann-Fabian Jan 12, 2023
9f339bc
Use interface instead of concrete class for class instance
Lehmann-Fabian Jan 12, 2023
f94a3b9
log available resources only once
Lehmann-Fabian Jan 13, 2023
e0cbbc8
Compare whether the task is already copied somewhere.
Lehmann-Fabian Jan 13, 2023
c36608d
use currentlyCopyingTasksOnNode as method parameter
Lehmann-Fabian Jan 13, 2023
ef6b0bf
Ignore tasks in phase 3 that are currently copied to a node where the…
Lehmann-Fabian Jan 13, 2023
90d015c
Ignore tasks that have already all data on a node.
Lehmann-Fabian Jan 13, 2023
aef0704
Do not filter nodes that cannot yet start a new copy task
Lehmann-Fabian Jan 13, 2023
bc5ab54
extract methods in SimpleCapacityAvailableToNode into super class to …
Lehmann-Fabian Jan 13, 2023
9a6c63d
Make comparator of TaskStats configurable
Lehmann-Fabian Jan 13, 2023
bc4cd87
EntrySet instead of KeySet
Lehmann-Fabian Jan 17, 2023
b08f268
Do not throw an exception if the model is invalid
Lehmann-Fabian Jan 17, 2023
7b890a1
Store config json file in pvc
Lehmann-Fabian Jan 17, 2023
5bda348
Log error to console
Lehmann-Fabian Jan 17, 2023
ffd6278
Simplified code
Lehmann-Fabian Jan 17, 2023
a782c0c
Avoid to large numbers
Lehmann-Fabian Jan 17, 2023
c8ee4ce
More informative logging
Lehmann-Fabian Jan 17, 2023
8631c41
New comparators
Lehmann-Fabian Jan 17, 2023
523cae9
First version of phase 3
Lehmann-Fabian Jan 17, 2023
89b0937
Make maxHeldReady configurable
Lehmann-Fabian Jan 17, 2023
f8ad33e
Removed unused calculateBestNode method
Lehmann-Fabian Jan 17, 2023
2b2566e
Do not filter tasks that are already ready to run on different nodes …
Lehmann-Fabian Jan 19, 2023
43985ea
Added some prio to the ftp's download speed
Lehmann-Fabian Jan 19, 2023
ccfa802
Only process ready nodes
Lehmann-Fabian Mar 22, 2023
be52a87
Do not crash because of symlink creation
Lehmann-Fabian Mar 22, 2023
662a392
More precise Origin.addInbound Exception
Lehmann-Fabian Mar 22, 2023
4187339
Handle DeadlineExceeded
Lehmann-Fabian Mar 22, 2023
7ce83ed
Change value
Lehmann-Fabian Mar 22, 2023
5d563ea
update ftp.py
Lehmann-Fabian Mar 22, 2023
f64244f
Renamed folder
Lehmann-Fabian Mar 22, 2023
6731887
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Mar 22, 2023
0a94d1b
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Mar 22, 2023
654bdef
Remove old imports
Lehmann-Fabian Apr 4, 2023
837905a
IDs and createtime final
Lehmann-Fabian Apr 4, 2023
c436db1
Fix check weightForIndividualNode
Lehmann-Fabian Apr 4, 2023
a2dff1e
Fix warnings in InputAlignmentClass
Lehmann-Fabian Apr 4, 2023
ca18565
Remove old imports
Lehmann-Fabian Apr 4, 2023
7814b14
Fix warnings in SchedulerWithDaemonSet
Lehmann-Fabian Apr 4, 2023
d39e8b6
Fix warnings in DAG
Lehmann-Fabian Apr 4, 2023
5a24a1f
Fix warnings in Vertex
Lehmann-Fabian Apr 4, 2023
451cece
Fix warnings in Process
Lehmann-Fabian Apr 4, 2023
cdeb800
Fix warnings in Origin
Lehmann-Fabian Apr 4, 2023
a80eb12
Fix spelling
Lehmann-Fabian Apr 4, 2023
d37e42c
Fix warnings in TaskInputs
Lehmann-Fabian Apr 4, 2023
8462f7e
Fix warnings in RealHierachyFile
Lehmann-Fabian Apr 4, 2023
17ec05c
Fix warnings in LocationWrapper
Lehmann-Fabian Apr 4, 2023
2dab2b6
Fix warnings in TaskState
Lehmann-Fabian Apr 4, 2023
307df8f
Fix spelling
Lehmann-Fabian Apr 4, 2023
d11d0d5
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Apr 5, 2023
43fe5d8
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Apr 5, 2023
7b62251
add imports
Lehmann-Fabian Apr 5, 2023
c9e23bc
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Apr 6, 2023
d239724
New CopyInAdvanced Strategy
Lehmann-Fabian May 15, 2023
a6d250e
Do not copy the same task everywhere
Lehmann-Fabian May 15, 2023
416d7f2
Make Zero, task and node requirements immutable
Lehmann-Fabian Jul 17, 2023
f23a0b9
Added a testclass for Requirements
Lehmann-Fabian Jul 17, 2023
df4d3f9
Spelling
Lehmann-Fabian Jul 19, 2023
bf7c05c
Improved sorted list using binarySearch
Lehmann-Fabian Jul 19, 2023
36f2534
Added java docs
Lehmann-Fabian Jul 20, 2023
b1b202a
Changed CopyInAdvance Interface
Lehmann-Fabian Jul 20, 2023
fb1e739
Added schedulerTimeDeltaPhaseThree to the TraceRecord
Lehmann-Fabian Jul 20, 2023
5142b69
Added readyTasksPerNode in LAV2
Lehmann-Fabian Jul 20, 2023
a7cf6f3
First version of a more intelligent strategy for phase 3
Lehmann-Fabian Jul 20, 2023
0fc1089
Reduce calculations in loop
Lehmann-Fabian Jul 20, 2023
e22a8f7
Refactored calculateScore to use it also in other places
Lehmann-Fabian Jul 20, 2023
074b7e0
Force NoArgsConstructor
Lehmann-Fabian Jul 21, 2023
4a494cd
Task.getInputSize() also for LA scheduling
Lehmann-Fabian Jul 21, 2023
07ce88a
Use Lombok for equalsAndHash
Lehmann-Fabian Jul 21, 2023
e9dab4c
Store how often the task was copied
Lehmann-Fabian Jul 25, 2023
af48784
Log time of post scheduling
Lehmann-Fabian Jul 25, 2023
bd61407
Log time of post scheduling
Lehmann-Fabian Jul 25, 2023
629442d
Filter tasks not in LA2
Lehmann-Fabian Jul 25, 2023
97cd90c
Minimize jar to save storage
Lehmann-Fabian Jul 25, 2023
aa1d21c
Added new Constructor and added atLeastOneBigger
Lehmann-Fabian Jul 25, 2023
90c7d47
Switch back to normal CopyInAdvance
Lehmann-Fabian Jul 25, 2023
63f862a
Change CopyInAdvanceMoreIntelligent
Lehmann-Fabian Jul 25, 2023
1a1064e
prefer two tasks which sum up to the same score otherwise
Lehmann-Fabian Jul 25, 2023
247f562
Filter tasks that can not start on any node
Lehmann-Fabian Oct 6, 2023
badbfb3
Change Watcher to Handler
Lehmann-Fabian Oct 6, 2023
8e9fb91
Change Watcher to Handler in Scheduler
Lehmann-Fabian Oct 6, 2023
2d033b2
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Oct 6, 2023
c16cb22
Remove unused imports
Lehmann-Fabian Oct 7, 2023
d778be8
Remove unused imports
Lehmann-Fabian Oct 7, 2023
c2165cd
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Oct 11, 2023
731d401
OptimalReadyToRunToNode optimizer sops after 10 seconds
Lehmann-Fabian Oct 11, 2023
5cfb40d
Update versions
Lehmann-Fabian Oct 28, 2023
d84c9b9
Added comment
Lehmann-Fabian Oct 28, 2023
f4e9240
Merge remote-tracking branch 'upstream/master' into la2
Lehmann-Fabian Oct 28, 2023
fd30987
Filter null in post scheduling
Lehmann-Fabian Jan 20, 2024
3787888
Removed unused code and rewrote outLabel
Lehmann-Fabian Jun 21, 2024
6035063
Prepare Task grouping
Lehmann-Fabian Jun 21, 2024
2f1c09f
Move affinityMatch method from Scheduler to Node
Lehmann-Fabian Jun 25, 2024
cf7a01a
change signature of calculateScore to also consider the node
Lehmann-Fabian Jul 1, 2024
dc61f33
Make outLabels a set
Lehmann-Fabian Jul 2, 2024
da73c70
Refactor LAV2 Scheduler to set phaseTwo and phaseThreeComperator by i…
Lehmann-Fabian Jul 3, 2024
5c305eb
Make new LAGroups scheduler usable
Lehmann-Fabian Jul 3, 2024
2b5621e
First version of OutLabel aware scheduling
Lehmann-Fabian Jul 8, 2024
be8e0d7
Fix indent
Lehmann-Fabian Jul 8, 2024
034bc01
Refactor to hide excludedNodes variable
Lehmann-Fabian Jul 8, 2024
1b67349
Create Copy Tasks for tasks that ran on a node that is not responsibl…
Lehmann-Fabian Jul 18, 2024
544ff59
Add documentation for Scheduler.java
Lehmann-Fabian Nov 6, 2024
e96c23c
Add a class comment to TaskOnNodeWrapper.java
Lehmann-Fabian Nov 6, 2024
4518b6e
Comment OutputFiles.java
Lehmann-Fabian Nov 6, 2024
140eff3
Comment DataMissing.java
Lehmann-Fabian Nov 6, 2024
f76693d
Comment FakeTaskInputs.java
Lehmann-Fabian Nov 6, 2024
ae91bb5
Optimize Imports
Lehmann-Fabian Nov 6, 2024
de5fda5
Refactor and comment GroupCluster and LabelCount
Lehmann-Fabian Nov 6, 2024
f1a506a
Fix problem that the node of the pod with the workflow engine was not…
Lehmann-Fabian Feb 14, 2025
ef1cdaa
Fix not all pods have an ip
Lehmann-Fabian Feb 14, 2025
f715e7a
Update to Java 21 + improve Dockerfile
Lehmann-Fabian Mar 5, 2025
e915691
Update Java version to 21
Lehmann-Fabian Mar 5, 2025
1d642b8
Only test Java 21
Lehmann-Fabian Mar 5, 2025
7eea10e
Add warnings if execution cannot be registered
Lehmann-Fabian Mar 6, 2025
3575012
Reformat comments
Lehmann-Fabian Mar 6, 2025
2d013c1
improve comment
Lehmann-Fabian Mar 6, 2025
18c5347
Merge remote-tracking branch 'upstream/master' into wow-ponder
Lehmann-Fabian Mar 12, 2025
6c6c399
Merge branch 'updates' into wow-ponder
Lehmann-Fabian Mar 12, 2025
8a755d4
Convert Tests to Junit 5
Lehmann-Fabian Mar 12, 2025
21d7c6c
Fix VertexDeserializerTest
Lehmann-Fabian Mar 12, 2025
784df49
Read spring-boot-maven-plugin
Lehmann-Fabian Mar 12, 2025
7cafa0e
update OR-Tools to 9.12.4544
Lehmann-Fabian Mar 12, 2025
ce39b75
Change naming to WOW
Lehmann-Fabian Mar 13, 2025
073f971
Optimize Imports
Lehmann-Fabian Mar 13, 2025
e547e01
Reorder attributes
Lehmann-Fabian Mar 13, 2025
a02b84b
Add WOW to Readme
Lehmann-Fabian Mar 13, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions .github/workflows/workflow.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ jobs:
strategy:
fail-fast: false
matrix:
java_version: [ 17, 21 ]
java_version: [ 21 ]
steps:
- name: Checkout
uses: actions/checkout@v3
Expand All @@ -32,10 +32,10 @@ jobs:
steps:
- name: Checkout
uses: actions/checkout@v3
- name: Set up JDK 17
- name: Set up JDK 21
uses: actions/setup-java@v3
with:
java-version: 17
java-version: 21
distribution: 'temurin'
- name: Set outputs
id: vars
Expand Down
38 changes: 21 additions & 17 deletions Dockerfile
Original file line number Diff line number Diff line change
@@ -1,18 +1,22 @@
FROM maven:3-openjdk-17-slim AS build
WORKDIR /build
COPY pom.xml pom.xml
RUN mvn dependency:go-offline --no-transfer-progress -Dmaven.repo.local=/mvn/.m2nrepo/repository
COPY src/ src/
RUN mvn package --no-transfer-progress -DskipTests -Dmaven.repo.local=/mvn/.m2nrepo/repository

#
# Package stage
#
FROM openjdk:17-alpine
FROM eclipse-temurin:21-jdk-jammy AS builder
WORKDIR /app
RUN addgroup -S javagroup && adduser -S javauser -G javagroup && mkdir data
COPY --from=build /build/target/cws-k8s-scheduler*.jar cws-k8s-scheduler.jar
RUN chown -R javauser:javagroup /app
USER javauser
EXPOSE 8080
ENTRYPOINT ["java","-jar","/app/cws-k8s-scheduler.jar"]

RUN apt-get update && apt-get install -y maven && rm -rf /var/lib/apt/lists/*

COPY pom.xml .
RUN mvn dependency:go-offline --no-transfer-progress

COPY src/ ./src/
RUN mvn package --no-transfer-progress -DskipTests

FROM eclipse-temurin:21-jre-jammy

WORKDIR /app

RUN apt-get update && apt-get install -y \
libglib2.0-0 \
&& rm -rf /var/lib/apt/lists/*

COPY --from=builder /app/target/cws-k8s-scheduler-*-SNAPSHOT.jar app.jar

CMD ["java", "-jar", "app.jar"]
2 changes: 1 addition & 1 deletion Dockerfile-development
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
FROM maven:3-openjdk-17-slim AS build
FROM maven:3-openjdk-18-slim AS build
WORKDIR /build
COPY pom.xml pom.xml
RUN mkdir data/ && mvn dependency:go-offline -B -Dmaven.repo.local=/mvn/.m2nrepo/repository
Expand Down
136 changes: 136 additions & 0 deletions Jenkinsfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,136 @@
pipeline {
agent {
kubernetes {
yamlFile 'jenkins-pod.yaml'
}
}
environment {
// creates DOCKERHUB_USR and DOCKERHUB_PSW env variables
DOCKERHUB = credentials('fondahub-dockerhub')
}

stages {
stage('Build') {
steps {
container('maven') {
// run a clean build without tests to see if the project compiles
sh 'mvn clean test-compile -DskipTests=true -Dmaven.javadoc.skip=true -B -V'
}
}
}

stage('Test') {
steps {
container('maven') {
// run JUnit tests
sh 'mvn test -B -V'
}
}
post {
// collect test results
always {
junit 'target/surefire-reports/TEST-*.xml'
jacoco classPattern: 'target/classes,target/test-classes', execPattern: 'target/coverage-reports/*.exec', inclusionPattern: '**/*.class', sourcePattern: 'src/main/java,src/test/java'
archiveArtifacts 'target/surefire-reports/TEST-*.xml'
archiveArtifacts 'target/*.exec'
}
}
}

stage('Package') {
steps {
container('maven') {
sh 'mvn package -DskipTests=true -Dmaven.javadoc.skip=true -B -V'
}
}
post {
success {
archiveArtifacts 'target/*.jar'
}
}
}

stage('Static Code Analysis') {
steps {
container('maven') {
withSonarQubeEnv('fonda-sonarqube') {
sh '''
mvn sonar:sonar -B -V -Dsonar.projectKey=workflow_k8s_scheduler \
-Dsonar.branch.name=$BRANCH_NAME -Dsonar.sources=src/main/java -Dsonar.tests=src/test/java \
-Dsonar.inclusions="**/*.java" -Dsonar.test.inclusions="src/test/java/**/*.java" \
-Dsonar.junit.reportPaths=target/surefire-reports
'''
}
}
}
}

stage('Build and push Docker') {
// only push images from the master branch
when {
branch "master"
}
// agents are specified per stage to enable real parallel execution
parallel {
stage('workflow-k8s-scheduler') {
agent {
kubernetes {
yamlFile 'jenkins-pod.yaml'
}
}
steps {
container('hadolint') {
sh "hadolint --format json Dockerfile | tee -a hadolint_scheduler.json"
}
// build and push image to fondahub/workflow-k8s-scheduler
container('docker') {
sh "echo $DOCKERHUB_PSW | docker login -u $DOCKERHUB_USR --password-stdin"
sh "docker build . -t fondahub/workflow-k8s-scheduler:${GIT_COMMIT[0..7]}"
sh "docker tag fondahub/workflow-k8s-scheduler:${GIT_COMMIT[0..7]} fondahub/workflow-k8s-scheduler:latest"
sh "docker push fondahub/workflow-k8s-scheduler:${GIT_COMMIT[0..7]}"
sh "docker push fondahub/workflow-k8s-scheduler:latest"
}
}
post {
always {
archiveArtifacts "hadolint_scheduler.json"
recordIssues(
aggregatingResults: true,
tools: [hadoLint(pattern: "hadolint_scheduler.json", id: "scheduler")]
)
}
}
}
stage('vsftpd') {
agent {
kubernetes {
yamlFile 'jenkins-pod.yaml'
}
}
steps {
container('hadolint') {
sh "hadolint --format json daemons/ftp/Dockerfile | tee -a hadolint_vsftpd.json"
}
// build and push image to fondahub/vsftpd
container('docker') {
sh "echo $DOCKERHUB_PSW | docker login -u $DOCKERHUB_USR --password-stdin"
sh "docker build daemons/ftp/ -t fondahub/vsftpd:${GIT_COMMIT[0..7]}"
sh "docker tag fondahub/vsftpd:${GIT_COMMIT[0..7]} fondahub/vsftpd:latest"
sh "docker push fondahub/vsftpd:${GIT_COMMIT[0..7]}"
sh "docker push fondahub/vsftpd:latest"
}
}
post {
always {
archiveArtifacts "hadolint_vsftpd.json"
recordIssues(
aggregatingResults: true,
tools: [hadoLint(pattern: "hadolint_vsftpd.json", id: "vsfptd")]
)
}
}
}
}
}
}
}
10 changes: 6 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,8 @@
# Common Workflow Scheduler for Kubernetes
[![DOI](https://zenodo.org/badge/596122315.svg)](https://zenodo.org/badge/latestdoi/596122315)
# Kubernetes Workflow Scheduler

In this repository, you will find the Common Workflow Scheduler for Kubernetes proposed in the paper "**How Workflow Engines Should Talk to Resource Managers: A Proposal for a Common Workflow Scheduling Interface**."
SWAGGER: http://localhost:8080/swagger-ui.html

API-DOCS: http://localhost:8080/v3/api-docs/

---
#### Build
Expand Down Expand Up @@ -188,6 +189,7 @@ The following strategies are available:
| random | Randomly prioritize tasks. |
| max | Prioritize tasks with larger input size. |
| min | Prioritize tasks with smaller input size. |
| wow | WOW scheduler for data location awareness. This is scheduling + node assignment. Details are provided in our paper [tbd](tbd). |

| Node Assignment Strategy | Behaviour |
|--------------------------|-----------------------------------------------------------------------------------------|
Expand Down Expand Up @@ -215,4 +217,4 @@ Lehmann Fabian, Jonathan Bader, Friedrich Tschirpke, Lauritz Thamsen, and Ulf Le
```
---
#### Acknowledgement:
This work was funded by the German Research Foundation (DFG), CRC 1404: "FONDA: Foundations of Workflows for Large-Scale Scientific Data Analysis."
This work was funded by the German Research Foundation (DFG), CRC 1404: "FONDA: Foundations of Workflows for Large-Scale Scientific Data Analysis."
17 changes: 17 additions & 0 deletions daemons/ftp/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
FROM python:slim
RUN apt update && apt install -y \
vsftpd \
&& rm -rf /var/lib/apt/lists/*

RUN mkdir -p /var/run/vsftpd/empty

COPY vsftpd.conf /etc/vsftpd.conf

USER root
RUN echo 'root:password' | chpasswd

COPY ftp.py /code/ftp.py

WORKDIR /code

ENTRYPOINT ["sh","-c","/usr/sbin/vsftpd /etc/vsftpd.conf"]
Loading