Skip to content

Commit 48944f4

Browse files
[RFC] Add initial version of the roadmap (#159)
Signed-off-by: terrytangyuan <[email protected]>
1 parent 39627b1 commit 48944f4

File tree

1 file changed

+24
-0
lines changed

1 file changed

+24
-0
lines changed

ROADMAP.md

+24
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
# Roadmap of MPI Operator
2+
3+
This document provides a high-level overview of where MPI Operator will grow in future releases. See discussions in the original RFC [here](https://github.com/kubeflow/mpi-operator/pull/159).
4+
5+
## New Features / Enhancements
6+
7+
* Decouple the tight dependency on Open MPI and support other collective communication frameworks.
8+
Related issue: [#12](https://github.com/kubeflow/mpi-operator/issues/12).
9+
* Support new versions of MPI Operator in [kubeflow/manifests](https://github.com/kubeflow/manifests).
10+
* Redesign different components of MPI Operator to support fault tolerant collective communication frameworks such as [caicloud/ftlib](https://github.com/caicloud/ftlib).
11+
* Allow more flexible RBAC when `MPIJob`s so existing RBAC resources can be reused. Related issue: [#20](https://github.com/kubeflow/mpi-operator/issues/20).
12+
* Support installation of MPI Operator via [Helm](https://github.com/helm/helm). Related issue: [#11](https://github.com/kubeflow/mpi-operator/issues/11).
13+
* Support [Go modules](https://blog.golang.org/migrating-to-go-modules).
14+
* Consider support launching framework-specific services such as [TensorBoard](https://www.tensorflow.org/tensorboard) and [Horovod Timeline](https://github.com/horovod/horovod#horovod-timeline). Since [tf-operator](https://github.com/kubeflow/tf-operator) already supports TensorBoard, we may want to consider moving this to [kubeflow/common](https://github.com/kubeflow/common) so it can be reused. Related issue: [#138](https://github.com/kubeflow/mpi-operator/issues/138).
15+
16+
## CI/CD
17+
18+
* Automate the process to publish images to Docker Hub whenever there's new release/commit. Related issue: [#93](https://github.com/kubeflow/mpi-operator/issues/93).
19+
* Ensure new versions of `deploy/mpi-operator.yaml` are always compatible with [kubeflow/manifests](https://github.com/kubeflow/manifests).
20+
* Add end-to-end tests via Kubeflow's testing infrastructure. Related issue: [#9](https://github.com/kubeflow/mpi-operator/issues/9).
21+
22+
## Bug Fixes
23+
24+
* Better statuses of launcher and worker pods. Related issues: [#90](https://github.com/kubeflow/mpi-operator/issues/90)

0 commit comments

Comments
 (0)