Details
- Type: Sub-task
- Status: Resolved
- Priority: Major
- Resolution: Duplicate
Description
As a PoC, the following steps are needed to run a TF job on k8s:
- The submarine server can run as an independent daemon.
- tf-operator is deployed in a k8s cluster. Use kubeflow/tf-operator for now.
- A REST API in the submarine server accepts a generic TF job spec.
- submitter-k8s translates the spec into the tf-operator format and uses the k8s client API to submit it to the cluster.
- A REST API gets the TF job status from k8s and parses it into a standard format for consumers.
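
The translation and status-parsing steps above could look roughly like the sketch below. It is illustrative only, not the submitter-k8s implementation: the generic spec fields (`name`, `namespace`, `image`, `command`, `replicas`), the helper names `to_tfjob` and `to_standard_status`, and the output status format are all hypothetical. It also assumes the tf-operator version in use serves `TFJob` under `kubeflow.org/v1`; older releases use a different API version.

```python
from typing import Any, Dict

# Hypothetical generic job spec; the field names are illustrative only.
GENERIC_SPEC = {
    "name": "mnist",
    "namespace": "default",
    "image": "example/tf-mnist:latest",
    "command": ["python", "/app/train.py"],
    "replicas": {"Worker": 2, "PS": 1},
}


def to_tfjob(spec: Dict[str, Any]) -> Dict[str, Any]:
    """Translate the generic spec into a TFJob custom resource (as a dict)."""
    replica_specs = {}
    for role, count in spec["replicas"].items():
        replica_specs[role] = {
            "replicas": count,
            "restartPolicy": "OnFailure",
            "template": {
                "spec": {
                    "containers": [
                        {
                            # tf-operator expects the training container
                            # to be named "tensorflow".
                            "name": "tensorflow",
                            "image": spec["image"],
                            "command": list(spec["command"]),
                        }
                    ]
                }
            },
        }
    return {
        # Assumption: the cluster serves TFJob under kubeflow.org/v1.
        "apiVersion": "kubeflow.org/v1",
        "kind": "TFJob",
        "metadata": {"name": spec["name"], "namespace": spec["namespace"]},
        "spec": {"tfReplicaSpecs": replica_specs},
    }


def to_standard_status(tfjob: Dict[str, Any]) -> Dict[str, Any]:
    """Reduce a TFJob object to a small, hypothetical status format."""
    conditions = tfjob.get("status", {}).get("conditions", [])
    # Assumption: the last condition whose status is "True" is the current state.
    active = [c for c in conditions if c.get("status") == "True"]
    state = active[-1]["type"] if active else "Unknown"
    return {"name": tfjob["metadata"]["name"], "state": state}
```

In a real submitter, the dict returned by `to_tfjob` would be passed to the k8s client's custom-resource API, and the object read back from the cluster would be passed to `to_standard_status` before the REST layer returns it.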
Issue Links
- is a parent of: SUBMARINE-214 [WIP] Add submitter-k8s module (Resolved)
- is duplicated by: SUBMARINE-214 [WIP] Add submitter-k8s module (Resolved)