Open
Description
Kubernetes recently added native support for sidecar containers as part of KEP 753: kubernetes/enhancements#3761
We need to discuss whether we can improve our architecture by running the Katib Metrics Collectors as native Kubernetes sidecars.
Running a container as a sidecar requires declaring it as an initContainer with restartPolicy: Always.
This feature is available only from Kubernetes 1.28 onward, but we can start the design discussions now.
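To make the shape of such a Pod concrete, here is a minimal sketch that builds a manifest with the collector declared as a native sidecar. The Pod name, container names, and image references are hypothetical placeholders, not Katib's actual values; only the `initContainers` entry with `restartPolicy: Always` comes from the KEP.

```python
import json


def build_trial_pod(training_image: str, collector_image: str) -> dict:
    """Return a Pod manifest dict with the metrics collector as a native sidecar.

    All names and images are illustrative assumptions, not Katib's real ones.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": "trial-example"},  # hypothetical name
        "spec": {
            # Pod-level policy: the training container runs once.
            "restartPolicy": "Never",
            "initContainers": [
                {
                    "name": "metrics-collector",  # hypothetical name
                    "image": collector_image,
                    # Container-level restartPolicy: Always on an init
                    # container is what marks it as a native sidecar.
                    "restartPolicy": "Always",
                }
            ],
            "containers": [
                {"name": "training", "image": training_image},
            ],
        },
    }


pod = build_trial_pod(
    "example.com/training:latest",           # placeholder image
    "example.com/metrics-collector:latest",  # placeholder image
)
print(json.dumps(pod, indent=2))
```

With this layout the kubelet starts the sidecar before the regular containers and stops it after they exit, so the collector would no longer need to be injected into `containers` alongside the training container.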
Existing problems with the current Katib Metrics Collector sidecar:
- If the training container finishes before the Metrics Collector has started, the Trial fails.
- Because we override the Trial's start command, the Trial might fail. For example, see: Add support for entire kubeflow pipelines as trial target (in addition to containers) #1914 (comment).
cc @kubeflow/wg-automl-leads @tenzen-y @gaocegege @votti