-
Notifications
You must be signed in to change notification settings - Fork 840
Open
Description
I have read this readme file: https://github.com/kubeflow/trainer/tree/master/docs/proposals/2401-llm-trainer-v2
It seems that TorchTuneConfig can support for multi-node LLM fine-tuning. Is this feature on the roadmap? If so, when it will be ready? It seems that currently Python SDK kubeflow V0.1.0 does not support for peft_config and multi-node, which is also highlighted in the doc: https://www.kubeflow.org/docs/components/trainer/user-guides/builtin-trainer/torchtune/ (It’s worth noticing that we do not support multi-node fine-tuning with TorchTune.) Do I have correct understanding?
Metadata
Metadata
Assignees
Labels
No labels