trainer/docs/proposals
Antonin Stefanutti c333826023
Remove TrainJobCreated condition (#2621)
* Remove TrainJobCreated condition
* Update KEP-2170 proposal
* Remove Created condition from SDK
* Default TrainJob status to Created unconditionally
* Set Failed condition on TrainJob runtime creation errors
* Emit a warning event upon TrainJob resources reconcile error
* Update TrainJob resources creation failed event
* Truncate event message to the maximum length limit
* Update state diagram in KEP-2170
* Append ellipsis to event message if it's truncated

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>
2025-05-02 17:59:05 +00:00
2003-train-api Update README and out-of-date docs (#2252) 2024-09-10 10:18:20 +00:00
2145-jax-integration Update README and out-of-date docs (#2252) 2024-09-10 10:18:20 +00:00
2170-kubeflow-trainer-v2 Remove TrainJobCreated condition (#2621) 2025-05-02 17:59:05 +00:00
2401-llm-trainer-v2 fix(doc): tidy up KEP-2401. (#2594) 2025-04-11 22:04:05 +00:00
README.md Add 'KEP Usage' KEP and template link (#2423) 2025-02-15 00:23:37 +00:00

README.md

Proposals

Kubeflow uses the KEP process to document large-scale changes to the project.

Details on the process, including the KEP template and recommendations, can be found at kubeflow/community/proposals.