trainer/pkg
Antonin Stefanutti c333826023
Remove TrainJobCreated condition (#2621)
* Remove TrainJobCreated condition

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Update KEP-2170 proposal

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Remove Created condition from SDK

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Default TrainJob status to Created unconditionally

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Set Failed condition on TrainJob runtime creation errors

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Emit a warning event upon TrainJob resources reconcile error

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Update TrainJob resources creation failed event

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Truncate event message to the maximum length limit

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Update state diagram in KEP-2170

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Append ellipsis to event message if it's truncated

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

---------

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>
2025-05-02 17:59:05 +00:00
..
apis/trainer/v1alpha1 Remove TrainJobCreated condition (#2621) 2025-05-02 17:59:05 +00:00
apply Implemenet MPI Plugin for OpenMPI (#2493) 2025-03-13 05:00:56 +00:00
client feat(controller): Refactor the Initializer APIs of TrainJob (#2523) 2025-03-17 07:15:51 +00:00
constants Remove TrainJobCreated condition (#2621) 2025-05-02 17:59:05 +00:00
controller Remove TrainJobCreated condition (#2621) 2025-05-02 17:59:05 +00:00
initializers feat(controller): Refactor the Initializer APIs of TrainJob (#2523) 2025-03-17 07:15:51 +00:00
runtime Implement trainer.kubeflow.org/resource-in-use finalizer mechanism to TrainingRuntime (#2608) 2025-05-01 20:17:04 +00:00
util Implement trainer.kubeflow.org/resource-in-use finalizer mechanism to ClusterTrainingRuntime (#2625) 2025-05-02 01:27:04 +00:00
webhooks feat: add replicatedJobs.replicas validations in validateReplicatedJobs function. (#2533) 2025-03-21 10:36:27 +00:00