9 Commits

Author SHA1 Message Date
Antonin Stefanutti a728b3f77d
chore(operator): Upgrade Kubernetes to v1.34 (#2804)
Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>
2025-08-28 17:02:21 +00:00
Tsubasa Nagasawa a350fa0e45
feat: support affinity in TrainJob pod spec overrides (#2796)
Signed-off-by: Tsubasa Nagasawa <toversus2357@gmail.com>
2025-08-28 15:52:22 +00:00
Anya Kramar 0e62c455ef
chore: Remove tool.hatch.build.targets.wheel from pyproject (#2803)
Signed-off-by: kramaranya <kramaranya15@gmail.com>
2025-08-28 15:36:22 +00:00
Anya Kramar f9ec85f3db
fix(api): update license path for kubeflow_trainer_api (#2778)
Signed-off-by: kramaranya <kramaranya15@gmail.com>
2025-08-07 00:33:55 +00:00
Andrey Velichkevich 05b4c4543e
fix(api): Fix license path for Kubeflow Trainer Python API (#2771)
Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
2025-08-04 14:44:53 +00:00
Antonin Stefanutti 45ce64f386
Upgrade Kubernetes to v1.33 (#2756)
Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>
Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
2025-07-28 15:18:49 +00:00
Antonin Stefanutti 1ea271265d
feat: Add schedulingGates to PodSpecOverrides (#2700)
* feat: Add schedulingGates to PodSpecOverrides

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Change desired job to target job in PodSpecOverrides comments

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

---------

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>
2025-07-03 11:03:20 +00:00
Andrey Velichkevich 68d4ba1270
feat(controller): Implement PodSpecOverride API (#2614)
* feat(controller): Implement PodSpecOverride API

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Update API comments

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Fix InitContainers override

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Fix generate

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Update pkg/apis/trainer/v1alpha1/trainjob_types.go

Co-authored-by: Shao Wang <2690692950@qq.com>
Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Change API to targetJob and add validation

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Add unit test for the JobSet builder

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Add integration tests

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Update pkg/apis/trainer/v1alpha1/trainjob_types.go

Co-authored-by: Shao Wang <2690692950@qq.com>
Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Update pkg/apis/trainer/v1alpha1/trainjob_types.go

Co-authored-by: Shao Wang <2690692950@qq.com>
Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Update pkg/apis/trainer/v1alpha1/trainjob_types.go

Co-authored-by: Antonin Stefanutti <astefanutti@users.noreply.github.com>
Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>

* Merge PodSpecOverrides before the build phase

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Fix make generate

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Fix PodSpec SMT patch unmarshalling into JobSet template

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Add UT for volumes PodSpecOverrides

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Add UT for tolerations PodSpecOverrides

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Add UT for node selector PodSpecOverrides

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Revert back to using TargetJobs []PodSpecOverrideTargetJob

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Run make fmt

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Add target jobs unicity validation for PodSpecOverrides

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

* Update with code review from @tenzen-y

Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>

---------

Signed-off-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
Signed-off-by: Antonin Stefanutti <antonin@stefanutti.fr>
Co-authored-by: Shao Wang <2690692950@qq.com>
Co-authored-by: Antonin Stefanutti <astefanutti@users.noreply.github.com>
Co-authored-by: Antonin Stefanutti <antonin@stefanutti.fr>
2025-06-06 17:04:18 +00:00
Anya Kramar 1fb792e424
Move generated Python models into kubeflow_trainer_api package (#2632)
Signed-off-by: kramaranya <kramaranya15@gmail.com>
2025-05-07 18:14:39 +00:00