Commit Graph

5217 Commits

Author SHA1 Message Date
Andre Keedy 4f30519ee5
Add CoreWeave Cluster Autoscaler provider (#8332)
* initial commit

* Add coreweave manager and node pools using unstructure
Add provider unit tests
coverage = 68.7%
run go fmt
update comments on coreweave files
change const to unexported
Add boilerplate header
Remove unused function from manager
Add coreave tag exclusion from the build_all

* address comments and feedbacks

* use the shared *rest.Config from the autoscaler's logic

* update the comments for clouProvider in charts values file

* update charts README with coreweave cloudProvider
2025-08-11 14:19:07 -07:00
Manoj 126b76d130
Utho autoscaler (#8398)
* cloudprovider: init Utho cloud provider

* add utho cloud provider logic

* implement tests and mock client for Utho cloud provider functionality

* enhance Utho cloud provider: improve logging, add ReadNodePool method, and restore toProviderID function

* add unit tests for Utho cloud provider node group functionality

* add deployment and secret configuration for Utho cloud provider

* remove outdated Utho Go module versions from go.sum

* add stress-test deployment example, improve error messages, and enhance node group tests

* update utho autoscaler image version to 1.0.0

* refactor: reorder parameters in setupMockListNodePools

* fix: correct license formatting in utho_cloud_provider_test.go and add license header to utils.go

* fix: comment out approvers and reviewers in OWNERS file

* Remove utho-go as dependency

* add comments to clarify structures and services in Utho API

* add comments to Utho SDK

* add comments

* remove unnecessary comments in cloud_instances.go and kubernetes.go

* Revert changes to go.mod

* Revert changes to go.mod

* ensure newline at end of go.mod and go.sum files

---------

Co-authored-by: hmada15 <31375621+hmada15@users.noreply.github.com>
Co-authored-by: m-kased <31375621+m-kased@users.noreply.github.com>
2025-08-11 09:35:08 -07:00
Robin D. 6d32a56ca9
fix: not failing the main loop when one NodeGroup fails on TemplateNodeInfo() (#8402)
* fix: not failing the main loop when one NodeGroup fails on TemplateNodeInfo()

* test: add a unit test
2025-08-07 17:19:42 -07:00
Bartłomiej Wróblewski b05821737f Update vendored kubernetes to 1.34.0-beta.0 2025-08-05 14:16:44 +00:00
Kubernetes Prow Robot 24ba0e7df0
Merge pull request #8404 from jackfrancis/ca-update-deps-improvements
CI: make update-deps.sh macOS compatible
2025-08-04 18:29:40 -07:00
Jack Francis 6e1da17bba
CI: make update-deps.sh macOS compatible 2025-08-04 09:43:55 -07:00
Kubernetes Prow Robot 14f79255b2
Merge pull request #8403 from kincoy/fix-typo-autoscaler-interface-comment
chore(autoscaler): fix typo in Autoscaler interface comment
2025-08-04 09:27:40 -07:00
Kubernetes Prow Robot 3344f064df
Merge pull request #8291 from adamoldak/zero_or_max_scaling_scaleup_errors
Do not remove healthy nodes from partially failing zero-or-max-scaling node groups
2025-08-04 06:13:39 -07:00
Adam Oldak 1e3cede9aa
Do not remove healthy nodes from partially failing zero-or-max-scaling node pool scale-ups 2025-08-04 11:44:27 +00:00
kincoy ed6fc0f199 chore(autoscaler): fix typo in Autoscaler interface comment
Signed-off-by: kincoy <1152072645@qq.com>
2025-08-04 16:29:03 +08:00
Kubernetes Prow Robot c1352dad7c
Merge pull request #8297 from pat-s/feat/imagesForArch-nodepool
Hetzner(feat): add option to set nodepool-specific image IDs
2025-07-31 23:47:15 -07:00
Kubernetes Prow Robot 172a22c195
Merge pull request #8376 from jackfrancis/ca-maintainer-elmiko
CA: add elmiko as maintainer
2025-07-31 11:01:15 -07:00
Jack Francis 26d6b38699
CA: add release automation validation 2025-07-30 17:54:44 -07:00
Jack Francis 2eb5adda2c
just approver 2025-07-28 09:31:05 -07:00
Jack Francis 0d36de3aa2
CA: add elmiko as maintainer 2025-07-28 07:55:24 -07:00
Kubernetes Prow Robot 1d5f0471bc
Merge pull request #8315 from vbhargav875/oci-oke-handle-ooc
Handle Out of host capacity scenario in OCI nodepools
2025-07-24 23:34:27 -07:00
Vijay Bhargav Eshappa 1fbc7a9d48 Handle Out of host capacity scenario in OCI nodepools 2025-07-24 16:04:49 +05:30
Kubernetes Prow Robot 9a256e5c83
Merge pull request #8138 from aleskandro/patch-1
Fix typo in expander/grpcplugin/README.md
2025-07-21 12:34:27 -07:00
Maximiliano Uribe f9b93ec395
adding env variable EnableLabelPrediction (#8324)
* adding env variable EnableLabelPrediction

* addressing comments

* adding ut test and nil scenario

* adding ephemeral storage ut

* changing default value to true
2025-07-21 10:08:27 -07:00
Kubernetes Prow Robot 0d14eca879
Merge pull request #7993 from pierreozoux/pierreozoux-patch-2
docs(autoscaler): add details about flags
2025-07-21 07:50:28 -07:00
Kubernetes Prow Robot 0fb7d53506
Merge pull request #8296 from drjackild/add-m4-prices
add price info for M4 machine family
2025-07-11 07:51:29 -07:00
Maksym Fuhol 6f0f000a20 Do not consider pod PreemptionPolicy while determining whether pod is expendable
Change https://github.com/kubernetes/autoscaler/pull/6577 added a support for considering preemption policies into expendable pods evaluation, this is not correct as this propepty describes whether the created pod can evict other pods - thus should be considered only within scheduler and its framework. This change removes the policy out of the consideration
2025-07-10 10:16:12 +00:00
Kubernetes Prow Robot 12e6e2e182
Merge pull request #8268 from hakman/compare-examples
azure: Make it easier to compare examples
2025-07-09 09:43:27 -07:00
Kubernetes Prow Robot 8e47b51d39
Merge pull request #8300 from jincong8973/master
feat: improve external gPRC ca example
2025-07-07 23:57:27 -07:00
Kubernetes Prow Robot f3c58dae9c
Merge pull request #8273 from krzysied/fake-pods
Omit fake pods during eviction
2025-07-07 12:31:26 -07:00
Jack Francis 3f9526837e azure: 2025-06-30 CA E2E module updates 2025-07-07 10:53:01 -07:00
Kubernetes Prow Robot b1780e6401
Merge pull request #8267 from hakman/update-examples
azure: Add volumeattachments read to ClusterRole for examples
2025-07-07 09:37:27 -07:00
Kubernetes Prow Robot 65c4d6f702
Merge pull request #8292 from pmendelski/revert-exclude-injected-node-groups-from-balancing
Revert filter out non-existing node-groups before scale-up balancing
2025-07-07 09:09:29 -07:00
Kubernetes Prow Robot 563f074dd1
Merge pull request #8295 from luizm/chart-1.33.0
chore: bump CA chart image to v1.33
2025-07-07 07:45:26 -07:00
靳聪 815da21233 feat: improve external gPRC ca example 2025-07-07 20:05:23 +08:00
Krzysztof Siedlecki 82178880ba Omit fake pod evictions 2025-07-07 09:45:17 +00:00
pat-s 008a3b916e
typo 2025-07-05 15:14:24 +02:00
pat-s 77cb4c8bf8
add files 2025-07-05 15:03:29 +02:00
Yevhen Dubovskoi 9424deef46 add price info for M4 machine family 2025-07-04 20:16:34 +00:00
Kubernetes Prow Robot c93df03bca
Merge pull request #8275 from dsafdsa1/deps
CA: bump k8s dependencies to v1.33.0-alpha.1, run codegen
2025-07-04 11:23:25 -07:00
luizm b44c40d4ec
chore: bump CA Chart image to v1.33 2025-07-04 10:59:27 -03:00
mendelski 7912e2d0f6
Revert filter out non-existing node-groups before scale-up balancing 2025-07-03 20:19:06 +00:00
mendelski 792fba7ed1
Filterout non-existing node-groups before scale-up balancing 2025-07-03 07:52:36 +00:00
Maximiliano Uribe 7a1e49ac1f removing UT cases where changes in azure api may affect results, fake values in real apis not necessary 2025-06-30 19:25:32 +00:00
pawel siwek 4560f69eaf Revert "tmp: make apis/ a package"
This reverts commit 897989f231.
2025-06-30 11:36:16 +00:00
pawel siwek 353b44637b update-codegen.sh
hack/update-codegen.sh
2025-06-30 11:35:22 +00:00
pawel siwek 897989f231 tmp: make apis/ a package
Following https://github.com/kubernetes/autoscaler/pull/7195
2025-06-30 11:33:38 +00:00
pawel siwek d5c1e15385 update-deps.sh
./hack/update-deps.sh v1.34.0-alpha.1 v1.34.0-alpha.1 https://github.com/kubernetes/kubernetes.git
2025-06-30 11:31:47 +00:00
Krzysztof Siedlecki 2814dcafaf Export fake pods definition to a dedicated module 2025-06-27 12:16:09 +00:00
Kubernetes Prow Robot 77e3f571bf
Merge pull request #8266 from elmiko/add-more-balance-logging
cluster-autoscaler: add logging for failed node balancing
2025-06-26 23:36:28 -07:00
Elizabeth Yam 8e0d47c61e
add h4d pricing (#8205)
* add h4d pricing

* fix go fmt

* revert gofmt on other files
2025-06-26 10:42:29 -07:00
Ciprian Hacman 20a59a9f41 azure: Make it easier to compare examples 2025-06-26 07:42:10 +03:00
Ciprian Hacman c942ff37ad azure: Add volumeattachments read to ClusterRole for examples 2025-06-26 06:58:59 +03:00
elmiko 771b9ee591 add logging for failed node balancing
this change adds debug logs at level 5 to aid in triaging failed node
balancing. It adds logs to help determine why two node groups are not
considered as similar. These logs can be quite noisy so the logging
level has been set to 5 by default.
2025-06-25 10:43:37 -04:00
Kubernetes Prow Robot 31caf5b0bf
Merge pull request #8183 from MenD32/feat/cordon-node-before-terminate-by-default
feat: cordon node before terminate by default
2025-06-25 01:52:29 -07:00