Commit Graph

9284 Commits

Author SHA1 Message Date
Luiz Antonio 00f627fbb9 Also read resources from containerStatus in priority processor 2025-04-01 15:25:10 -04:00
dependabot[bot] 3e43170446
Bump golang in /vertical-pod-autoscaler/pkg/recommender
Bumps golang from 1.24.1 to 1.24.2.

---
updated-dependencies:
- dependency-name: golang
  dependency-version: 1.24.2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2025-04-01 18:35:52 +00:00
dependabot[bot] 4a1b362ca5
Bump golang in /vertical-pod-autoscaler/pkg/admission-controller
Bumps golang from 1.24.1 to 1.24.2.

---
updated-dependencies:
- dependency-name: golang
  dependency-version: 1.24.2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2025-04-01 18:34:33 +00:00
dependabot[bot] 9148a69e87
Bump golang in /vertical-pod-autoscaler/pkg/updater
Bumps golang from 1.24.1 to 1.24.2.

---
updated-dependencies:
- dependency-name: golang
  dependency-version: 1.24.2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2025-04-01 18:33:34 +00:00
Luiz Antonio 204ac56883 Also read resources from containerStatus in oom observer 2025-04-01 13:48:03 -04:00
Luiz Antonio a8c5030035 Also read resources from containerStatus in recommendation_provider 2025-04-01 12:21:38 -04:00
Luiz Antonio ea396b5f2b Also read resources from containerStatus in resource_updates 2025-04-01 12:00:04 -04:00
Kubernetes Prow Robot dc91330f6a
Merge pull request #7989 from loick111/feature/clusterapi-instances-status
ClusterAPI: Report machine phases to improve cluster-autoscaler decisions
2025-04-01 07:44:38 -07:00
Kubernetes Prow Robot c107f2bba5
Merge pull request #7998 from jackfrancis/add-owners-aliases
update OWNERS, add OWNERS_ALIASES
2025-04-01 01:56:51 -07:00
Florian Ströger ecb572a945 Use Patch to Scale clusterapi nodepools to avoid modification conflicts
Issue: https://github.com/kubernetes/autoscaler/issues/7872
Signed-off-by: Florian Ströger <stroeger@youniqx.com>
2025-04-01 08:26:45 +02:00
Kubernetes Prow Robot 6c7c0c1ffd
Merge pull request #7984 from toredash/fix/vpa-autoscaler-flags-link
chore(release-process): update flags and flags link
2025-03-31 18:58:36 -07:00
Luiz Antonio 3d138309b9 Add helpers to get containers requests/limits 2025-03-31 17:22:45 -04:00
Kubernetes Prow Robot 13dba1751c
Merge pull request #7936 from fcrespofastly/allow_tpl_on_various_parts_of_the_chart
allow `tpl` on common metadata to DRY
2025-03-31 10:08:41 -07:00
Jack Francis d97c4b22df update OWNERS, add OWNERS_ALIASES
Signed-off-by: Jack Francis <jackfrancis@gmail.com>
2025-03-31 09:50:51 -07:00
Fernando Crespo Gravalos 460797ba4a remove extra new line
Signed-off-by: Fernando Crespo Gravalos <fcrespo@fastly.com>
2025-03-31 18:49:37 +02:00
Fernando Crespo Gravalos a6a54e8806 use backticks to escape braces
Signed-off-by: Fernando Crespo Gravalos <fcrespo@fastly.com>
2025-03-31 18:46:34 +02:00
Fernando Crespo Gravalos 365c3d1d0c update README.md
Signed-off-by: Fernando Crespo Gravalos <fcrespo@fastly.com>
2025-03-31 18:46:34 +02:00
Fernando Crespo Gravalos 63c7d13622 remove testing label leftover
Signed-off-by: Fernando Crespo Gravalos <fcrespo@fastly.com>
2025-03-31 18:46:34 +02:00
Fernando Crespo Gravalos fd0f93a94d this allows DRY common metadata
Signed-off-by: Fernando Crespo Gravalos <fcrespo@fastly.com>
2025-03-31 18:46:32 +02:00
Damika Gamlath 49b271f75a Emit scale down metric even when there are no scale down candidates.
Update the scaleDownInCooldown definition so that having zero candidates no longer counts as a reason to be in the scaleDownInCooldown state
2025-03-31 14:46:23 +00:00
Kubernetes Prow Robot 19cb11766d
Merge pull request #7994 from jackfrancis/update-helm-docs-braces
helm: backtick to escape braces
2025-03-31 03:26:46 -07:00
Jack Francis 8b31ea0140 helm: backtick to escape braces
Signed-off-by: Jack Francis <jackfrancis@gmail.com>
2025-03-28 15:33:18 -07:00
Kubernetes Prow Robot 5d4f6f1b80
Merge pull request #7979 from laoj2/fix-timeouts
Parametrize pod resize timeouts in AEP-4016 beta
2025-03-28 12:50:40 -07:00
Kubernetes Prow Robot 7d475d181c
Merge pull request #7990 from omerap12/add-omer-as-approver
Add omerap12 to VPA approvers
2025-03-28 12:48:45 -07:00
Kubernetes Prow Robot 10f98da57e
Merge pull request #7972 from goto-opensource/fix/cluster-autoscaler-chart-remove-aws-region-default
fix: remove default value for `awsRegion` value in cluster-autoscaler Helm chart to avoid accidental misconfigurations
2025-03-28 12:14:34 -07:00
Loick MAHIEUX 005a42b9af feat(cluster-autoscaler): improve nodes listing in ClusterAPI provider
Add improved error handling for machine phases in the ClusterAPI node group
implementation. When a machine is in Deleting/Failed/Pending phase, mark the cloudprovider.Instance
with a status for cluster-autoscaler recovery actions.

The changes:
- Enhance Nodes listing to allow reporting the machine phase in Instance status
- Add error status reporting for failed machines

This change helps identify and manage failed machines more effectively,
allowing the autoscaler to make better scaling decisions.
2025-03-28 15:07:34 +01:00
Tore Stendal Lønøy 9bcecb96c8 chore(docs): update link to use full URI
the previous link was a failed attempt to use a relative URL to avoid having to replace URLs in the future if the repository were moved to a new location
2025-03-28 10:16:55 +01:00
Omer Aplatony 52334dec72 Add omerap12 to VPA approvers
Signed-off-by: Omer Aplatony <omerap12@gmail.com>
2025-03-28 09:00:42 +00:00
Tore Stendal Lønøy 21422e5419 chore(docs): update sed command to work correctly on mac 2025-03-28 09:42:52 +01:00
Luiz Antonio 81d42aacef Parametrize pod resize timeouts in AEP-4016 2025-03-27 14:39:15 -04:00
Kubernetes Prow Robot 27d00f486c
Merge pull request #7985 from toredash/fix/vpa-api-docs-script
chore(vpa): update api-docs script
2025-03-27 11:14:34 -07:00
Kubernetes Prow Robot db597b1acd
Merge pull request #7966 from pmendelski/htnap-events-for-tpu
Emit event on successful async scale-up
2025-03-27 02:32:34 -07:00
Tore Stendal Lønøy 55eb65255e chore(vpa): update api-docs script
some exports needed for the script to work correctly were missing.
2025-03-27 09:29:04 +01:00
Tore Stendal Lønøy c9ee74c39b chore(release-process): update flags and flags link
as part of the release process, it is required to update the flags documentation and to ensure that the components.md file points to the tagged version of the flags.md file.
2025-03-27 09:24:16 +01:00
Kubernetes Prow Robot 7b6996469b
Merge pull request #7973 from jincong8973/master
feat: add ignoreDaemonSetsUtilization and zeroOrMaxNodeScaling to NodeGroupAutoscalingOptions
2025-03-27 00:00:35 -07:00
KrJin e713b51bd6 feat: add missing field zeroOrMaxNodeScaling and ignoreDaemonSetsUtilization to NodeGroupAutoscalingOptions
[squashed] Add fields IgnoreDaemonSetsUtilization and zeroOrMaxNodeScaling that are missing in the externalgrpc proto
2025-03-27 11:28:12 +08:00
Kubernetes Prow Robot 2ca5b44652
Merge pull request #7977 from elmiko/refactor-findscalableproviderids
refactor findScalableResourceProviderIDs in clusterapi
2025-03-26 10:22:43 -07:00
elmiko 5e1fc195a3 refactor findScalableResourceProviderIDs in clusterapi
this change refactors the function so that each distinct machine
state can be filtered more easily. the unit tests have been
supplemented, but not changed, to ensure that the functionality continues
to work as expected. these changes are to help better detect edge cases
where machines can be transitioning through the pending phase and might be
removed by the autoscaler.
2025-03-26 12:41:09 -04:00
Kubernetes Prow Robot 8251159ba3
Merge pull request #7891 from voelzmo/enh/drop-metrics-for-init-containers
Drop metrics for init containers
2025-03-26 07:10:39 -07:00
mendelski 0c522556c5
Emit event on successful async scale-up 2025-03-26 13:11:03 +00:00
Kubernetes Prow Robot 94ae175e94
Merge pull request #7886 from plkokanov/fix/empty-histogram-after-load-from-checkpoint
Fixes histograms becoming empty after being loaded from checkpoints
2025-03-26 03:04:36 -07:00
Kubernetes Prow Robot 63309979ba
Merge pull request #7826 from Azure/rakechill/update-skewer-version-master
Update skewer version to v0.0.19 (master)
2025-03-26 01:30:34 -07:00
Kubernetes Prow Robot e95e35c94e
Merge pull request #7965 from DigitalVeer/master
pricing changes: updated z3 pricing information
2025-03-25 10:48:33 -07:00
Alfred Krohmer 1687d46e68 fix: remove default value for `awsRegion` value in cluster-autoscaler Helm chart to avoid accidental misconfigurations
The region should be specified explicitly. Having a default might be dangerous: cluster-autoscaler might find matching autoscaling groups in other regions that belong to a different cluster and, because it doesn't recognize their nodes as belonging to its own cluster, wipe all nodes in them.
2025-03-25 14:43:30 +01:00
Plamen Kokanov 4a233bf7df Address review comments 2025-03-25 12:23:00 +02:00
Kubernetes Prow Robot aa1d413ea3
Merge pull request #7970 from laoj2/fix-pod-condition
Replace PodResizing with PodResizeInProgress pod condition in AEP-4016
2025-03-24 22:06:33 -07:00
Luiz Antonio 8892f21919 Replace PodResizing with PodResizeInProgress condition in AEP-4016 2025-03-24 14:57:42 -04:00
Kubernetes Prow Robot 52cd68a498
Merge pull request #7954 from abdelrahman882/FixScaledownCoolDown
Fix cool down status condition to trigger scale down
2025-03-24 07:38:33 -07:00
Plamen Kokanov 72c2f93c7c Describe why the additional bucket handling is necessary 2025-03-24 16:24:16 +02:00
Kubernetes Prow Robot f90590b90f
Merge pull request #7914 from abdelrahman882/bsp
Add time based drainability rule for non-pdb-assigned system pods
2025-03-24 06:20:36 -07:00