2985 Commits

Author SHA1 Message Date
Joel Speed 9f670d4ea8
Ensure ClusterAPI DeleteNodes accounts for out-of-band changes to scale
Because the autoscaler assumes it can delete nodes in parallel, it
fetches the nodegroup for each node in a separate goroutine and then
instructs each nodegroup to delete a single node.
Because the nodegroup is not shared across goroutines, the cached
replica count in the scalable resource can become stale. As a result, if
the autoscaler attempts to scale down multiple nodes at a time, the
Cluster API provider actually removes only a single node.

To prevent this, we must ensure we have a fresh replica count for every 
scale down attempt.
2022-01-21 16:08:00 +00:00
Kubernetes Prow Robot 5c741c881d
Merge pull request #4626 from lzhecheng/remove-deleteblob-ut
Remove TestDeleteBlob UT
2022-01-19 17:53:52 -08:00
Zhecheng Li 5b99b58ba1 Remove TestDeleteBlob UT
Signed-off-by: Zhecheng Li <zhechengli@microsoft.com>
2022-01-20 09:28:18 +08:00
Kubernetes Prow Robot f8266a5101
Merge pull request #4627 from yaroslava-serdiuk/templates
GCE: Add m2-megamem-416 price
2022-01-19 07:06:06 -08:00
Yaroslava Serdiuk abacf124ad GCE: Add m2-megamem-416 price 2022-01-19 14:51:22 +00:00
Kubernetes Prow Robot 698c02b17c
Merge pull request #4603 from yaroslava-serdiuk/templates
Introduce gce image types and remove *_containerd gce os distributions
2022-01-19 04:56:04 -08:00
Yaroslava Serdiuk 5380a9dd83 Cluster-Autoscaler: Introduce gce image types and remove *_containerd gce os distributions. 2022-01-19 12:26:36 +00:00
Kubernetes Prow Robot 91e8f8e40c
Merge pull request #4617 from kisieland/add_context_to_scale_down_processor
Add AutoscalingContext to the scale-down post-processor
2022-01-18 03:07:08 -08:00
Maciek Pytel 217d780160 Add FAQ entry about the go version used 2022-01-18 10:22:57 +01:00
Maciek Pytel 24f896cd9d Add go:build tags matching existing +build tags
As of go1.17 both tags are expected to exist simultaneously.
Added tags in all cluster autoscaler files. Added verify-gomod.sh
exceptions for non-compliant autogenerated VPA files.
2022-01-18 10:22:57 +01:00
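For readers unfamiliar with the two constraint syntaxes mentioned in the commit above, the following sketch shows a file carrying both forms at once. The tag name `example_excluded` and the `describe` function are made up for illustration and do not come from the repository. The `//go:build` line comes first, the matching `// +build` line follows, and a blank line separates them from the package clause.

```go
//go:build !example_excluded
// +build !example_excluded

// This file is compiled unless the (hypothetical) example_excluded tag is set.
package main

import "fmt"

// describe reports which build configuration this file belongs to.
func describe() string {
	return "built without the example_excluded tag"
}

func main() {
	fmt.Println(describe())
}
```

Since Go 1.17, `gofmt` keeps the two lines in sync when both are present, so toolchains that only understand the older `// +build` form select the same files.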
Daniel Gutowski a230b47fec Add AutoscalingContext to the scale-down post-processor 2022-01-18 07:58:53 +00:00
Benjamin Pineau 1aca77527a azure: change a flaky test
It seems that the test gets varying error messages, which prompted
Bartłomiej's previous fix, but I'm now seeing the original error
message string again ("Server failed to authenticate [...]"),
so the `TestDeleteBlob` test is failing again (test failures in
other PRs suggest it's not just my laptop).

Let's assume this was meant to check for an error. Until someone
can confirm, that might be better than potentially hiding real
test failures in other PRs.
2022-01-17 19:01:05 +01:00
Kubernetes Prow Robot f5de590bea
Merge pull request #4580 from cprivite/Rename_Packet_to_Equinix_Metal
Rename packet to equinix metal
2022-01-13 08:04:30 -08:00
Kubernetes Prow Robot 441d7968fa
Merge pull request #4519 from kisieland/scale_down_candidate_select_processor
Introduce the scale down processor that picks the final scale down candidates
2022-01-13 08:02:30 -08:00
Kubernetes Prow Robot b9bfdc1bbc
Merge pull request #4579 from randomvariable/remove-randomvariable-owners
Cluster API OWNERS: Remove randomvariable
2022-01-13 07:12:30 -08:00
Kubernetes Prow Robot 80574ca166
Merge pull request #4508 from aledbf/done-error
Cluster Autoscaler: GCE: check the result of the operation
2022-01-13 07:08:30 -08:00
Kubernetes Prow Robot 00721caf97
Merge pull request #4582 from cprivite/Use_Current_cluster-autoscaler_image_In_Example
use gcr hosted cluster-autoscaler image
2022-01-13 06:18:30 -08:00
Bartłomiej Wróblewski f0a9ede345 Fix constant used in azure unit tests 2022-01-11 16:05:16 +00:00
Kubernetes Prow Robot b3576e0cdc
Merge pull request #4507 from ByteAlex/hetzner-node-name
Shorten Hetzner's node names with hex repr
2022-01-09 19:09:12 -08:00
Chris Privitere a220224889 use gcr hosted cluster-autoscaler image
Signed-off-by: Chris Privitere <cprivite@users.noreply.github.com>
2022-01-06 20:59:59 +00:00
Chris Privitere c4e1aa247e Add note to readme about the rename of Packet.
Signed-off-by: Chris Privitere <cprivite@users.noreply.github.com>
2022-01-05 20:26:16 +00:00
Chris Privitere 8f8d071b9e Update example facility and machine plans to current versions.
Signed-off-by: Chris Privitere <cprivite@users.noreply.github.com>
2022-01-05 18:20:31 +00:00
Chris Privitere 0396f5c3c9 Rename packet to Equinix Metal 2022-01-05 17:45:48 +00:00
Naadir Jeewa ee761bdc24
Cluster API OWNERS: Remove randomvariable
Signed-off-by: Naadir Jeewa <jeewan@vmware.com>
2022-01-05 15:11:21 +00:00
Daniel Gutowski 8064d6d1fd Introduce the scale down processor that picks the final scale down candidates. 2022-01-03 16:05:36 +00:00
Jayant Jain 729038ff2d Adding support for Debugging Snapshot 2021-12-30 09:08:05 +00:00
Qi Ni dc64e41104 chore: remove a time-consuming unit test in provider azure 2021-12-27 10:37:52 +08:00
Kubernetes Prow Robot 6d19e3ddb9
Merge pull request #4441 from marwanad/fix-pod-equivalence-perf
fix pod equivalency checks for pods with projected volumes
2021-12-24 04:12:15 -08:00
Kubernetes Prow Robot fca1dc0513
Merge pull request #4550 from marwanad/csi-topology-label-ignore-scale-from-zero
ignore azure csi topology label for similarity checks and populate it for scale from zero
2021-12-23 03:30:37 -08:00
Marwan Ahmed fd089c2d15 avoid double wrapping scale up error 2021-12-22 15:47:05 +02:00
Kubernetes Prow Robot 7b19d33de7
Merge pull request #4345 from sergelogvinov/create-timeout
Increase server create timeout
2021-12-22 03:23:35 -08:00
Kubernetes Prow Robot 44982d5ee7
Merge pull request #4520 from gjtempleton/CA-AWS-Instance-List-Update-13-12-21-default
CA - AWS - Update Static Instance List
2021-12-22 03:17:35 -08:00
Marwan Ahmed 26569925db ignore azure csi topology label for similarity checks and populate it for scale from zero 2021-12-21 20:44:49 +02:00
Marwan Ahmed e0952eb29d
fix scale set log formatter 2021-12-21 17:57:35 +02:00
Marwan Ahmed 286f44e351 fix pod equivalency checks for pods with projected volumes 2021-12-21 17:02:30 +02:00
Bartłomiej Wróblewski dc71801b7a Add MigLister interface with default no-op implementation 2021-12-21 10:41:00 +00:00
Kubernetes Prow Robot 51dbafe27b
Merge pull request #4543 from BigDarkClown/mm2
Cache Mig instances in GceCache
2021-12-21 02:10:45 -08:00
Bartłomiej Wróblewski 577a5c69c1 Add unit tests for MigInfoProvider instances cache refreshes 2021-12-21 09:55:52 +00:00
Bartłomiej Wróblewski a65b84197b Cache Mig instances in GceCache 2021-12-21 09:55:45 +00:00
Kubernetes Prow Robot 1b6323372b
Merge pull request #4541 from marwanad/improve-azure-cache-logging
improve logging for scale set size changes
2021-12-20 18:02:45 -08:00
Kubernetes Prow Robot 6c9b0e9bfc
Merge pull request #4539 from ialidzhikov/fix/scale-up-on-volume-limits
Add `--feature-gates` flag to support scale up on volume limits (CSI migration enabled)
2021-12-20 09:10:56 -08:00
Kubernetes Prow Robot bd52a45e80
Merge pull request #4542 from yaroslava-serdiuk/templates
Use node version in gce os reserved
2021-12-20 06:09:25 -08:00
Marwan Ahmed 091c72cbb0 cleanup scale set size logs 2021-12-20 15:14:05 +02:00
Kubernetes Prow Robot 9d912f3d61
Merge pull request #4512 from BigDarkClown/mm
Refresh instances to mig cache in MigInfoProvider
2021-12-20 05:09:25 -08:00
Yaroslava Serdiuk 836eda0de6 Use node version in os reserved 2021-12-20 12:44:45 +00:00
Bartłomiej Wróblewski 386c95325e Add unit tests for MigInfoProvider instances to mig cache refreshes 2021-12-20 09:46:41 +00:00
Bartłomiej Wróblewski 577ac88555 Refresh instances to mig cache in MigInfoProvider 2021-12-20 09:27:32 +00:00
ialidzhikov 986d62fb96 Add `--feature-gates` flag to support scale up on volume limits (CSI migration enabled)
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2021-12-19 15:38:17 +02:00
ialidzhikov 7e60fd8749 cluster-autoscaler: Update README.md
Signed-off-by: ialidzhikov <i.alidjikov@gmail.com>
2021-12-17 11:56:31 +02:00
Yaroslava Serdiuk 4effd91800 Add version to gce Mig 2021-12-16 10:41:02 +00:00