Commit Graph

5208 Commits

Author SHA1 Message Date
Sherman Grewal 8b624757bf Add NAMESPACE as an environment variable to the updater deployment config 2021-08-05 23:32:10 -04:00
Kubernetes Prow Robot 2dd92cbf37 allow adding annotations for priority-expander configmap 2021-08-06 08:09:19 +08:00
Kubernetes Prow Robot ca49c2c7ae
Merge pull request #4050 from afirth/patch-1
Add example to AWS readme if taint has value
2021-08-05 13:29:20 -07:00
Kubernetes Prow Robot 4b4bc85aa1
Merge pull request #4046 from sylr/aws-log
Improve misleading log
2021-08-05 12:23:19 -07:00
Kubernetes Prow Robot 9d0946bccb
Merge pull request #4241 from towca/jtuznik/n2-pricing
GCE: add pricing info for new N2 instance types
2021-08-05 01:45:23 -07:00
Jakub Tużnik 19dffbc145 GCE: add pricing info for new N2 instance types 2021-08-05 10:29:38 +02:00
Kubernetes Prow Robot 9d54f7b782
Merge pull request #4239 from BigDarkClown/move-update-labels
Move UpdateDeprecatedTemplateLabels function
2021-08-04 07:49:24 -07:00
Bartłomiej Wróblewski 1e4cb1eafe Move UpdateDeprecatedTemplateLabels function
This is a useful function, we will benefit from
having it more accessible then it is currently.
2021-08-04 14:32:39 +00:00
Kubernetes Prow Robot c563a40a60
Merge pull request #4235 from DataDog/fix-tests-and-gcp-pricing
cluster-autoscaler: fix unit tests
2021-08-03 03:12:48 -07:00
Benjamin Pineau 79c63a7b3c cluster-autoscaler: fix tests and GCE NodePrice
Recent changes configured providers to set stable nodes labels names
exclusively (ie. LabelTopologyZone and not LabelZoneFailureDomain, etc),
with older labels names backfilled at nodeInfos templates generation time
(from GetNodeInfoFromTemplate), which isn't invoked from most tests cases.
GCE NodePirce() might have been dereferencing potentially missing labels.
And run hack/update-gofmt.sh where hack/verify-all.sh fails, to pass CI.
2021-08-03 08:28:49 +02:00
Kubernetes Prow Robot 21fc0c1889
Merge pull request #4053 from codablock/old-labels
Also set new (non-beta/non-deprecated) labels in buildGenericLabels
2021-08-01 18:21:21 -07:00
by211 f2eefa9a26
Fix markdown code not showing correctly 2021-07-31 12:25:03 -05:00
Alexander Block 6d84abf0de Remove obsolete comment
arch is not hardcoded anymore
2021-07-29 16:45:09 +02:00
Alexander Block 8f11490c0c Introduce UpdateDeprecatedTemplateLabels to set beta/deprecated labels
And at the same time only set stable labels in all buildGenericLabels
implementations.

This fixes issues when a node group has 0 nodes yet and node labels are
built using buildGenericLabels and the node-template labels.

Issues include (anti-)affinity and nodeSelectors for the given labels,
giving false-negative results for candidate nodes, which leads to ASGs
never scaling up.
2021-07-29 16:45:08 +02:00
Kubernetes Prow Robot 1ecc8b43e1
Merge pull request #4225 from DataDog/gce-createinstances-basename
GCE: CreateInstances() should use BaseInstanceName
2021-07-29 05:10:19 -07:00
Benjamin Pineau 655bc6fd4a GCE: CreateInstances() should use BaseInstanceName
The new `CreateInstances()` upscale method replacing `Resize()` API
calls generates new instances names based on the MIG's name (from
`mig.GceRef()`).

Before that change, `Resize()`-initiated upscales were prompting MIGs to
spawn instances named after MIG's `BaseInstanceName` attribute.

Accordingly, `GetMigForInstance()` (still) uses MIG's `BaseInstanceName`
to map instances to their parent MIG and discover which MIGs needs an
immediate refresh.

Down the line the `clusterstate.updateReadinessStats()` periodic
goroutines won't be able to map new ready nodes to their parent MIGs
(until the cache is backfilled upward from k8s node's providerid, ie.
from an hourly goroutine), and those MIGs will be considered non-ready
(because MIG's size>0 while the MIG has no known ready instances).

So after a first upscale, MIGs (having a BaseInstanceName that is not
the MIG's Name) won't be re-upscalable for a while. Example symptoms:

```
cluster-autoscaler W0719 12:35:43.166563 6 clusterstate.go:447] Failed to find readiness information for https://www.googleapis.com/compute/v1/projects/REDACTED-PROJECT/zones/europe-west3-b/instanceGroups/REDACTED-MIGNAME
cluster-autoscaler W0719 12:35:43.193469 6 clusterstate.go:626] Readiness for node group https://www.googleapis.com/compute/v1/projects/REDACTED-PROJECT/zones/europe-west3-b/instanceGroups/REDACTED-MIGNAME not found
```

Beside mapping cache issue, this changed the instance names prefixes for
some users, while it might make sense to keep using basenames when
explicitely provided (might have an use for eg. identification, or name
length limits) and avoid a breaking change before `CreateInstances` hits
a release.
2021-07-29 12:41:12 +02:00
Kubernetes Prow Robot b6a5cdc800
Merge pull request #4226 from stevehipwell/extra-args-multi-string
[chart] Add multi string arg support
2021-07-26 03:06:33 -07:00
Steve Hipwell c8c9a3eea3 [chart] Add multi string arg support
Signed-off-by: Steve Hipwell <steve.hipwell@gmail.com>
2021-07-26 10:45:25 +01:00
Kubernetes Prow Robot 7b9725f731
Merge pull request #4215 from jayantjain93/vendor-v1.23.0-alpha.0
Update dependencies to k8s 1.23.0-alpha-0
2021-07-23 02:31:17 -07:00
Jayant Jain 74762f2cc5 Update dependencies to k8s 1.23.0-alpha-0 2021-07-23 08:31:43 +00:00
Kubernetes Prow Robot af9abfec05
Merge pull request #4216 from jayantjain93/update-go-version
Update go version to 1.16 in Dockerfile and Github Tests
2021-07-23 01:25:16 -07:00
Jayant Jain 649d9e88a9 update go version to 1.16 for dockerfile 2021-07-23 07:49:14 +00:00
Kubernetes Prow Robot 0946e38fda
Merge pull request #4127 from shreyas-srinivas/ca_aws_region_fix
Fix: CA does not work properly while using AWS EC2 IMDSv2
2021-07-22 05:43:52 -07:00
darkpssngr b2f7fcc984
Update cluster-autoscaler/cloudprovider/aws/README.md
Co-authored-by: Guy Templeton <guyjtempleton@googlemail.com>
2021-07-20 20:53:06 +05:30
caogj 1b707e9d4d fix flag usage 2021-07-20 21:53:27 +08:00
Kubernetes Prow Robot 3a69f118d9
Merge pull request #4207 from MyannaHarris/update_cas_aws_example_memory
[Examples] Update the AWS examples to have 600Mi of memory
2021-07-19 14:58:52 -07:00
Kubernetes Prow Robot 92ec53d0d5
Merge pull request #4106 from ockam-network/master
docs: Install parameters
2021-07-19 14:18:53 -07:00
MyannaHarris dcdb952322 [Examples] Update the AWS examples to have 600Mi of memory
This change updates the AWS examples to have 600Mi of memory because CAS downloads a pricing file that contains EC2 instance info at startup which grows each time there's new EC2 instance information available. Currently the largest region is hitting the 300Mi limit when downloading that file, so we are increasing the memory limit in our examples for customers.
2021-07-19 11:03:20 -07:00
Kubernetes Prow Robot d74c8e3b11
Merge pull request #4200 from elmiko/update-capi-owners
update cluster-autoscaler CAPI provider owners
2021-07-16 08:40:09 -07:00
Michael McCune 0499b886d4 update cluster-autoscaler CAPI provider owners
This change is adding github users arunmk, mrajashree, jackfrancis,
shysank, and randomvariable to the reviews for the cluster-api
provider. It also removes frobware and ncdc from the approvers and
reviewers.
2021-07-15 14:36:19 -04:00
Kubernetes Prow Robot fa39793465
Merge pull request #4193 from Ritikaa96/update-faq-cluster-autoscaler
Cluster-autoscaler-FAQ: Updating link addresses for MinAvailable and Cloud-controller-manager
2021-07-14 02:10:27 -07:00
Kubernetes Prow Robot 5e0fbd943d
Merge pull request #4195 from mattxwang/master
docs: replaces Travis CI badge with GitHub Actions badges
2021-07-14 00:30:27 -07:00
Matthew Wang 9802a19ee5
docs: replaces Travis CI badge with GitHub Actions badges 2021-07-13 10:20:32 -07:00
Ritikaa96 099f2a30d6 Updating link addresses for MinAvailable and Cloud-controller-manager
Signed-off-by: Ritikaa96 <ritika@india.nec.com>
2021-07-13 18:23:54 +05:30
Kubernetes Prow Robot d221b0f620
Merge pull request #4187 from MaciekPytel/1_22_beta_1
Update dependencies to k8s 1.22.0-beta.1
2021-07-09 04:48:53 -07:00
Maciek Pytel 1d0a9e4630 Update dependencies to k8s 1.22.0-beta.1
Some changes in scheduler framework initialization
in response to upstream refactors.
2021-07-09 13:30:18 +02:00
Kubernetes Prow Robot 389cfd2e18
Merge pull request #4184 from MaciekPytel/fix_ut
Skipping metrics tests added in #4022
2021-07-08 06:34:54 -07:00
Maciek Pytel 52392b3707 Skipping metrics tests added in #4022
Each test works in isolation, but they cause panic when the entire
suite is run (ex. make test-in-docker), because the underlying
metrics library panics when the same metric is registered twice.
2021-07-08 15:14:26 +02:00
Kubernetes Prow Robot 7f6f9531d9
Merge pull request #4183 from stevehipwell/chart-semver
[chart] Bump version to valid semver
2021-07-08 03:08:53 -07:00
Steve Hipwell 41520aace5 [chart] Bump version to valid semver
Signed-off-by: Steve Hipwell <steve.hipwell@proagrica.com>
2021-07-08 10:50:51 +01:00
Kubernetes Prow Robot 23fdf63bfc
Merge pull request #4154 from MarcusNoble/chart_rbac
fix: add missing RBAC permissions to autoscaler chart
2021-07-08 01:36:53 -07:00
Kubernetes Prow Robot 9f84d391f6
Merge pull request #4022 from amrmahdi/amrh/nodegroupminmaxmetrics
[cluster-autoscaler] Publish node group min/max metrics
2021-07-05 07:38:54 -07:00
Marcus Noble 3341c863e0
Merge branch 'master' into chart_rbac 2021-07-05 15:12:17 +01:00
Kubernetes Prow Robot 7858da6716
Merge pull request #4175 from mweibel/patch-1
update cluster-autoscaler chart to 1.21.0
2021-07-05 03:46:54 -07:00
Michael Weibel 7ddbe272f8
update cluster-autoscaler chart to 1.21.0 2021-07-05 12:33:46 +02:00
Benjamin Pineau b21211cf5b Metrics for AWS API calls
Provide metrics for AWS API calls; helps identifying slowness,
throttling causes, and errors surges.
2021-07-04 14:17:15 +02:00
Kubernetes Prow Robot a4576749bb
Merge pull request #4173 from x13n/docs
Document DaemonSet eviction opt in/out behavior
2021-07-02 06:00:13 -07:00
Kubernetes Prow Robot a7ddc7e2c2
Merge pull request #4169 from rimusz/master
Add support for AWS Osaka region
2021-07-01 01:37:54 -07:00
Daniel Kłobuszewski 45f5b121a5 Document DaemonSet eviction opt in/out behavior 2021-06-30 08:56:42 +02:00
Kubernetes Prow Robot a839343bbd
Merge pull request #4172 from x13n/master
Allow DaemonSet pods to opt in/out from eviction
2021-06-29 05:38:40 -07:00