5067 Commits

Author SHA1 Message Date
Michael McCune 0499b886d4 update cluster-autoscaler CAPI provider owners
This change adds GitHub users arunmk, mrajashree, jackfrancis,
shysank, and randomvariable to the reviewers for the cluster-api
provider. It also removes frobware and ncdc from the approvers and
reviewers.
2021-07-15 14:36:19 -04:00
Kubernetes Prow Robot fa39793465
Merge pull request #4193 from Ritikaa96/update-faq-cluster-autoscaler
Cluster-autoscaler-FAQ: Updating link addresses for MinAvailable and Cloud-controller-manager
2021-07-14 02:10:27 -07:00
Kubernetes Prow Robot 5e0fbd943d
Merge pull request #4195 from mattxwang/master
docs: replaces Travis CI badge with GitHub Actions badges
2021-07-14 00:30:27 -07:00
Matthew Wang 9802a19ee5
docs: replaces Travis CI badge with GitHub Actions badges 2021-07-13 10:20:32 -07:00
Ritikaa96 099f2a30d6 Updating link addresses for MinAvailable and Cloud-controller-manager
Signed-off-by: Ritikaa96 <ritika@india.nec.com>
2021-07-13 18:23:54 +05:30
Kubernetes Prow Robot d221b0f620
Merge pull request #4187 from MaciekPytel/1_22_beta_1
Update dependencies to k8s 1.22.0-beta.1
2021-07-09 04:48:53 -07:00
Maciek Pytel 1d0a9e4630 Update dependencies to k8s 1.22.0-beta.1
Some changes in scheduler framework initialization
in response to upstream refactors.
2021-07-09 13:30:18 +02:00
Kubernetes Prow Robot 389cfd2e18
Merge pull request #4184 from MaciekPytel/fix_ut
Skipping metrics tests added in #4022
2021-07-08 06:34:54 -07:00
Maciek Pytel 52392b3707 Skipping metrics tests added in #4022
Each test works in isolation, but they cause a panic when the entire
suite is run (e.g. make test-in-docker), because the underlying
metrics library panics when the same metric is registered twice.
2021-07-08 15:14:26 +02:00
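
The failure mode described in this commit (tests that pass alone but panic together because one metric is registered twice) can be reproduced with any process-wide registry that rejects duplicate names. The sketch below uses Go's standard-library expvar package as a stand-in; it is not the metrics library cluster-autoscaler uses, and the metric name is a hypothetical placeholder.

```go
package main

import (
	"expvar"
	"fmt"
)

// registerTwicePanics publishes the same variable name twice in the
// process-wide expvar registry and reports whether the second
// registration panicked. expvar is only an analogy here: like the
// metrics library mentioned in the commit, it refuses duplicate names.
func registerTwicePanics(name string) (panicked bool) {
	defer func() {
		if r := recover(); r != nil {
			panicked = true
		}
	}()
	expvar.NewInt(name) // first "test" registers the metric
	expvar.NewInt(name) // second "test" registers it again and panics
	return false
}

func main() {
	// "node_group_min_count" is a hypothetical metric name.
	fmt.Println("second registration panicked:", registerTwicePanics("node_group_min_count"))
}
```

Because the registry is global, each test is fine in its own process, while a full-suite run shares one registry and hits the duplicate.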
Kubernetes Prow Robot 7f6f9531d9
Merge pull request #4183 from stevehipwell/chart-semver
[chart] Bump version to valid semver
2021-07-08 03:08:53 -07:00
Steve Hipwell 41520aace5 [chart] Bump version to valid semver
Signed-off-by: Steve Hipwell <steve.hipwell@proagrica.com>
2021-07-08 10:50:51 +01:00
Kubernetes Prow Robot 23fdf63bfc
Merge pull request #4154 from MarcusNoble/chart_rbac
fix: add missing RBAC permissions to autoscaler chart
2021-07-08 01:36:53 -07:00
Kubernetes Prow Robot 9f84d391f6
Merge pull request #4022 from amrmahdi/amrh/nodegroupminmaxmetrics
[cluster-autoscaler] Publish node group min/max metrics
2021-07-05 07:38:54 -07:00
Marcus Noble 3341c863e0
Merge branch 'master' into chart_rbac 2021-07-05 15:12:17 +01:00
Kubernetes Prow Robot 7858da6716
Merge pull request #4175 from mweibel/patch-1
update cluster-autoscaler chart to 1.21.0
2021-07-05 03:46:54 -07:00
Michael Weibel 7ddbe272f8
update cluster-autoscaler chart to 1.21.0 2021-07-05 12:33:46 +02:00
Kubernetes Prow Robot a4576749bb
Merge pull request #4173 from x13n/docs
Document DaemonSet eviction opt in/out behavior
2021-07-02 06:00:13 -07:00
Kubernetes Prow Robot a7ddc7e2c2
Merge pull request #4169 from rimusz/master
Add support for AWS Osaka region
2021-07-01 01:37:54 -07:00
Daniel Kłobuszewski 45f5b121a5 Document DaemonSet eviction opt in/out behavior 2021-06-30 08:56:42 +02:00
Kubernetes Prow Robot a839343bbd
Merge pull request #4172 from x13n/master
Allow DaemonSet pods to opt in/out from eviction
2021-06-29 05:38:40 -07:00
Daniel Kłobuszewski 44b8d67d50 Allow DaemonSet pods to opt in/out from eviction 2021-06-29 11:58:14 +02:00
rimas 3fa210fbc0 Add support for AWS Osaka region 2021-06-28 12:08:01 +03:00
Kubernetes Prow Robot 682dec3eb0
Merge pull request #4166 from mweibel/patch-1
add Standard_HB120rs_v3
2021-06-28 00:29:25 -07:00
Michael Weibel 09b07ca549
add Standard_HB120rs_v3 2021-06-25 18:37:05 +02:00
Kubernetes Prow Robot 509b3e337c
Merge pull request #4162 from x13n/master
Add a flag to control DaemonSet eviction on non-empty nodes
2021-06-25 02:20:45 -07:00
Daniel Kłobuszewski 081c4664d3 Add a flag to control DaemonSet eviction on non-empty nodes 2021-06-25 11:06:10 +02:00
Kubernetes Prow Robot 07c7607974
Merge pull request #4158 from olagacek/master
Use CreateInstances() API when scaling up in GCE cloud provider
2021-06-23 07:42:11 -07:00
Aleksandra Gacek 674de4f2af Use CreateInstances() API when scaling up in GCE cloud provider 2021-06-23 16:28:46 +02:00
Kubernetes Prow Robot 267f306d45
Merge pull request #4153 from marvinpinto/hetzner-additions
Add the ability to spin up Hetzner servers from custom snapshots
2021-06-23 04:40:10 -07:00
Marvin Pinto 838ea229c0 Fixes to address code review comments 2021-06-22 06:58:15 -04:00
Marvin Pinto b2ead3b2e3 Add the ability to spin up Hetzner servers from custom snapshots
This comes in handy when using tools such as Packer to generate customized
images.
2021-06-21 17:50:09 -04:00
Kubernetes Prow Robot 1d4e6dff1b
Merge pull request #4151 from hetznercloud/fix-nil-pointer-dereference
Watch Action instead of polling Server Status
2021-06-21 06:46:51 -07:00
Marcus Noble f5cf35ba39 fix: add missing RBAC permissions to autoscaler chart 2021-06-21 07:32:20 +01:00
Ferdinand Hofherr 7f19fb1c8e Watch Action instead of polling Server Status
We have received a report from one of our customers that their
autoscaler pod regularly crashes due to a nil pointer dereference panic.
Analyzing the code, we found that the autoscaler polls the server
status to find out if a server is running.

The Hetzner Cloud Go client is implemented in such a way that it does
not return an error if a resource could not be found. Instead it returns
nil for both the error and the resource. Usually this is not an issue.
However, if the server creation fails, the server gets deleted from
Hetzner Cloud. This in turn leads to nil being returned and the
above-mentioned panic.

The Hetzner Cloud API implements a concept called Actions. Whenever a
long-running process is triggered, we return an object which can
be used to get information about the progress of the task. The Action
object makes it possible to reliably detect whether a server has been
created and provides access to any error that may have occurred.

This commit replaces polling the server status with using the action
object.
2021-06-18 13:44:16 +02:00
Kubernetes Prow Robot 6e3e05d80c
Merge pull request #4149 from olagacek/master
Allow overriding userAgent in Custom GCE client in gce cloud provider.
2021-06-17 04:40:19 -07:00
Aleksandra Gacek 6fd23f9c49 Allow overriding userAgent in Custom GCE client in gce cloud provider. 2021-06-17 13:21:30 +02:00
Kubernetes Prow Robot 1b502e09f6
Merge pull request #4147 from timoreimann/digitalocean/support-reading-access-token-from-file
digitalocean: support reading access token from file
2021-06-16 08:15:59 -07:00
Timo Reimann 05e2011096 digitalocean: support reading access token from file
This makes it possible to securely store the access token in a file and
load it into the cloud provider from there.

Document DigitalOcean's cloud config format while we are here.
2021-06-16 17:03:17 +02:00
Kubernetes Prow Robot 7d7df8c48c
Merge pull request #4143 from BigDarkClown/master
Skip iteration loop if node creation failed
2021-06-16 07:55:59 -07:00
Bartłomiej Wróblewski 5076047bf8 Skip iteration loop if node creation failed 2021-06-16 14:40:15 +00:00
Kubernetes Prow Robot 6e276be8f1
Merge pull request #4145 from timoreimann/digitalocean/remove-tag-references-from-readme
digitalocean: remove tag references from README
2021-06-16 06:33:59 -07:00
Kubernetes Prow Robot cdf62a1508
Merge pull request #4144 from timoreimann/digitalocean/do-not-refresh-on-startup
digitalocean: do not Refresh() on startup
2021-06-16 06:31:59 -07:00
Kubernetes Prow Robot 84f7628cf1
Merge pull request #4146 from pgier/fix-addon-resizer-example-config
improve addon-resizer deployment example
2021-06-16 04:57:58 -07:00
Timo Reimann 0de2f81f73 digitalocean: remove tag references from README
An initial version of the DigitalOcean cloud provider implementation
relied on tags to define the behavior but has since been transitioned to
using the public DOKS API. Update the README accordingly.
2021-06-15 23:36:58 +02:00
Paul Gier 92751c4a5a
improve addon-resizer deployment example
This fixes the comment at the beginning of the addon-resizer deployment example
so that it describes the correct key names to use for configuring the cpu
and memory parameters.
2021-06-15 16:36:02 -05:00
Timo Reimann 7faca8b10b digitalocean: do not Refresh() on startup
If the API is temporarily unavailable, cluster-autoscaler will
crash-loop on startup during the initial call to Refresh(). This
makes for a bad user/operator experience since it makes it harder to
differentiate between API and cluster/workload problems.

Let autoscaler start up and retry fetching node pool information from
the API as part of the pre-existing, periodic sync. This should be no
different to experiencing transient API problems during runtime.
2021-06-15 23:30:46 +02:00
Kubernetes Prow Robot f57b8f9f0f
Merge pull request #4137 from jayantjain93/permission-error
Adding support for PERMISSIONS_ERROR in gce cloud provider
2021-06-11 07:38:59 -07:00
Jayant Jain 671df22f9a Adding support for PERMISSIONS_ERROR in gce cloud provider 2021-06-11 12:42:04 +00:00
Kubernetes Prow Robot 0623a00d29
Merge pull request #4130 from marwanad/dont-decrement-azure-cache
don't proactively decrement azure cache for unregistered nodes
2021-06-10 22:06:13 -07:00
Kubernetes Prow Robot 2201e9bc7b
Merge pull request #4109 from Kafei59/feat/ovh-us
feat(ovh): enable OVHcloud provider for US side
2021-06-10 16:56:12 -07:00