autoscaler/cluster-autoscaler
Nic Cope 6a704a6cf4 Break down cloud provider builder by provider
The Build method was getting pretty big, this hopefully makes it a little
more readable. It also fixes a few minor error shadowing bugs.
2017-12-11 13:09:56 -08:00
Godeps Godeps update 2017-11-28 14:00:58 +01:00
_override/google.golang.org Add appropriate license to _override 2017-11-28 14:31:20 +01:00
cloudprovider Break down cloud provider builder by provider 2017-12-11 13:09:56 -08:00
clusterstate Increase MaxNodeStartupTime to 15 minutes. 2017-11-13 15:14:47 +01:00
config respect minimum cores/memory limit during scale down 2017-09-13 10:10:47 +02:00
core Remove the Polling Autoscaler. 2017-12-11 13:09:56 -08:00
deploy bump container image version to 0.6.0 2017-08-09 16:03:09 -07:00
estimator Small optimize code 2017-09-04 23:50:45 +03:00
expander Create node groups with GPU in scale-up.go 2017-12-11 13:12:22 +01:00
metrics Add metrics for autoprovisioning 2017-10-31 17:42:58 +01:00
proposals Update metrics documentation 2017-11-07 17:37:10 +01:00
simulator Source fix after godep update 2017-11-28 14:01:43 +01:00
utils Create node groups with GPU in scale-up.go 2017-12-11 13:12:22 +01:00
vendor Remove Windows-specific libraries from godeps 2017-11-28 14:48:05 +01:00
.gitignore Cluster-Autoscaler - Kubernetes client deps 2016-04-20 11:49:38 +02:00
Dockerfile cluster-autoscaler: Add ca-certificates to the docker image 2017-05-09 12:30:59 +09:00
FAQ.md Release notes for Cluster Autoscaler 1.0.3 2017-11-17 13:35:32 +01:00
Makefile Extra checks when pushing an image to gcr repository 2017-11-17 15:49:52 +01:00
OWNERS Make assignees approvers and reviewers 2016-12-14 16:42:04 -08:00
README.md Merge pull request #485 from mwielgus/azure-readme 2017-11-23 08:34:23 +01:00
fix_gopath.sh Rename override to _override to allow ./... patterns in go command 2017-11-28 14:21:17 +01:00
kubernetes.sync Source fix after godep update 2017-11-28 14:01:43 +01:00
main.go Break down cloud provider builder by provider 2017-12-11 13:09:56 -08:00
push_image.sh Extra checks when pushing an image to gcr repository 2017-11-17 15:49:52 +01:00
run.sh Cluster-Autoscaler: added wrapper script to pass signals 2017-02-28 17:39:29 +01:00
update_toc.py Fix update_toc.py script to stop appending empty lines 2017-06-30 14:18:18 +02:00
version.go Cluster Autoscaler 1.1.0 2017-12-08 10:16:14 -06:00

README.md

Cluster Autoscaler

Introduction

Cluster Autoscaler is a tool that automatically adjusts the size of the Kubernetes cluster when:

  • there are pods that failed to run in the cluster due to insufficient resources.
  • some nodes in the cluster are so underutilized, for an extended period of time, that they can be deleted and their pods will be easily placed on some other, existing nodes.
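The two conditions above can be sketched as a single decision function. This is only an illustrative sketch: the `ClusterView` type, its fields, and the `threshold` and `minDuration` parameters are hypothetical names chosen for this example and are not taken from the Cluster Autoscaler code base.

```go
package main

import (
	"fmt"
	"time"
)

// Action is the hypothetical outcome of one autoscaling decision.
type Action string

const (
	ScaleUp   Action = "scale-up"
	ScaleDown Action = "scale-down"
	NoOp      Action = "no-op"
)

// ClusterView is an illustrative snapshot of the cluster. None of these
// fields come from the real Cluster Autoscaler implementation.
type ClusterView struct {
	UnschedulablePods int           // pods that failed to run for lack of resources
	NodeUtilization   float64       // utilization of the least-used node, 0.0 to 1.0
	UnderutilizedFor  time.Duration // how long that node has stayed under the threshold
	PodsFitElsewhere  bool          // whether its pods can be placed on existing nodes
}

// Decide applies the two conditions from the introduction. The threshold and
// minimum duration are caller-supplied assumptions, not documented defaults.
func Decide(v ClusterView, threshold float64, minDuration time.Duration) Action {
	if v.UnschedulablePods > 0 {
		return ScaleUp
	}
	if v.NodeUtilization < threshold && v.UnderutilizedFor >= minDuration && v.PodsFitElsewhere {
		return ScaleDown
	}
	return NoOp
}

func main() {
	pending := ClusterView{UnschedulablePods: 3}
	fmt.Println(Decide(pending, 0.5, 10*time.Minute))
}
```

Pending pods are checked first in this sketch, so a cluster with unschedulable pods is never shrunk in the same decision.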

FAQ/Documentation

The FAQ and documentation are available in FAQ.md.

Releases

We strongly recommend using Cluster Autoscaler with the Kubernetes version for which it was meant. We don't do ANY cross-version testing, so if you put the newest Cluster Autoscaler on an old cluster there is a big chance that it won't work as expected.

Kubernetes Version    CA Version
1.8.X                 1.0.X
1.7.X                 0.6.X
1.6.X                 0.5.X, 0.6.X*
1.5.X                 0.4.X
1.4.X                 0.3.X

*Cluster Autoscaler 0.5.X is the official version shipped with k8s 1.6. We've done some basic tests using k8s 1.6 / CA 0.6 and we're not aware of any problems with this setup. However, CA internally simulates the k8s scheduler, and using different versions of the scheduler code can lead to subtle issues.
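The compatibility table above can be encoded as a lookup, for example to check a cluster's version before choosing an image. This is a minimal sketch: the function and variable names (`RecommendedCA`, `minorOf`, `compatibility`) are hypothetical, and only the version pairs come from the table.

```go
package main

import (
	"fmt"
	"strings"
)

// compatibility mirrors the table above: Kubernetes minor version to the
// Cluster Autoscaler release lines listed for it.
var compatibility = map[string][]string{
	"1.8": {"1.0.X"},
	"1.7": {"0.6.X"},
	"1.6": {"0.5.X", "0.6.X"}, // 0.6.X has only had basic testing on 1.6
	"1.5": {"0.4.X"},
	"1.4": {"0.3.X"},
}

// minorOf reduces a version such as "v1.8.4" or "1.8.4" to "1.8".
func minorOf(version string) string {
	parts := strings.SplitN(strings.TrimPrefix(version, "v"), ".", 3)
	if len(parts) < 2 {
		return version
	}
	return parts[0] + "." + parts[1]
}

// RecommendedCA returns the CA release lines listed for a Kubernetes version
// and false when that version does not appear in the table.
func RecommendedCA(k8sVersion string) ([]string, bool) {
	lines, ok := compatibility[minorOf(k8sVersion)]
	return lines, ok
}

func main() {
	for _, v := range []string{"v1.8.4", "1.6.0", "1.9.0"} {
		if lines, ok := RecommendedCA(v); ok {
			fmt.Printf("Kubernetes %s: use CA %s\n", v, strings.Join(lines, " or "))
		} else {
			fmt.Printf("Kubernetes %s: no tested CA version listed\n", v)
		}
	}
}
```

Returning a boolean alongside the result keeps "version not listed" distinct from an empty recommendation, which matters because no cross-version testing is done.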

Notable changes

CA Version 1.0.3:

  • Adds support for the safe-to-evict annotation on pods. Pods with this annotation can be evicted even if they don't meet the other requirements for eviction.
  • Fixes an issue where too many nodes with GPUs could be added during scale-up (https://github.com/kubernetes/kubernetes/issues/54959).

CA Version 1.0.2:

CA Version 1.0.1:

CA Version 1.0:

With this release we graduated Cluster Autoscaler to GA.

  • Support for 1000 nodes running 30 pods each. See: Scalability testing report
  • Support for 10 min graceful termination.
  • Improved eventing and monitoring.
  • Node allocatable support.
  • Removed Azure support. See: PR removing support with reasoning behind this decision
  • cluster-autoscaler.kubernetes.io/scale-down-disabled annotation for marking nodes that should not be scaled down.
  • scale-down-delay-after-delete and scale-down-delay-after-failure flags replaced scale-down-trial-interval.
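As an illustration of the scale-down-disabled annotation introduced in 1.0, the sketch below filters nodes by it. The helper names are hypothetical, and the annotation value "true" is an assumption: the notes above name the annotation key but not its value.

```go
package main

import (
	"fmt"
	"sort"
)

// ScaleDownDisabledKey is the node annotation named in the 1.0 release notes.
const ScaleDownDisabledKey = "cluster-autoscaler.kubernetes.io/scale-down-disabled"

// ScaleDownDisabled reports whether a node's annotations opt it out of
// scale-down. Assumption: the opt-out value is "true".
func ScaleDownDisabled(annotations map[string]string) bool {
	return annotations[ScaleDownDisabledKey] == "true"
}

// ScaleDownCandidates returns, in sorted order, the names of nodes that have
// not opted out. The input maps a node name to that node's annotations.
func ScaleDownCandidates(nodes map[string]map[string]string) []string {
	var names []string
	for name, annotations := range nodes {
		if !ScaleDownDisabled(annotations) {
			names = append(names, name)
		}
	}
	sort.Strings(names)
	return names
}

func main() {
	nodes := map[string]map[string]string{
		"node-a": {ScaleDownDisabledKey: "true"},
		"node-b": {},
		"node-c": nil,
	}
	fmt.Println(ScaleDownCandidates(nodes))
}
```

Nodes without any annotations remain candidates, since reading from a nil map in Go yields the empty string.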

CA Version 0.6:

CA Version 0.5.4:

  • Fixes problems with node drain when pods are ignoring SIGTERM.

CA Version 0.5.3:

CA Version 0.5.2:

CA Version 0.5.1:

CA Version 0.5:

  • CA continues to operate even if some nodes are unready and is able to scale them down.
  • CA exports its status to kube-system/cluster-autoscaler-status config map.
  • CA respects PodDisruptionBudgets.
  • Azure support.
  • Alpha support for dynamic config changes.
  • Multiple expanders to decide which node group to scale up.

CA Version 0.4:

  • Bulk empty node deletions.
  • Better scale-up estimator based on binpacking.
  • Improved logging.

CA Version 0.3:

  • AWS support.
  • Performance improvements around scale down.

Deployment

Cluster Autoscaler runs on the Kubernetes master node (at least in the default setup on GCE and GKE). It is possible to run a customized Cluster Autoscaler inside the cluster, but then extra care needs to be taken to ensure that Cluster Autoscaler stays up and running. Users can put it into the kube-system namespace (Cluster Autoscaler doesn't scale down nodes with non-manifest-based kube-system pods running on them) and mark it with the scheduler.alpha.kubernetes.io/critical-pod annotation (so that the rescheduler, if enabled, will kill other pods to make space for it to run).
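The two safeguards described above for an in-cluster deployment can be checked with a small helper. This is a sketch only: `DeploymentWarnings` is a hypothetical function, and it tests only for the presence of the critical-pod annotation because the text does not specify a value.

```go
package main

import "fmt"

const (
	systemNamespace = "kube-system"
	criticalPodKey  = "scheduler.alpha.kubernetes.io/critical-pod"
)

// DeploymentWarnings lists which of the two safeguards are missing for a pod
// that runs Cluster Autoscaler inside the cluster. An empty result means both
// are in place.
func DeploymentWarnings(namespace string, annotations map[string]string) []string {
	var warnings []string
	if namespace != systemNamespace {
		warnings = append(warnings,
			"pod is not in "+systemNamespace+", so its node may be scaled down")
	}
	// Only the key's presence is checked; its value is not assumed here.
	if _, ok := annotations[criticalPodKey]; !ok {
		warnings = append(warnings,
			"pod lacks the "+criticalPodKey+" annotation, so the rescheduler will not make space for it")
	}
	return warnings
}

func main() {
	for _, w := range DeploymentWarnings("default", nil) {
		fmt.Println("warning:", w)
	}
}
```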

Right now it is possible to run Cluster Autoscaler on: