13 KiB
Cluster Autoscaler on Azure
The cluster autoscaler on Azure dynamically scales Kubernetes worker nodes. It runs as a deployment in your cluster.
This README will help you get cluster autoscaler running on your Azure Kubernetes cluster.
Kubernetes Version
Kubernetes v1.10.x or later is required to use cluster autoscaler on Azure. See the "Releases" section in the README for more information.
CA Version
Cluster autoscaler v1.2.x or later is required for Azure. See the "Releases" section in the README for more information.
NOTE: In the deployment manifests referenced below, be sure to replace the
{{ ca_version }}placeholder with an actual release, such asv1.14.2.
Permissions
Get Azure credentials by running the following Azure CLI command:
# replace <subscription-id> with yours.
az ad sp create-for-rbac --role="Contributor" --scopes="/subscriptions/<subscription-id>" --output json
This will create a new service principal with "Contributor" role scoped to your subscription. Save the JSON output, because it will be needed to configure the cluster autoscaler deployment in the next step.
Scaling a VMSS node group to and from 0
If you are using nodeSelector, you need to tag the VMSS with a node-template key "k8s.io_cluster-autoscaler_node-template_label_" for using labels and "k8s.io_cluster-autoscaler_node-template_taint_" if you are using taints.
Note that these tags use the pipe
_character compared to a forward slash due to Azure tag name restrictions.
Examples
Labels
To add the label of foo=bar to a node from a VMSS pool, you would add the following tag to the VMSS k8s.io_cluster-autoscaler_node-template_label_foo: bar.
You can also use forward slashes in the labels by setting them as an underscore in the tag name. For example to add the label of k8s.io/foo=bar to a node from a VMSS pool, you would add the following tag to the VMSS k8s.io_cluster-autoscaler_node-template_label_k8s.io_foo: bar
Taints
To add the taint of foo=bar:NoSchedule to a node from a VMSS pool, you would add the following tag to the VMSS k8s.io_cluster-autoscaler_node-template_taint_foo: bar:NoSchedule.
You can also use forward slashes in taints by setting them as an underscore in the tag name. For example to add the taint of k8s.io/foo=bar:NoSchedule to a node from a VMSS pool, you would add the following tag to the VMSS k8s.io_cluster-autoscaler_node-template_taint_k8s.io_foo: bar:NoSchedule
Deployment manifests
Cluster autoscaler supports four Kubernetes cluster options on Azure:
- vmss: Autoscale VMSS instances by setting the Azure cloud provider's
vmTypeparameter tovmssor to an empty string. This supports clusters deployed with aks-engine. - standard: Autoscale VMAS instances by setting the Azure cloud provider's
vmTypeparameter tostandard. This supports clusters deployed with aks-engine. - aks: Supports an Azure Kubernetes Service (AKS) cluster.
NOTE: only the
vmssoption supports scaling down to zero nodes.
NOTE: The
subscriptionIDparameter is optional. When skipped, the subscription will be fetched from the instance metadata.
VMSS deployment
Prerequisites:
- Get Azure credentials from the Permissions step above.
- Get the name of the VM scale set associated with the cluster's node pool. You can find this in the Azure Portal or with the
az vmss listcommand.
Make a copy of cluster-autoscaler-vmss.yaml. Fill in the placeholder values for the cluster-autoscaler-azure secret data by base64-encoding each of your Azure credential fields.
- ClientID:
<base64-encoded-client-id> - ClientSecret:
<base64-encoded-client-secret> - ResourceGroup:
<base64-encoded-resource-group> - SubscriptionID:
<base64-encoded-subscription-id> - TenantID:
<base64-encoded-tenant-id>
NOTE: Use a command such as
echo $CLIENT_ID | base64to encode each of the fields above.
NOTE (optional) to specify the TTL of VMSS ASG cache to prevent throttling issue, please provide the env
AZURE_ASG_CACHE_TTLin seconds which is set to one hour by default.
In the cluster-autoscaler spec, find the image: field and replace {{ ca_version }} with a specific cluster autoscaler release.
Auto-Discovery Setup
To run a cluster-autoscaler which auto-discovers VMSSs with nodes use the --node-group-auto-discovery flag.
For example, --node-group-auto-discovery=label:cluster-autoscaler-enabled=true,cluster-autoscaler-name=<YOUR CLUSTER NAME> will find the VMSSs tagged with those tags containing those values.
Note that:
- It is recommended to use a second tag like
cluster-autoscaler-name=<YOUR CLUSTER NAME>whencluster-autoscaler-enabled=trueis used across many clusters to prevent VMSSs from different clusters recognized as the node groups - There are no
--nodesflags passed to cluster-autoscaler because the node groups are automatically discovered by tags - No min/max values are provided when using Auto-Discovery, cluster-autoscaler will detect the "min" and "max" tags on the VMSS resource in Azure, adjusting the desired number of nodes within these limits.
kubectl apply -f examples/cluster-autoscaler-autodiscover.yaml
Explicit setup
Below that, in the command: section, update the --nodes= arguments to reference your node limits and VMSS name. For example, if node pool "k8s-nodepool-1-vmss" should scale from 1 to 10 nodes:
- --nodes=1:10:k8s-nodepool-1-vmss
or to autoscale multiple VM scale sets:
- --nodes=1:10:k8s-nodepool-1-vmss
- --nodes=1:10:k8s-nodepool-2-vmss
Note that it doesn't mean the number of nodes in nodepool is restricted in the range from 1 to 10. It means when ca is downscaling (upscaling) the nodepool, it will never break the limit of 1 (10). If the current node pool size is lower than the specified minimum or greater than the specified maximum when you enable autoscaling, the autoscaler waits to take effect until a new node is needed in the node pool or until a node can be safely deleted from the node pool.
To allow scaling similar node pools simultaneously, or when using separate node groups per zone and to keep nodes balanced across zones, use the --balance-similar-node-groups flag (default false). Add it to the command section to enable it:
- --balance-similar-node-groups=true
See the FAQ for more details.
Save the updated deployment manifest, then deploy cluster-autoscaler by running:
kubectl create -f cluster-autoscaler-vmss.yaml
To run a cluster autoscaler pod on a master node, the deployment should tolerate the master taint, and nodeSelector should be used to schedule pods. Use cluster-autoscaler-vmss-master.yaml in this case.
To run a cluster autoscaler pod with Azure managed service identity (MSI), use cluster-autoscaler-vmss-msi.yaml instead.
Standard deployment
Prerequisites:
- Get Azure credentials from the Permissions step above.
- Get the name of the initial Azure deployment resource for the cluster. You can find this in the Azure Portal or with the
az deployment listcommand. If there are multiple deployments, get the name of the first one.
Make a copy of cluster-autoscaler-standard-master.yaml. Fill in the placeholder values for the cluster-autoscaler-azure secret data by base64-encoding each of your Azure credential fields.
- ClientID:
<base64-encoded-client-id> - ClientSecret:
<base64-encoded-client-secret> - ResourceGroup:
<base64-encoded-resource-group> - SubscriptionID:
<base64-encoded-subscription-id> - TenantID:
<base64-encoded-tenant-id> - Deployment:
<base64-encoded-azure-initial-deployment-name>
NOTE: Use a command such as
echo $CLIENT_ID | base64to encode each of the fields above.
In the cluster-autoscaler spec, find the image: field and replace {{ ca_version }} with a specific cluster autoscaler release.
Below that, in the command: section, update the --nodes= arguments to reference your node limits and node pool name (tips: node pool name is NOT availability set name, e.g., the corresponding node pool name of the availability set
agentpool1-availabilitySet-xxxxxxxx would be agentpool1). For example, if node pool "k8s-nodepool-1" should scale from 1 to 10 nodes:
- --nodes=1:10:k8s-nodepool-1
or to autoscale multiple VM scale sets:
- --nodes=1:10:k8s-nodepool-1
- --nodes=1:10:k8s-nodepool-2
Create the Azure deploy parameters secret cluster-autoscaler-azure-deploy-parameters by running:
kubectl -n kube-system create secret generic cluster-autoscaler-azure-deploy-parameters --from-file=deploy-parameters=./_output/<your-output-path>/azuredeploy.parameters.json
Then deploy cluster-autoscaler by running:
kubectl create -f cluster-autoscaler-standard-master.yaml
To run a cluster autoscaler pod with Azure managed service identity (MSI), use cluster-autoscaler-standard-msi.yaml instead.
WARNING: Cluster autoscaler depends on user-provided deployment parameters to provision new nodes. After upgrading your Kubernetes cluster, cluster autoscaler must also be redeployed with new parameters to prevent provisioning nodes with an old version.
AKS deployment
AKS + VMSS
Autoscaling VM scale sets with AKS is supported for Kubernetes v1.12.4 and later. The option to enable cluster autoscaler is available in the Azure Portal or with the Azure CLI:
az aks create \
--resource-group myResourceGroup \
--name myAKSCluster \
--kubernetes-version 1.13.5 \
--node-count 1 \
--enable-vmss \
--enable-cluster-autoscaler \
--min-count 1 \
--max-count 3
AKS + Availability Set
The CLI based deployment only support VMSS and manual deployment is needed if availability set is used.
Prerequisites:
- Get Azure credentials from the Permissions step above.
- Get the cluster name with the
az aks listcommand. - Get the name of a node pool from the value of the label agentpool
kubectl get nodes --show-labels
Make a copy of cluster-autoscaler-aks.yaml. Fill in the placeholder values for
the cluster-autoscaler-azure secret data by base64-encoding each of your Azure credential fields.
- ClientID:
<base64-encoded-client-id> - ClientSecret:
<base64-encoded-client-secret> - ResourceGroup:
<base64-encoded-resource-group>(Note: ResourceGroup is case-sensitive) - SubscriptionID:
<base64-encoded-subscription-id> - TenantID:
<base64-encoded-tenant-id> - ClusterName:
<base64-encoded-clustername> - NodeResourceGroup:
<base64-encoded-node-resource-group>(Note: node resource group is not resource group and can be obtained in the corresponding label of the nodepool)
NOTE: Use a command such as
echo $CLIENT_ID | base64to encode each of the fields above.
In the cluster-autoscaler spec, find the image: field and replace {{ ca_version }} with a specific cluster autoscaler release.
Below that, in the command: section, update the --nodes= arguments to reference your node limits and node pool name. For example, if node pool "k8s-nodepool-1" should scale from 1 to 10 nodes:
- --nodes=1:10:k8s-nodepool-1
or to autoscale multiple VM scale sets:
- --nodes=1:10:k8s-nodepool-1
- --nodes=1:10:k8s-nodepool-2
Then deploy cluster-autoscaler by running
kubectl create -f cluster-autoscaler-aks.yaml
Please see the AKS autoscaler documentation for details.