cleanup clusterapi scale from zero implementation

This commit is a combination of several commits. Significant details are preserved below. * update functions for resource annotations This change converts some of the functions that look at annotation for resource usage to indicate their usage in the function name. This helps to make room for allowing the infrastructure reference as an alternate source for the capacity information. * migrate capacity logic into a single function This change moves the logic to collect the instance capacity from the TemplateNodeInfo function into a method of the unstructuredScalableResource named InstanceCapacity. This new function is created to house the logic that will decide between annotations and the infrastructure reference when calculating the capacity for the node. * add ability to lookup infrastructure references This change supplements the annotation lookups by adding the logic to read the infrastructure reference if it exists. This is done to determine if the machine template exposes a capacity field in its status. For more information on how this mechanism works, please see the cluster-api enhancement[0]. * add documentation for capi scaling from zero * improve tests for clusterapi scale from zero this change adds functionality to test the dynamic client behavior of getting the infrastructure machine templates. * update README with information about rbac changes this adds more information about the rbac changes necessary for the scale from zero support to work. * remove extra check for scaling from zero since the CanScaleFromZero function checks to see if both CPU and memory are present, there is no need to check a second time. This also adds some documentation to the CanScaleFromZero function to make it clearer what is happening. * update unit test for capi scale from zero adding a few more cases and details to the scale from zero unit tests, including ensuring that the int based annotations do not accept other unit types. [0] https://github.com/kubernetes-sigs/cluster-api/blob/main/docs/proposals/20210310-opt-in-autoscaling-from-zero.md
2022-02-25 08:36:31 -05:00 · 2022-02-25 08:36:31 -05:00 · 1a65fde540
parent de90a462c7
commit 1a65fde540
10 changed files with 725 additions and 264 deletions
--- a/cluster-autoscaler/cloudprovider/clusterapi/README.md
+++ b/cluster-autoscaler/cloudprovider/clusterapi/README.md
@ -163,6 +163,70 @@ There are two annotations that control how a cluster resource should be scaled:
 The autoscaler will monitor any `MachineSet` or `MachineDeployment` containing
 both of these annotations.

+### Scale from zero support
+
+The Cluster API community has defined an opt-in method for infrastructure
+providers to enable scaling from zero-sized node groups in the
+[Opt-in Autoscaling from Zero enhancement](https://github.com/kubernetes-sigs/cluster-api/blob/main/docs/proposals/20210310-opt-in-autoscaling-from-zero.md).
+As defined in the enhancement, each provider may add support for scaling from
+zero to their provider, but they are not required to do so. If you are expecting
+built-in support for scaling from zero, please check with the Cluster API
+infrastructure providers that you are using.
+
+If your Cluster API provider does not have support for scaling from zero, you
+may still use this feature through the capacity annotations. You may add these
+annotations to your MachineDeployments, or MachineSets if you are not using
+MachineDeployments (it is not needed on both), to instruct the cluster
+autoscaler about the sizing of the nodes in the node group. At the minimum,
+you must specify the CPU and memory annotations, these annotations should
+match the expected capacity of the nodes created from the infrastructure.
+
+For example, if my MachineDeployment will create nodes that have "16000m" CPU,
+"128G" memory, 2 NVidia GPUs, and can support 200 max pods, the folllowing
+annotations will instruct the autoscaler how to expand the node group from
+zero replicas:
+
+```yaml
+apiVersion: cluster.x-k8s.io/v1alpha4
+kind: MachineDeployment
+metadata:
+  annotations:
+    cluster.x-k8s.io/cluster-api-autoscaler-node-group-max-size: "5"
+    cluster.x-k8s.io/cluster-api-autoscaler-node-group-min-size: "0"
+    capacity.cluster-autoscaler.kubernetes.io/memory: "128G"
+    capacity.cluster-autoscaler.kubernetes.io/cpu: "16"
+    capacity.cluster-autoscaler.kubernetes.io/gpu-type: "nvidia.com/gpu"
+    capacity.cluster-autoscaler.kubernetes.io/gpu-count: "2"
+    capacity.cluster-autoscaler.kubernetes.io/maxPods: "200"
+```
+
+*Note* the `maxPods` annotation will default to `110` if it is not supplied.
+This value is inspired by the Kubernetes best practices
+[Considerations for large clusters](https://kubernetes.io/docs/setup/best-practices/cluster-large/).
+
+#### RBAC changes for scaling from zero
+
+If you are using the opt-in support for scaling from zero as defined by the
+Cluster API infrastructure provider, you will need to add the infrastructure
+machine template types to your role permissions for the service account
+associated with the cluster autoscaler deployment. The service account will
+need permission to `get` and `list` the infrastructure machine templates for
+your infrastructure provider.
+
+For example, when using the [Kubemark provider](https://github.com/kubernetes-sigs/cluster-api-provider-kubemark)
+you will need to set the following permissions:
+
+```yaml
+rules:
+  - apiGroups:
+    - infrastructure.cluster.x-k8s.io
+    resources:
+    - kubemarkmachinetemplates
+    verbs:
+    - get
+    - list
+```
+
 ## Specifying a Custom Resource Group

 By default all Kubernetes resources consumed by the Cluster API provider will
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_autodiscovery_test.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_autodiscovery_test.go
@ -204,17 +204,17 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch         bool
 	}{{
 		name:                "no clustername, namespace, or label selector specified should match any MachineSet",
-		testSpec:            createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, false, nil),
+		testSpec:            createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, false, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{labelSelector: labels.NewSelector()},
 		shouldMatch:         true,
 	}, {
 		name:                "no clustername, namespace, or label selector specified should match any MachineDeployment",
-		testSpec:            createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, true, nil),
+		testSpec:            createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, true, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{labelSelector: labels.NewSelector()},
 		shouldMatch:         true,
 	}, {
 		name:     "clustername specified does not match MachineSet, namespace matches, no labels specified",
-		testSpec: createTestSpec("default", RandomString(6), RandomString(6), 1, false, nil),
+		testSpec: createTestSpec("default", RandomString(6), RandomString(6), 1, false, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -223,7 +223,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:     "clustername specified does not match MachineDeployment, namespace matches, no labels specified",
-		testSpec: createTestSpec("default", RandomString(6), RandomString(6), 1, true, nil),
+		testSpec: createTestSpec("default", RandomString(6), RandomString(6), 1, true, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -232,7 +232,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:     "namespace specified does not match MachineSet, clusterName matches, no labels specified",
-		testSpec: createTestSpec(RandomString(6), "foo", RandomString(6), 1, false, nil),
+		testSpec: createTestSpec(RandomString(6), "foo", RandomString(6), 1, false, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -241,7 +241,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:     "clustername specified does not match MachineDeployment, namespace matches, no labels specified",
-		testSpec: createTestSpec(RandomString(6), "foo", RandomString(6), 1, true, nil),
+		testSpec: createTestSpec(RandomString(6), "foo", RandomString(6), 1, true, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -250,7 +250,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:     "namespace and clusterName matches MachineSet, no labels specified",
-		testSpec: createTestSpec("default", "foo", RandomString(6), 1, false, nil),
+		testSpec: createTestSpec("default", "foo", RandomString(6), 1, false, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -259,7 +259,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:     "namespace and clusterName matches MachineDeployment, no labels specified",
-		testSpec: createTestSpec("default", "foo", RandomString(6), 1, true, nil),
+		testSpec: createTestSpec("default", "foo", RandomString(6), 1, true, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -268,7 +268,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:     "namespace and clusterName matches MachineSet, does not match label selector",
-		testSpec: createTestSpec("default", "foo", RandomString(6), 1, false, nil),
+		testSpec: createTestSpec("default", "foo", RandomString(6), 1, false, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -277,7 +277,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:     "namespace and clusterName matches MachineDeployment, does not match label selector",
-		testSpec: createTestSpec("default", "foo", RandomString(6), 1, true, nil),
+		testSpec: createTestSpec("default", "foo", RandomString(6), 1, true, nil, nil),
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
 			namespace:     "default",
@ -286,7 +286,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:             "namespace, clusterName, and label selector matches MachineSet",
-		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, false, nil),
+		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, false, nil, nil),
 		additionalLabels: map[string]string{"color": "green"},
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
@ -296,7 +296,7 @@ func Test_allowedByAutoDiscoverySpec(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:             "namespace, clusterName, and label selector matches MachineDeployment",
-		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, true, nil),
+		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, true, nil, nil),
 		additionalLabels: map[string]string{"color": "green"},
 		autoDiscoveryConfig: &clusterAPIAutoDiscoveryConfig{
 			clusterName:   "foo",
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_controller.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_controller.go
@ -17,6 +17,7 @@ limitations under the License.
 package clusterapi

 import (
+	"context"
 	"fmt"
 	"os"
 	"strings"
@ -53,6 +54,7 @@ const (
 	resourceNameMachineSet        = "machinesets"
 	resourceNameMachineDeployment = "machinedeployments"
 	failedMachinePrefix           = "failed-machine-"
+	machineTemplateKind           = "MachineTemplate"
 	machineDeploymentKind         = "MachineDeployment"
 	machineSetKind                = "MachineSet"
 	machineKind                   = "Machine"
@ -708,3 +710,21 @@ func (c *machineController) allowedByAutoDiscoverySpecs(r *unstructured.Unstruct

 	return false
 }
+
+// Get an infrastructure machine template given its GVR, name, and namespace.
+func (c *machineController) getInfrastructureResource(resource schema.GroupVersionResource, name string, namespace string) (*unstructured.Unstructured, error) {
+	infra, err := c.managementClient.
+		Resource(resource).
+		Namespace(namespace).
+		Get(
+			context.Background(),
+			name,
+			metav1.GetOptions{},
+		)
+	if err != nil {
+		klog.V(4).Infof("Unable to read infrastructure reference, error: %v", err)
+		return nil, err
+	}
+
+	return infra, err
+}
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_controller_test.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_controller_test.go
@ -43,6 +43,7 @@ import (
 	fakekube "k8s.io/client-go/kubernetes/fake"
 	fakescale "k8s.io/client-go/scale/fake"
 	clientgotesting "k8s.io/client-go/testing"
+	klog "k8s.io/klog/v2"

 	"k8s.io/autoscaler/cluster-autoscaler/cloudprovider"
 )
@ -55,12 +56,14 @@ type testConfig struct {
 	namespace         string
 	machineDeployment *unstructured.Unstructured
 	machineSet        *unstructured.Unstructured
+	machineTemplate   *unstructured.Unstructured
 	machines          []*unstructured.Unstructured
 	nodes             []*corev1.Node
 }

 type testSpec struct {
 	annotations             map[string]string
+	capacity                map[string]string
 	machineDeploymentName   string
 	machineSetName          string
 	clusterName             string
@ -91,27 +94,40 @@ func mustCreateTestController(t *testing.T, testConfigs ...*testConfig) (*machin
 		if config.machineDeployment != nil {
 			machineObjects = append(machineObjects, config.machineDeployment)
 		}
+
+		if config.machineTemplate != nil {
+			machineObjects = append(machineObjects, config.machineTemplate)
+		}
 	}

 	kubeclientSet := fakekube.NewSimpleClientset(nodeObjects...)
 	dynamicClientset := fakedynamic.NewSimpleDynamicClientWithCustomListKinds(
 		runtime.NewScheme(),
 		map[schema.GroupVersionResource]string{
-			{Group: "cluster.x-k8s.io", Version: "v1alpha3", Resource: "machinedeployments"}: "kindList",
-			{Group: "cluster.x-k8s.io", Version: "v1alpha3", Resource: "machines"}:           "kindList",
-			{Group: "cluster.x-k8s.io", Version: "v1alpha3", Resource: "machinesets"}:        "kindList",
-			{Group: "cluster.x-k8s.io", Version: "v1beta1", Resource: "machinedeployments"}:  "kindList",
-			{Group: "cluster.x-k8s.io", Version: "v1beta1", Resource: "machines"}:            "kindList",
-			{Group: "cluster.x-k8s.io", Version: "v1beta1", Resource: "machinesets"}:         "kindList",
-			{Group: "custom.x-k8s.io", Version: "v1beta1", Resource: "machinedeployments"}:   "kindList",
-			{Group: "custom.x-k8s.io", Version: "v1beta1", Resource: "machines"}:             "kindList",
-			{Group: "custom.x-k8s.io", Version: "v1beta1", Resource: "machinesets"}:          "kindList",
+			{Group: "cluster.x-k8s.io", Version: "v1alpha3", Resource: "machinedeployments"}:             "kindList",
+			{Group: "cluster.x-k8s.io", Version: "v1alpha3", Resource: "machines"}:                       "kindList",
+			{Group: "cluster.x-k8s.io", Version: "v1alpha3", Resource: "machinesets"}:                    "kindList",
+			{Group: "cluster.x-k8s.io", Version: "v1beta1", Resource: "machinedeployments"}:              "kindList",
+			{Group: "cluster.x-k8s.io", Version: "v1beta1", Resource: "machines"}:                        "kindList",
+			{Group: "cluster.x-k8s.io", Version: "v1beta1", Resource: "machinesets"}:                     "kindList",
+			{Group: "custom.x-k8s.io", Version: "v1beta1", Resource: "machinedeployments"}:               "kindList",
+			{Group: "custom.x-k8s.io", Version: "v1beta1", Resource: "machines"}:                         "kindList",
+			{Group: "custom.x-k8s.io", Version: "v1beta1", Resource: "machinesets"}:                      "kindList",
+			{Group: "infrastructure.cluster.x-k8s.io", Version: "v1beta1", Resource: "machinetemplates"}: "kindList",
 		},
 		machineObjects...,
 	)
 	discoveryClient := &fakediscovery.FakeDiscovery{
 		Fake: &clientgotesting.Fake{
 			Resources: []*metav1.APIResourceList{
+				{
+					GroupVersion: "infrastructure.cluster.x-k8s.io/v1beta1",
+					APIResources: []metav1.APIResource{
+						{
+							Name: "machinetemplates",
+						},
+					},
+				},
 				{
 					GroupVersion: fmt.Sprintf("%s/v1beta1", customCAPIGroup),
 					APIResources: []metav1.APIResource{
@ -249,35 +265,36 @@ func mustCreateTestController(t *testing.T, testConfigs ...*testConfig) (*machin
 	}
 }

-func createMachineSetTestConfig(namespace, clusterName, namePrefix string, nodeCount int, annotations map[string]string) *testConfig {
-	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, 1, nodeCount, false, annotations)...)[0]
+func createMachineSetTestConfig(namespace, clusterName, namePrefix string, nodeCount int, annotations map[string]string, capacity map[string]string) *testConfig {
+	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, 1, nodeCount, false, annotations, capacity)...)[0]
 }

-func createMachineSetTestConfigs(namespace, clusterName, namePrefix string, configCount, nodeCount int, annotations map[string]string) []*testConfig {
-	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, configCount, nodeCount, false, annotations)...)
+func createMachineSetTestConfigs(namespace, clusterName, namePrefix string, configCount, nodeCount int, annotations map[string]string, capacity map[string]string) []*testConfig {
+	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, configCount, nodeCount, false, annotations, capacity)...)
 }

-func createMachineDeploymentTestConfig(namespace, clusterName, namePrefix string, nodeCount int, annotations map[string]string) *testConfig {
-	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, 1, nodeCount, true, annotations)...)[0]
+func createMachineDeploymentTestConfig(namespace, clusterName, namePrefix string, nodeCount int, annotations map[string]string, capacity map[string]string) *testConfig {
+	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, 1, nodeCount, true, annotations, capacity)...)[0]
 }

-func createMachineDeploymentTestConfigs(namespace, clusterName, namePrefix string, configCount, nodeCount int, annotations map[string]string) []*testConfig {
-	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, configCount, nodeCount, true, annotations)...)
+func createMachineDeploymentTestConfigs(namespace, clusterName, namePrefix string, configCount, nodeCount int, annotations map[string]string, capacity map[string]string) []*testConfig {
+	return createTestConfigs(createTestSpecs(namespace, clusterName, namePrefix, configCount, nodeCount, true, annotations, capacity)...)
 }

-func createTestSpecs(namespace, clusterName, namePrefix string, scalableResourceCount, nodeCount int, isMachineDeployment bool, annotations map[string]string) []testSpec {
+func createTestSpecs(namespace, clusterName, namePrefix string, scalableResourceCount, nodeCount int, isMachineDeployment bool, annotations map[string]string, capacity map[string]string) []testSpec {
 	var specs []testSpec

 	for i := 0; i < scalableResourceCount; i++ {
-		specs = append(specs, createTestSpec(namespace, clusterName, fmt.Sprintf("%s-%d", namePrefix, i), nodeCount, isMachineDeployment, annotations))
+		specs = append(specs, createTestSpec(namespace, clusterName, fmt.Sprintf("%s-%d", namePrefix, i), nodeCount, isMachineDeployment, annotations, capacity))
 	}

 	return specs
 }

-func createTestSpec(namespace, clusterName, name string, nodeCount int, isMachineDeployment bool, annotations map[string]string) testSpec {
+func createTestSpec(namespace, clusterName, name string, nodeCount int, isMachineDeployment bool, annotations map[string]string, capacity map[string]string) testSpec {
 	return testSpec{
 		annotations:             annotations,
+		capacity:                capacity,
 		machineDeploymentName:   name,
 		machineSetName:          name,
 		clusterName:             clusterName,
@ -316,6 +333,15 @@ func createTestConfigs(specs ...testSpec) []*testConfig {
 				"spec": map[string]interface{}{
 					"clusterName": spec.clusterName,
 					"replicas":    int64(spec.nodeCount),
+					"template": map[string]interface{}{
+						"spec": map[string]interface{}{
+							"infrastructureRef": map[string]interface{}{
+								"apiVersion": "infrastructure.cluster.x-k8s.io/v1beta1",
+								"kind":       machineTemplateKind,
+								"name":       "TestMachineTemplate",
+							},
+						},
+					},
 				},
 				"status": map[string]interface{}{},
 			},
@ -345,6 +371,15 @@ func createTestConfigs(specs ...testSpec) []*testConfig {
 					"spec": map[string]interface{}{
 						"clusterName": spec.clusterName,
 						"replicas":    int64(spec.nodeCount),
+						"template": map[string]interface{}{
+							"spec": map[string]interface{}{
+								"infrastructureRef": map[string]interface{}{
+									"apiVersion": "infrastructure.cluster.x-k8s.io/v1beta1",
+									"kind":       machineTemplateKind,
+									"name":       "TestMachineTemplate",
+								},
+							},
+						},
 					},
 					"status": map[string]interface{}{},
 				},
@ -371,6 +406,24 @@ func createTestConfigs(specs ...testSpec) []*testConfig {
 			UID:  config.machineSet.GetUID(),
 		}

+		if spec.capacity != nil {
+			klog.V(4).Infof("adding capacity to machine template")
+			config.machineTemplate = &unstructured.Unstructured{
+				Object: map[string]interface{}{
+					"apiVersion": "infrastructure.cluster.x-k8s.io/v1beta1",
+					"kind":       machineTemplateKind,
+					"metadata": map[string]interface{}{
+						"name":      "TestMachineTemplate",
+						"namespace": spec.namespace,
+						"uid":       "TestMachineTemplate",
+					},
+				},
+			}
+			unstructured.SetNestedStringMap(config.machineTemplate.Object, spec.capacity, "status", "capacity")
+		} else {
+			klog.V(4).Infof("not adding capacity")
+		}
+
 		for j := 0; j < spec.nodeCount; j++ {
 			config.nodes[j], config.machines[j] = makeLinkedNodeAndMachine(j, spec.namespace, spec.clusterName, machineOwner, machineSetLabels)
 		}
@ -585,7 +638,7 @@ func TestControllerFindMachine(t *testing.T) {
 			testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 				nodeGroupMinSizeAnnotationKey: "1",
 				nodeGroupMaxSizeAnnotationKey: "10",
-			})
+			}, nil)
 			if tc.name == "" {
 				tc.name = testConfig.machines[0].GetName()
 			}
@ -613,7 +666,7 @@ func TestControllerFindMachineOwner(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -662,7 +715,7 @@ func TestControllerFindMachineByProviderID(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -724,7 +777,7 @@ func TestControllerFindNodeByNodeName(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -824,7 +877,7 @@ func TestControllerListMachinesForScalableResource(t *testing.T) {
 		testConfig1 := createMachineSetTestConfig(namespace, clusterName, RandomString(6), 5, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)

 		// Construct a second set of objects and add the machines,
 		// nodes and the additional machineset to the existing set of
@ -833,7 +886,7 @@ func TestControllerListMachinesForScalableResource(t *testing.T) {
 		testConfig2 := createMachineSetTestConfig(namespace, clusterName, RandomString(6), 5, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)

 		test(t, testConfig1, testConfig2)
 	})
@ -844,7 +897,7 @@ func TestControllerListMachinesForScalableResource(t *testing.T) {
 		testConfig1 := createMachineDeploymentTestConfig(namespace, clusterName, RandomString(6), 5, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)

 		// Construct a second set of objects and add the machines,
 		// nodes, machineset, and the additional machineset to the existing set of
@ -853,7 +906,7 @@ func TestControllerListMachinesForScalableResource(t *testing.T) {
 		testConfig2 := createMachineDeploymentTestConfig(namespace, clusterName, RandomString(6), 5, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)

 		test(t, testConfig1, testConfig2)
 	})
@ -884,7 +937,7 @@ func TestControllerLookupNodeGroupForNonExistentNode(t *testing.T) {
 		testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)
 		test(t, testConfig)
 	})

@ -892,7 +945,7 @@ func TestControllerLookupNodeGroupForNonExistentNode(t *testing.T) {
 		testConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)
 		test(t, testConfig)
 	})
 }
@ -923,7 +976,7 @@ func TestControllerNodeGroupForNodeWithMissingMachineOwner(t *testing.T) {
 		testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)
 		test(t, testConfig)
 	})

@ -931,7 +984,7 @@ func TestControllerNodeGroupForNodeWithMissingMachineOwner(t *testing.T) {
 		testConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "10",
-		})
+		}, nil)
 		test(t, testConfig)
 	})
 }
@ -940,7 +993,7 @@ func TestControllerNodeGroupForNodeWithMissingSetMachineOwner(t *testing.T) {
 	testConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -982,7 +1035,7 @@ func TestControllerNodeGroupForNodeWithPositiveScalingBounds(t *testing.T) {
 		testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "1",
-		})
+		}, nil)
 		test(t, testConfig)
 	})

@ -990,7 +1043,7 @@ func TestControllerNodeGroupForNodeWithPositiveScalingBounds(t *testing.T) {
 		testConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 			nodeGroupMinSizeAnnotationKey: "1",
 			nodeGroupMaxSizeAnnotationKey: "1",
-		})
+		}, nil)
 		test(t, testConfig)
 	})
 }
@ -1022,14 +1075,14 @@ func TestControllerNodeGroups(t *testing.T) {
 	assertNodegroupLen(t, controller, 0)

 	// Test #2: add 5 machineset-based nodegroups
-	machineSetConfigs := createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 5, 1, annotations)
+	machineSetConfigs := createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 5, 1, annotations, nil)
 	if err := addTestConfigs(t, controller, machineSetConfigs...); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
 	assertNodegroupLen(t, controller, 5)

 	// Test #2: add 2 machinedeployment-based nodegroups
-	machineDeploymentConfigs := createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 2, 1, annotations)
+	machineDeploymentConfigs := createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 2, 1, annotations, nil)
 	if err := addTestConfigs(t, controller, machineDeploymentConfigs...); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
@ -1053,14 +1106,14 @@ func TestControllerNodeGroups(t *testing.T) {
 	}

 	// Test #5: machineset with no scaling bounds results in no nodegroups
-	machineSetConfigs = createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 5, 1, annotations)
+	machineSetConfigs = createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 5, 1, annotations, nil)
 	if err := addTestConfigs(t, controller, machineSetConfigs...); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
 	assertNodegroupLen(t, controller, 0)

 	// Test #6: machinedeployment with no scaling bounds results in no nodegroups
-	machineDeploymentConfigs = createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 2, 1, annotations)
+	machineDeploymentConfigs = createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 2, 1, annotations, nil)
 	if err := addTestConfigs(t, controller, machineDeploymentConfigs...); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
@ -1072,7 +1125,7 @@ func TestControllerNodeGroups(t *testing.T) {
 	}

 	// Test #7: machineset with bad scaling bounds results in an error and no nodegroups
-	machineSetConfigs = createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 5, 1, annotations)
+	machineSetConfigs = createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 5, 1, annotations, nil)
 	if err := addTestConfigs(t, controller, machineSetConfigs...); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
@ -1081,7 +1134,7 @@ func TestControllerNodeGroups(t *testing.T) {
 	}

 	// Test #8: machinedeployment with bad scaling bounds results in an error and no nodegroups
-	machineDeploymentConfigs = createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 2, 1, annotations)
+	machineDeploymentConfigs = createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 2, 1, annotations, nil)
 	if err := addTestConfigs(t, controller, machineDeploymentConfigs...); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
@ -1145,13 +1198,13 @@ func TestControllerNodeGroupsNodeCount(t *testing.T) {

 	t.Run("MachineSet", func(t *testing.T) {
 		for _, tc := range testCases {
-			test(t, tc, createMachineSetTestConfigs(RandomString(6), RandomString(6), RandomString(6), tc.nodeGroups, tc.nodesPerGroup, annotations))
+			test(t, tc, createMachineSetTestConfigs(RandomString(6), RandomString(6), RandomString(6), tc.nodeGroups, tc.nodesPerGroup, annotations, nil))
 		}
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
 		for _, tc := range testCases {
-			test(t, tc, createMachineDeploymentTestConfigs(RandomString(6), RandomString(6), RandomString(6), tc.nodeGroups, tc.nodesPerGroup, annotations))
+			test(t, tc, createMachineDeploymentTestConfigs(RandomString(6), RandomString(6), RandomString(6), tc.nodeGroups, tc.nodesPerGroup, annotations, nil))
 		}
 	})
 }
@ -1160,7 +1213,7 @@ func TestControllerFindMachineFromNodeAnnotation(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -1208,7 +1261,7 @@ func TestControllerMachineSetNodeNamesWithoutLinkage(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 3, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -1250,7 +1303,7 @@ func TestControllerMachineSetNodeNamesUsingProviderID(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 3, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -1302,7 +1355,7 @@ func TestControllerMachineSetNodeNamesUsingStatusNodeRefName(t *testing.T) {
 	testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 3, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	controller, stop := mustCreateTestController(t, testConfig)
 	defer stop()
@ -1393,7 +1446,7 @@ func TestControllerGetAPIVersionGroupWithMachineDeployments(t *testing.T) {
 	testConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "1",
-	})
+	}, nil)
 	if err := os.Setenv(CAPIGroupEnvVar, customCAPIGroup); err != nil {
 		t.Fatalf("unexpected error: %v", err)
 	}
@ -1663,7 +1716,7 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 		shouldMatch        bool
 	}{{
 		name:     "autodiscovery specs includes permissive spec that should match any MachineSet",
-		testSpec: createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, false, nil),
+		testSpec: createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, false, nil, nil),
 		autoDiscoverySpecs: []*clusterAPIAutoDiscoveryConfig{
 			{labelSelector: labels.NewSelector()},
 			{clusterName: "foo", namespace: "bar", labelSelector: labels.Nothing()},
@ -1671,7 +1724,7 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:     "autodiscovery specs includes permissive spec that should match any MachineDeployment",
-		testSpec: createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, true, nil),
+		testSpec: createTestSpec(RandomString(6), RandomString(6), RandomString(6), 1, true, nil, nil),
 		autoDiscoverySpecs: []*clusterAPIAutoDiscoveryConfig{
 			{labelSelector: labels.NewSelector()},
 			{clusterName: "foo", namespace: "bar", labelSelector: labels.Nothing()},
@ -1679,7 +1732,7 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:             "autodiscovery specs includes a restrictive spec that should match specific MachineSet",
-		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, false, nil),
+		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, false, nil, nil),
 		additionalLabels: map[string]string{"color": "green"},
 		autoDiscoverySpecs: []*clusterAPIAutoDiscoveryConfig{
 			{clusterName: "foo", namespace: "default", labelSelector: labels.SelectorFromSet(labels.Set{"color": "green"})},
@ -1688,7 +1741,7 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:             "autodiscovery specs includes a restrictive spec that should match specific MachineDeployment",
-		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, true, nil),
+		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, true, nil, nil),
 		additionalLabels: map[string]string{"color": "green"},
 		autoDiscoverySpecs: []*clusterAPIAutoDiscoveryConfig{
 			{clusterName: "foo", namespace: "default", labelSelector: labels.SelectorFromSet(labels.Set{"color": "green"})},
@ -1697,7 +1750,7 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 		shouldMatch: true,
 	}, {
 		name:             "autodiscovery specs does not include any specs that should match specific MachineSet",
-		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, false, nil),
+		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, false, nil, nil),
 		additionalLabels: map[string]string{"color": "green"},
 		autoDiscoverySpecs: []*clusterAPIAutoDiscoveryConfig{
 			{clusterName: "test", namespace: "default", labelSelector: labels.SelectorFromSet(labels.Set{"color": "blue"})},
@ -1706,7 +1759,7 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 		shouldMatch: false,
 	}, {
 		name:             "autodiscovery specs does not include any specs that should match specific MachineDeployment",
-		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, true, nil),
+		testSpec:         createTestSpec("default", "foo", RandomString(6), 1, true, nil, nil),
 		additionalLabels: map[string]string{"color": "green"},
 		autoDiscoverySpecs: []*clusterAPIAutoDiscoveryConfig{
 			{clusterName: "test", namespace: "default", labelSelector: labels.SelectorFromSet(labels.Set{"color": "blue"})},
@ -1736,9 +1789,9 @@ func Test_machineController_allowedByAutoDiscoverySpecs(t *testing.T) {
 }

 func Test_machineController_listScalableResources(t *testing.T) {
-	uniqueMDConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, nil)
+	uniqueMDConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, nil, nil)

-	mdTestConfigs := createMachineDeploymentTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, nil)
+	mdTestConfigs := createMachineDeploymentTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, nil, nil)
 	mdTestConfigs = append(mdTestConfigs, uniqueMDConfig)

 	allMachineDeployments := make([]*unstructured.Unstructured, 0, len(mdTestConfigs))
@ -1746,9 +1799,9 @@ func Test_machineController_listScalableResources(t *testing.T) {
 		allMachineDeployments = append(allMachineDeployments, mdTestConfigs[i].machineDeployment)
 	}

-	uniqueMSConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, nil)
+	uniqueMSConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, nil, nil)

-	msTestConfigs := createMachineSetTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, nil)
+	msTestConfigs := createMachineSetTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, nil, nil)
 	msTestConfigs = append(msTestConfigs, uniqueMSConfig)

 	allMachineSets := make([]*unstructured.Unstructured, 0, len(msTestConfigs))
@ -1851,12 +1904,12 @@ func Test_machineController_nodeGroupForNode(t *testing.T) {
 	uniqueMDConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	mdTestConfigs := createMachineDeploymentTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)
 	mdTestConfigs = append(mdTestConfigs, uniqueMDConfig)

 	allMachineDeployments := make([]*unstructured.Unstructured, 0, len(mdTestConfigs))
@ -1867,12 +1920,12 @@ func Test_machineController_nodeGroupForNode(t *testing.T) {
 	uniqueMSConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	msTestConfigs := createMachineSetTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)
 	msTestConfigs = append(msTestConfigs, uniqueMSConfig)

 	allMachineSets := make([]*unstructured.Unstructured, 0, len(msTestConfigs))
@ -1957,12 +2010,12 @@ func Test_machineController_nodeGroups(t *testing.T) {
 	uniqueMDConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	mdTestConfigs := createMachineDeploymentTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)
 	mdTestConfigs = append(mdTestConfigs, uniqueMDConfig)

 	allMachineDeployments := make([]*unstructured.Unstructured, 0, len(mdTestConfigs))
@ -1973,12 +2026,12 @@ func Test_machineController_nodeGroups(t *testing.T) {
 	uniqueMSConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)

 	msTestConfigs := createMachineSetTestConfigs(RandomString(6), RandomString(6), RandomString(6), 5, 1, map[string]string{
 		nodeGroupMinSizeAnnotationKey: "1",
 		nodeGroupMaxSizeAnnotationKey: "10",
-	})
+	}, nil)
 	msTestConfigs = append(msTestConfigs, uniqueMSConfig)

 	allMachineSets := make([]*unstructured.Unstructured, 0, len(msTestConfigs))
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_nodegroup.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_nodegroup.go
@ -23,10 +23,8 @@ import (
 	"github.com/pkg/errors"

 	corev1 "k8s.io/api/core/v1"
-	"k8s.io/apimachinery/pkg/api/resource"
 	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
 	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
-	gpuapis "k8s.io/autoscaler/cluster-autoscaler/utils/gpu"
 	kubeletapis "k8s.io/kubelet/pkg/apis"
 	schedulerframework "k8s.io/kubernetes/pkg/scheduler/framework"

@ -35,17 +33,12 @@ import (
 )

 const (
-	// deprecatedMachineDeleteAnnotationKey should not be removed until minimum cluster-api support is v1alpha3
-	deprecatedMachineDeleteAnnotationKey = "cluster.k8s.io/delete-machine"
-	// TODO: determine what currently relies on deprecatedMachineAnnotationKey to determine when it can be removed
-	deprecatedMachineAnnotationKey = "cluster.k8s.io/machine"
-	machineDeleteAnnotationKey     = "machine.openshift.io/cluster-api-delete-machine"
-	machineAnnotationKey           = "machine.openshift.io/machine"
-	debugFormat                    = "%s (min: %d, max: %d, replicas: %d)"
+	debugFormat = "%s (min: %d, max: %d, replicas: %d)"

-	// This default for the maximum number of pods comes from the machine-config-operator
-	// see https://github.com/openshift/machine-config-operator/blob/2f1bd6d99131fa4471ed95543a51dec3d5922b2b/templates/worker/01-worker-kubelet/_base/files/kubelet.yaml#L19
-	defaultMaxPods = 250
+	// The default for the maximum number of pods is inspired by the Kubernetes
+	// best practices documentation for large clusters.
+	// see https://kubernetes.io/docs/setup/best-practices/cluster-large/
+	defaultMaxPods = 110
 )

 type nodegroup struct {
@ -254,45 +247,11 @@ func (ng *nodegroup) TemplateNodeInfo() (*schedulerframework.NodeInfo, error) {
 		return nil, cloudprovider.ErrNotImplemented
 	}

-	cpu, err := ng.scalableResource.InstanceCPUCapacity()
+	capacity, err := ng.scalableResource.InstanceCapacity()
 	if err != nil {
 		return nil, err
 	}

-	mem, err := ng.scalableResource.InstanceMemoryCapacity()
-	if err != nil {
-		return nil, err
-	}
-
-	gpu, err := ng.scalableResource.InstanceGPUCapacity()
-	if err != nil {
-		return nil, err
-	}
-
-	pod, err := ng.scalableResource.InstanceMaxPodsCapacity()
-	if err != nil {
-		return nil, err
-	}
-
-	if cpu.IsZero() || mem.IsZero() {
-		return nil, cloudprovider.ErrNotImplemented
-	}
-
-	if gpu.IsZero() {
-		gpu = zeroQuantity.DeepCopy()
-	}
-
-	if pod.IsZero() {
-		pod = *resource.NewQuantity(defaultMaxPods, resource.DecimalSI)
-	}
-
-	capacity := map[corev1.ResourceName]resource.Quantity{
-		corev1.ResourceCPU:        cpu,
-		corev1.ResourceMemory:     mem,
-		corev1.ResourcePods:       pod,
-		gpuapis.ResourceNvidiaGPU: gpu,
-	}
-
 	nodeName := fmt.Sprintf("%s-asg-%d", ng.scalableResource.Name(), rand.Int63())
 	node := corev1.Node{
 		ObjectMeta: metav1.ObjectMeta{
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_nodegroup_test.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_nodegroup_test.go
@ -19,13 +19,14 @@ package clusterapi
 import (
 	"context"
 	"fmt"
-	"k8s.io/client-go/tools/cache"
 	"path"
 	"sort"
 	"strings"
 	"testing"
 	"time"

+	"k8s.io/client-go/tools/cache"
+
 	corev1 "k8s.io/api/core/v1"
 	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
 	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
@ -201,7 +202,7 @@ func TestNodeGroupNewNodeGroupConstructor(t *testing.T) {
 	t.Run("MachineSet", func(t *testing.T) {
 		for _, tc := range testCases {
 			t.Run(tc.description, func(t *testing.T) {
-				test(t, tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), tc.nodeCount, tc.annotations))
+				test(t, tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), tc.nodeCount, tc.annotations, nil))
 			})
 		}
 	})
@ -209,7 +210,7 @@ func TestNodeGroupNewNodeGroupConstructor(t *testing.T) {
 	t.Run("MachineDeployment", func(t *testing.T) {
 		for _, tc := range testCases {
 			t.Run(tc.description, func(t *testing.T) {
-				test(t, tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), tc.nodeCount, tc.annotations))
+				test(t, tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), tc.nodeCount, tc.annotations, nil))
 			})
 		}
 	})
@ -293,7 +294,7 @@ func TestNodeGroupIncreaseSizeErrors(t *testing.T) {
 					nodeGroupMinSizeAnnotationKey: "1",
 					nodeGroupMaxSizeAnnotationKey: "10",
 				}
-				test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+				test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 			})
 		}
 	})
@ -305,7 +306,7 @@ func TestNodeGroupIncreaseSizeErrors(t *testing.T) {
 					nodeGroupMinSizeAnnotationKey: "1",
 					nodeGroupMaxSizeAnnotationKey: "10",
 				}
-				test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+				test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 			})
 		}
 	})
@ -371,7 +372,7 @@ func TestNodeGroupIncreaseSize(t *testing.T) {
 			expected:    4,
 			delta:       1,
 		}
-		test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+		test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
@ -381,7 +382,7 @@ func TestNodeGroupIncreaseSize(t *testing.T) {
 			expected:    4,
 			delta:       1,
 		}
-		test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+		test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 	})
 }

@ -513,7 +514,7 @@ func TestNodeGroupDecreaseTargetSize(t *testing.T) {
 			delta:               -1,
 			expectedError:       true,
 		}
-		test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+		test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 	})

 	t.Run("MachineSet", func(t *testing.T) {
@ -524,7 +525,7 @@ func TestNodeGroupDecreaseTargetSize(t *testing.T) {
 			expected:            3,
 			delta:               -1,
 		}
-		test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+		test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
@ -536,7 +537,7 @@ func TestNodeGroupDecreaseTargetSize(t *testing.T) {
 			delta:               -1,
 			expectedError:       true,
 		}
-		test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+		test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 	})
 }

@ -618,7 +619,7 @@ func TestNodeGroupDecreaseSizeErrors(t *testing.T) {
 					nodeGroupMinSizeAnnotationKey: "1",
 					nodeGroupMaxSizeAnnotationKey: "10",
 				}
-				test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+				test(t, &tc, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 			})
 		}
 	})
@ -630,7 +631,7 @@ func TestNodeGroupDecreaseSizeErrors(t *testing.T) {
 					nodeGroupMinSizeAnnotationKey: "1",
 					nodeGroupMaxSizeAnnotationKey: "10",
 				}
-				test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations))
+				test(t, &tc, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), int(tc.initial), annotations, nil))
 			})
 		}
 	})
@ -708,17 +709,37 @@ func TestNodeGroupDeleteNodes(t *testing.T) {
 	// sorting and the expected semantics in test() will fail.

 	t.Run("MachineSet", func(t *testing.T) {
-		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineSetTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
-		test(t, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineDeploymentTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})
 }

@ -789,16 +810,16 @@ func TestNodeGroupMachineSetDeleteNodesWithMismatchedNodes(t *testing.T) {
 	t.Run("MachineSet", func(t *testing.T) {
 		namespace := RandomString(6)
 		clusterName := RandomString(6)
-		testConfig0 := createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations)
-		testConfig1 := createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations)
+		testConfig0 := createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations, nil)
+		testConfig1 := createMachineSetTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations, nil)
 		test(t, 2, append(testConfig0, testConfig1...))
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
 		namespace := RandomString(6)
 		clusterName := RandomString(6)
-		testConfig0 := createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations)
-		testConfig1 := createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations)
+		testConfig0 := createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations, nil)
+		testConfig1 := createMachineDeploymentTestConfigs(namespace, clusterName, RandomString(6), 1, 2, annotations, nil)
 		test(t, 2, append(testConfig0, testConfig1...))
 	})
 }
@ -968,17 +989,37 @@ func TestNodeGroupDeleteNodesTwice(t *testing.T) {
 	// sorting and the expected semantics in test() will fail.

 	t.Run("MachineSet", func(t *testing.T) {
-		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineSetTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
-		test(t, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineDeploymentTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})
 }

@ -1097,17 +1138,37 @@ func TestNodeGroupDeleteNodesSequential(t *testing.T) {
 	// sorting and the expected semantics in test() will fail.

 	t.Run("MachineSet", func(t *testing.T) {
-		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineSetTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
-		test(t, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineDeploymentTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})
 }

@ -1178,17 +1239,37 @@ func TestNodeGroupWithFailedMachine(t *testing.T) {
 	// sorting and the expected semantics in test() will fail.

 	t.Run("MachineSet", func(t *testing.T) {
-		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineSetTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
-		test(t, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 10, map[string]string{
-			nodeGroupMinSizeAnnotationKey: "1",
-			nodeGroupMaxSizeAnnotationKey: "10",
-		}))
+		test(
+			t,
+			createMachineDeploymentTestConfig(
+				RandomString(6),
+				RandomString(6),
+				RandomString(6),
+				10,
+				map[string]string{
+					nodeGroupMinSizeAnnotationKey: "1",
+					nodeGroupMaxSizeAnnotationKey: "10",
+				},
+				nil,
+			),
+		)
 	})
 }

@ -1221,16 +1302,18 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 		{
 			name: "When the NodeGroup can scale from zero",
 			nodeGroupAnnotations: map[string]string{
-				memoryKey: "2048",
-				cpuKey:    "2",
+				memoryKey:   "2048Mi",
+				cpuKey:      "2",
+				gpuTypeKey:  gpuapis.ResourceNvidiaGPU,
+				gpuCountKey: "1",
 			},
 			config: testCaseConfig{
 				expectedErr: nil,
 				expectedCapacity: map[corev1.ResourceName]int64{
 					corev1.ResourceCPU:        2,
 					corev1.ResourceMemory:     2048 * 1024 * 1024,
-					corev1.ResourcePods:       250,
-					gpuapis.ResourceNvidiaGPU: 0,
+					corev1.ResourcePods:       110,
+					gpuapis.ResourceNvidiaGPU: 1,
 				},
 				expectedNodeLabels: map[string]string{
 					"kubernetes.io/os":        "linux",
@ -1243,7 +1326,7 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 		{
 			name: "When the NodeGroup can scale from zero and the nodegroup adds labels to the Node",
 			nodeGroupAnnotations: map[string]string{
-				memoryKey: "2048",
+				memoryKey: "2048Mi",
 				cpuKey:    "2",
 			},
 			config: testCaseConfig{
@ -1253,10 +1336,9 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 					"anotherLabel":   "anotherValue",
 				},
 				expectedCapacity: map[corev1.ResourceName]int64{
-					corev1.ResourceCPU:        2,
-					corev1.ResourceMemory:     2048 * 1024 * 1024,
-					corev1.ResourcePods:       250,
-					gpuapis.ResourceNvidiaGPU: 0,
+					corev1.ResourceCPU:    2,
+					corev1.ResourceMemory: 2048 * 1024 * 1024,
+					corev1.ResourcePods:   110,
 				},
 				expectedNodeLabels: map[string]string{
 					"kubernetes.io/os":        "linux",
@ -1271,7 +1353,7 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 		{
 			name: "When the NodeGroup can scale from zero and the Node still exists, it includes the known node labels",
 			nodeGroupAnnotations: map[string]string{
-				memoryKey: "2048",
+				memoryKey: "2048Mi",
 				cpuKey:    "2",
 			},
 			config: testCaseConfig{
@ -1285,13 +1367,12 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 				},
 				nodegroupLabels: map[string]string{
 					"nodeGroupLabel": "value",
-					"anotherLabel":   "anotherValue",
+					"anotherLabel":   "nodeGroupValue",
 				},
 				expectedCapacity: map[corev1.ResourceName]int64{
-					corev1.ResourceCPU:        2,
-					corev1.ResourceMemory:     2048 * 1024 * 1024,
-					corev1.ResourcePods:       250,
-					gpuapis.ResourceNvidiaGPU: 0,
+					corev1.ResourceCPU:    2,
+					corev1.ResourceMemory: 2048 * 1024 * 1024,
+					corev1.ResourcePods:   110,
 				},
 				expectedNodeLabels: map[string]string{
 					"kubernetes.io/os":                 "windows",
@ -1299,7 +1380,7 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 					"kubernetes.io/arch":               "arm64",
 					"beta.kubernetes.io/arch":          "amd64",
 					"nodeGroupLabel":                   "value",
-					"anotherLabel":                     "anotherValue",
+					"anotherLabel":                     "nodeGroupValue",
 					"node.kubernetes.io/instance-type": "instance1",
 				},
 			},
@ -1375,7 +1456,16 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 	for _, tc := range testCases {
 		t.Run(tc.name, func(t *testing.T) {
 			t.Run("MachineSet", func(t *testing.T) {
-				test(t, createMachineSetTestConfig(testNamespace, RandomString(6), RandomString(6), 10, cloudprovider.JoinStringMaps(enableScaleAnnotations, tc.nodeGroupAnnotations)),
+				test(
+					t,
+					createMachineSetTestConfig(
+						testNamespace,
+						RandomString(6),
+						RandomString(6),
+						10,
+						cloudprovider.JoinStringMaps(enableScaleAnnotations, tc.nodeGroupAnnotations),
+						nil,
+					),
 					tc.config,
 				)
 			})
@ -1383,7 +1473,14 @@ func TestNodeGroupTemplateNodeInfo(t *testing.T) {
 			t.Run("MachineDeployment", func(t *testing.T) {
 				test(
 					t,
-					createMachineDeploymentTestConfig(testNamespace, RandomString(6), RandomString(6), 10, cloudprovider.JoinStringMaps(enableScaleAnnotations, tc.nodeGroupAnnotations)),
+					createMachineDeploymentTestConfig(
+						testNamespace,
+						RandomString(6),
+						RandomString(6),
+						10,
+						cloudprovider.JoinStringMaps(enableScaleAnnotations, tc.nodeGroupAnnotations),
+						nil,
+					),
 					tc.config,
 				)
 			})
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_unstructured.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_unstructured.go
@ -20,14 +20,17 @@ import (
 	"context"
 	"fmt"
 	"path"
+	"strings"
 	"time"

 	"github.com/pkg/errors"
 	apiv1 "k8s.io/api/core/v1"
+	corev1 "k8s.io/api/core/v1"
 	"k8s.io/apimachinery/pkg/api/resource"
 	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
 	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
 	"k8s.io/apimachinery/pkg/runtime/schema"
+	klog "k8s.io/klog/v2"
 )

 type unstructuredScalableResource struct {
@ -190,26 +193,141 @@ func (r unstructuredScalableResource) Taints() []apiv1.Taint {
 	return ret
 }

+// A node group can scale from zero if it can inform about the CPU and memory
+// capacity of the nodes within the group.
 func (r unstructuredScalableResource) CanScaleFromZero() bool {
-	return scaleFromZeroEnabled(r.unstructured.GetAnnotations())
+	capacity, err := r.InstanceCapacity()
+	if err != nil {
+		return false
+	}
+	// CPU and memory are the minimum necessary for scaling from zero
+	_, cpuOk := capacity[corev1.ResourceCPU]
+	_, memOk := capacity[corev1.ResourceMemory]
+
+	return cpuOk && memOk
 }

-func (r unstructuredScalableResource) InstanceCPUCapacity() (resource.Quantity, error) {
+// Inspect the annotations on the scalable resource, and the status.capacity
+// field of the machine template infrastructure resource to build the projected
+// capacity for this node group. The returned map will be empty if the
+// provider does not support scaling from zero, or the annotations have not
+// been added.
+func (r unstructuredScalableResource) InstanceCapacity() (map[corev1.ResourceName]resource.Quantity, error) {
+	capacityAnnotations := map[corev1.ResourceName]resource.Quantity{}
+
+	cpu, err := r.InstanceCPUCapacityAnnotation()
+	if err != nil {
+		return nil, err
+	}
+	if !cpu.IsZero() {
+		capacityAnnotations[corev1.ResourceCPU] = cpu
+	}
+
+	mem, err := r.InstanceMemoryCapacityAnnotation()
+	if err != nil {
+		return nil, err
+	}
+	if !mem.IsZero() {
+		capacityAnnotations[corev1.ResourceMemory] = mem
+	}
+
+	gpuCount, err := r.InstanceGPUCapacityAnnotation()
+	if err != nil {
+		return nil, err
+	}
+	gpuType := r.InstanceGPUTypeAnnotation()
+	if !gpuCount.IsZero() && gpuType != "" {
+		capacityAnnotations[corev1.ResourceName(gpuType)] = gpuCount
+	}
+
+	maxPods, err := r.InstanceMaxPodsCapacityAnnotation()
+	if err != nil {
+		return nil, err
+	}
+	if maxPods.IsZero() {
+		maxPods = *resource.NewQuantity(defaultMaxPods, resource.DecimalSI)
+	}
+	capacityAnnotations[corev1.ResourcePods] = maxPods
+
+	infraObj, err := r.readInfrastructureReferenceResource()
+	if err != nil || infraObj == nil {
+		// because it is possible that the infrastructure provider does not implement
+		// the capacity in the infrastructure reference, if there are annotations we
+		// should return them here.
+		// Check against 1 here because the max pods is always set.
+		if len(capacityAnnotations) > 1 {
+			return capacityAnnotations, nil
+		}
+		return nil, err
+	}
+	capacityInfraStatus := resourceCapacityFromInfrastructureObject(infraObj)
+
+	// The annotations should override any values from the status block of the machine template.
+	// We loop through the status block capacity first, then overwrite any values with the
+	// annotation capacities.
+	capacity := map[corev1.ResourceName]resource.Quantity{}
+	for k, v := range capacityInfraStatus {
+		capacity[k] = v
+	}
+	for k, v := range capacityAnnotations {
+		capacity[k] = v
+	}
+
+	return capacity, nil
+}
+
+func (r unstructuredScalableResource) InstanceCPUCapacityAnnotation() (resource.Quantity, error) {
 	return parseCPUCapacity(r.unstructured.GetAnnotations())
 }

-func (r unstructuredScalableResource) InstanceMemoryCapacity() (resource.Quantity, error) {
+func (r unstructuredScalableResource) InstanceMemoryCapacityAnnotation() (resource.Quantity, error) {
 	return parseMemoryCapacity(r.unstructured.GetAnnotations())
 }

-func (r unstructuredScalableResource) InstanceGPUCapacity() (resource.Quantity, error) {
-	return parseGPUCapacity(r.unstructured.GetAnnotations())
+func (r unstructuredScalableResource) InstanceGPUCapacityAnnotation() (resource.Quantity, error) {
+	return parseGPUCount(r.unstructured.GetAnnotations())
 }

-func (r unstructuredScalableResource) InstanceMaxPodsCapacity() (resource.Quantity, error) {
+func (r unstructuredScalableResource) InstanceGPUTypeAnnotation() string {
+	return parseGPUType(r.unstructured.GetAnnotations())
+}
+
+func (r unstructuredScalableResource) InstanceMaxPodsCapacityAnnotation() (resource.Quantity, error) {
 	return parseMaxPodsCapacity(r.unstructured.GetAnnotations())
 }

+func (r unstructuredScalableResource) readInfrastructureReferenceResource() (*unstructured.Unstructured, error) {
+	infraref, found, err := unstructured.NestedStringMap(r.unstructured.Object, "spec", "template", "spec", "infrastructureRef")
+	if !found || err != nil {
+		return nil, nil
+	}
+
+	apiversion, ok := infraref["apiVersion"]
+	if !ok {
+		return nil, nil
+	}
+	kind, ok := infraref["kind"]
+	if !ok {
+		return nil, nil
+	}
+	name, ok := infraref["name"]
+	if !ok {
+		return nil, nil
+	}
+	// kind needs to be lower case and plural
+	kind = fmt.Sprintf("%ss", strings.ToLower(kind))
+	gvk := schema.FromAPIVersionAndKind(apiversion, kind)
+	res := schema.GroupVersionResource{Group: gvk.Group, Version: gvk.Version, Resource: gvk.Kind}
+
+	infra, err := r.controller.getInfrastructureResource(res, name, r.Namespace())
+	if err != nil {
+		klog.V(4).Infof("Unable to read infrastructure reference, error: %v", err)
+		return nil, err
+	}
+
+	return infra, nil
+}
+
 func newUnstructuredScalableResource(controller *machineController, u *unstructured.Unstructured) (*unstructuredScalableResource, error) {
 	minSize, maxSize, err := parseScalingBounds(u.GetAnnotations())
 	if err != nil {
@ -223,3 +341,21 @@ func newUnstructuredScalableResource(controller *machineController, u *unstructu
 		minSize:      minSize,
 	}, nil
 }
+
+func resourceCapacityFromInfrastructureObject(infraobj *unstructured.Unstructured) map[corev1.ResourceName]resource.Quantity {
+	capacity := map[corev1.ResourceName]resource.Quantity{}
+
+	infracap, found, err := unstructured.NestedStringMap(infraobj.Object, "status", "capacity")
+	if !found || err != nil {
+		return capacity
+	}
+
+	for k, v := range infracap {
+		// if we cannot parse the quantity, don't add it to the capacity
+		if value, err := resource.ParseQuantity(v); err == nil {
+			capacity[corev1.ResourceName(k)] = value
+		}
+	}
+
+	return capacity
+}
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_unstructured_test.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_unstructured_test.go
@ -18,11 +18,20 @@ package clusterapi

 import (
 	"context"
+	"fmt"
 	"testing"
+	"time"

 	"k8s.io/apimachinery/pkg/api/resource"
 	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
-	"k8s.io/autoscaler/cluster-autoscaler/utils/units"
+	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
+	"k8s.io/client-go/tools/cache"
+)
+
+const (
+	cpuStatusKey       = "cpu"
+	memoryStatusKey    = "memory"
+	nvidiaGpuStatusKey = "nvidia.com/gpu"
 )

 func TestSetSize(t *testing.T) {
@ -81,6 +90,7 @@ func TestSetSize(t *testing.T) {
 				nodeGroupMinSizeAnnotationKey: "1",
 				nodeGroupMaxSizeAnnotationKey: "10",
 			},
+			nil,
 		))
 	})

@ -93,6 +103,7 @@ func TestSetSize(t *testing.T) {
 				nodeGroupMinSizeAnnotationKey: "1",
 				nodeGroupMaxSizeAnnotationKey: "10",
 			},
+			nil,
 		))
 	})
 }
@ -193,11 +204,11 @@ func TestReplicas(t *testing.T) {
 	}

 	t.Run("MachineSet", func(t *testing.T) {
-		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), initialReplicas, nil))
+		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), initialReplicas, nil, nil))
 	})

 	t.Run("MachineDeployment", func(t *testing.T) {
-		test(t, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), initialReplicas, nil))
+		test(t, createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), initialReplicas, nil, nil))
 	})
 }

@ -252,6 +263,7 @@ func TestSetSizeAndReplicas(t *testing.T) {
 				nodeGroupMinSizeAnnotationKey: "1",
 				nodeGroupMaxSizeAnnotationKey: "10",
 			},
+			nil,
 		))
 	})

@ -264,57 +276,51 @@ func TestSetSizeAndReplicas(t *testing.T) {
 				nodeGroupMinSizeAnnotationKey: "1",
 				nodeGroupMaxSizeAnnotationKey: "10",
 			},
+			nil,
 		))
 	})
 }

 func TestAnnotations(t *testing.T) {
 	cpuQuantity := resource.MustParse("2")
-	memQuantity := resource.MustParse("1024")
+	memQuantity := resource.MustParse("1024Mi")
 	gpuQuantity := resource.MustParse("1")
 	maxPodsQuantity := resource.MustParse("42")
 	annotations := map[string]string{
-		cpuKey:     cpuQuantity.String(),
-		memoryKey:  memQuantity.String(),
-		gpuKey:     gpuQuantity.String(),
-		maxPodsKey: maxPodsQuantity.String(),
+		cpuKey:      cpuQuantity.String(),
+		memoryKey:   memQuantity.String(),
+		gpuCountKey: gpuQuantity.String(),
+		maxPodsKey:  maxPodsQuantity.String(),
 	}

-	// convert the initial memory value from Mebibytes to bytes as this conversion happens internally
-	// when we use InstanceMemoryCapacity()
-	memVal, _ := memQuantity.AsInt64()
-	memQuantityAsBytes := resource.NewQuantity(memVal*units.MiB, resource.DecimalSI)
-
-	test := func(t *testing.T, testConfig *testConfig) {
+	test := func(t *testing.T, testConfig *testConfig, testResource *unstructured.Unstructured) {
 		controller, stop := mustCreateTestController(t, testConfig)
 		defer stop()

-		testResource := testConfig.machineSet
-
 		sr, err := newUnstructuredScalableResource(controller, testResource)
 		if err != nil {
 			t.Fatal(err)
 		}

-		if cpu, err := sr.InstanceCPUCapacity(); err != nil {
+		if cpu, err := sr.InstanceCPUCapacityAnnotation(); err != nil {
 			t.Fatal(err)
 		} else if cpuQuantity.Cmp(cpu) != 0 {
 			t.Errorf("expected %v, got %v", cpuQuantity, cpu)
 		}

-		if mem, err := sr.InstanceMemoryCapacity(); err != nil {
+		if mem, err := sr.InstanceMemoryCapacityAnnotation(); err != nil {
 			t.Fatal(err)
-		} else if memQuantityAsBytes.Cmp(mem) != 0 {
+		} else if memQuantity.Cmp(mem) != 0 {
 			t.Errorf("expected %v, got %v", memQuantity, mem)
 		}

-		if gpu, err := sr.InstanceGPUCapacity(); err != nil {
+		if gpu, err := sr.InstanceGPUCapacityAnnotation(); err != nil {
 			t.Fatal(err)
 		} else if gpuQuantity.Cmp(gpu) != 0 {
 			t.Errorf("expected %v, got %v", gpuQuantity, gpu)
 		}

-		if maxPods, err := sr.InstanceMaxPodsCapacity(); err != nil {
+		if maxPods, err := sr.InstanceMaxPodsCapacityAnnotation(); err != nil {
 			t.Fatal(err)
 		} else if maxPodsQuantity.Cmp(maxPods) != 0 {
 			t.Errorf("expected %v, got %v", maxPodsQuantity, maxPods)
@ -322,7 +328,13 @@ func TestAnnotations(t *testing.T) {
 	}

 	t.Run("MachineSet", func(t *testing.T) {
-		test(t, createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, annotations))
+		testConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, annotations, nil)
+		test(t, testConfig, testConfig.machineSet)
+	})
+
+	t.Run("MachineDeployment", func(t *testing.T) {
+		testConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, annotations, nil)
+		test(t, testConfig, testConfig.machineDeployment)
 	})
 }

@ -330,40 +342,103 @@ func TestCanScaleFromZero(t *testing.T) {
 	testConfigs := []struct {
 		name        string
 		annotations map[string]string
+		capacity    map[string]string
 		canScale    bool
 	}{
 		{
-			"MachineSet can scale from zero",
+			"can scale from zero",
 			map[string]string{
 				cpuKey:    "1",
-				memoryKey: "1024",
+				memoryKey: "1024Mi",
+			},
+			nil,
+			true,
+		},
+		{
+			"with missing CPU info cannot scale from zero",
+			map[string]string{
+				memoryKey: "1024Mi",
+			},
+			nil,
+			false,
+		},
+		{
+			"with missing Memory info cannot scale from zero",
+			map[string]string{
+				cpuKey: "1",
+			},
+			nil,
+			false,
+		},
+		{
+			"with no information cannot scale from zero",
+			map[string]string{},
+			nil,
+			false,
+		},
+		{
+			"with capacity in machine template can scale from zero",
+			map[string]string{},
+			map[string]string{
+				cpuStatusKey:    "1",
+				memoryStatusKey: "4G",
 			},
 			true,
 		},
 		{
-			"MachineSet with missing CPU info cannot scale from zero",
+			"with missing cpu capacity in machine template cannot scale from zero",
+			map[string]string{},
 			map[string]string{
-				memoryKey: "1024",
+				memoryStatusKey: "4G",
 			},
 			false,
 		},
 		{
-			"MachineSet with missing Memory info cannot scale from zero",
+			"with missing memory capacity in machine template cannot scale from zero",
+			map[string]string{},
+			map[string]string{
+				cpuStatusKey: "1",
+			},
+			false,
+		},
+		{
+			"with both annotations and capacity in machine template can scale from zero",
+			map[string]string{
+				cpuKey:    "1",
+				memoryKey: "1024Mi",
+			},
+			map[string]string{
+				cpuStatusKey:    "1",
+				memoryStatusKey: "4G",
+			},
+			true,
+		},
+		{
+			"with incomplete annotations and capacity in machine template cannot scale from zero",
 			map[string]string{
 				cpuKey: "1",
 			},
+			map[string]string{
+				nvidiaGpuStatusKey: "1",
+			},
 			false,
 		},
 		{
-			"MachineSet with no information cannot scale from zero",
-			map[string]string{},
-			false,
+			"with complete information split across annotations and capacity in machine template can scale from zero",
+			map[string]string{
+				cpuKey: "1",
+			},
+			map[string]string{
+				memoryStatusKey: "4G",
+			},
+			true,
 		},
 	}

 	for _, tc := range testConfigs {
-		t.Run(tc.name, func(t *testing.T) {
-			msTestConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, tc.annotations)
+		testname := fmt.Sprintf("MachineSet %s", tc.name)
+		t.Run(testname, func(t *testing.T) {
+			msTestConfig := createMachineSetTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, tc.annotations, tc.capacity)
 			controller, stop := mustCreateTestController(t, msTestConfig)
 			defer stop()

@ -380,4 +455,25 @@ func TestCanScaleFromZero(t *testing.T) {
 			}
 		})
 	}
+
+	for _, tc := range testConfigs {
+		testname := fmt.Sprintf("MachineDeployment %s", tc.name)
+		t.Run(testname, func(t *testing.T) {
+			msTestConfig := createMachineDeploymentTestConfig(RandomString(6), RandomString(6), RandomString(6), 1, tc.annotations, tc.capacity)
+			controller, stop := mustCreateTestController(t, msTestConfig)
+			defer stop()
+
+			testResource := msTestConfig.machineDeployment
+
+			sr, err := newUnstructuredScalableResource(controller, testResource)
+			if err != nil {
+				t.Fatal(err)
+			}
+
+			canScale := sr.CanScaleFromZero()
+			if canScale != tc.canScale {
+				t.Errorf("expected %v, got %v", tc.canScale, canScale)
+			}
+		})
+	}
 }
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_utils.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_utils.go
@ -25,7 +25,6 @@ import (
 	"k8s.io/apimachinery/pkg/api/resource"
 	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
 	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
-	"k8s.io/autoscaler/cluster-autoscaler/utils/units"
 )

 const (
@ -33,10 +32,11 @@ const (
 	deprecatedNodeGroupMaxSizeAnnotationKey = "cluster.k8s.io/cluster-api-autoscaler-node-group-max-size"
 	deprecatedClusterNameLabel              = "cluster.k8s.io/cluster-name"

-	cpuKey     = "machine.openshift.io/vCPU"
-	memoryKey  = "machine.openshift.io/memoryMb"
-	gpuKey     = "machine.openshift.io/GPU"
-	maxPodsKey = "machine.openshift.io/maxPods"
+	cpuKey      = "capacity.cluster-autoscaler.kubernetes.io/cpu"
+	memoryKey   = "capacity.cluster-autoscaler.kubernetes.io/memory"
+	gpuTypeKey  = "capacity.cluster-autoscaler.kubernetes.io/gpu-type"
+	gpuCountKey = "capacity.cluster-autoscaler.kubernetes.io/gpu-count"
+	maxPodsKey  = "capacity.cluster-autoscaler.kubernetes.io/maxPods"
 )

 var (
@ -63,7 +63,23 @@ var (
 	// machine set has a non-integral max annotation value.
 	errInvalidMaxAnnotation = errors.New("invalid max annotation")

-	zeroQuantity = resource.MustParse("0")
+	// machineDeleteAnnotationKey is the annotation used by cluster-api to indicate
+	// that a machine should be deleted. Because this key can be affected by the
+	// CAPI_GROUP env variable, it is initialized here.
+	machineDeleteAnnotationKey = getMachineDeleteAnnotationKey()
+
+	// machineAnnotationKey is the annotation used by the cluster-api on Node objects
+	// to specify the name of the related Machine object. Because this can be affected
+	// by the CAPI_GROUP env variable, it is initialized here.
+	machineAnnotationKey = getMachineAnnotationKey()
+
+	// nodeGroupMinSizeAnnotationKey and nodeGroupMaxSizeAnnotationKey are the keys
+	// used in MachineSet and MachineDeployment annotations to specify the limits
+	// for the node group. Because the keys can be affected by the CAPI_GROUP env
+	// variable, they are initialized here.
+	nodeGroupMinSizeAnnotationKey = getNodeGroupMinSizeAnnotationKey()
+	nodeGroupMaxSizeAnnotationKey = getNodeGroupMaxSizeAnnotationKey()
+	zeroQuantity                  = resource.MustParse("0")
 )

 type normalizedProviderID string
@ -155,7 +171,7 @@ func normalizedProviderString(s string) normalizedProviderID {
 	return normalizedProviderID(split[len(split)-1])
 }

-func scaleFromZeroEnabled(annotations map[string]string) bool {
+func scaleFromZeroAnnotationsEnabled(annotations map[string]string) bool {
 	cpu := annotations[cpuKey]
 	mem := annotations[memoryKey]

@ -172,31 +188,42 @@ func parseKey(annotations map[string]string, key string) (resource.Quantity, err
 	return zeroQuantity.DeepCopy(), nil
 }

+func parseIntKey(annotations map[string]string, key string) (resource.Quantity, error) {
+	if val, exists := annotations[key]; exists && val != "" {
+		valInt, err := strconv.ParseInt(val, 10, 0)
+		if err != nil {
+			return zeroQuantity.DeepCopy(), fmt.Errorf("value %q from annotation %q expected to be an integer: %v", val, key, err)
+		}
+		return *resource.NewQuantity(valInt, resource.DecimalSI), nil
+	}
+	return zeroQuantity.DeepCopy(), nil
+}
+
 func parseCPUCapacity(annotations map[string]string) (resource.Quantity, error) {
 	return parseKey(annotations, cpuKey)
 }

 func parseMemoryCapacity(annotations map[string]string) (resource.Quantity, error) {
-	// The value for the memoryKey is expected to be an integer representing Mebibytes. e.g. "1024".
-	// https://www.iec.ch/si/binary.htm
-	val, exists := annotations[memoryKey]
-	if exists && val != "" {
-		valInt, err := strconv.ParseInt(val, 10, 0)
-		if err != nil {
-			return zeroQuantity.DeepCopy(), fmt.Errorf("value %q from annotation %q expected to be an integer: %v", val, memoryKey, err)
-		}
-		// Convert from Mebibytes to bytes
-		return *resource.NewQuantity(valInt*units.MiB, resource.DecimalSI), nil
-	}
-	return zeroQuantity.DeepCopy(), nil
+	return parseKey(annotations, memoryKey)
 }

-func parseGPUCapacity(annotations map[string]string) (resource.Quantity, error) {
-	return parseKey(annotations, gpuKey)
+func parseGPUCount(annotations map[string]string) (resource.Quantity, error) {
+	return parseIntKey(annotations, gpuCountKey)
+}
+
+// The GPU type is not currently considered by the autoscaler when planning
+// expansion, but most likely will be in the future. This method is being added
+// in expectation of that arrival.
+// see https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/utils/gpu/gpu.go
+func parseGPUType(annotations map[string]string) string {
+	if val, found := annotations[gpuTypeKey]; found {
+		return val
+	}
+	return ""
 }

 func parseMaxPodsCapacity(annotations map[string]string) (resource.Quantity, error) {
-	return parseKey(annotations, maxPodsKey)
+	return parseIntKey(annotations, maxPodsKey)
 }

 func clusterNameFromResource(r *unstructured.Unstructured) string {
--- a/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_utils_test.go
+++ b/cluster-autoscaler/cloudprovider/clusterapi/clusterapi_utils_test.go
@ -452,7 +452,6 @@ func TestScaleFromZeroEnabled(t *testing.T) {
 		annotations: map[string]string{
 			"foo":  "bar",
 			cpuKey: "1",
-			gpuKey: "2",
 		},
 		enabled: false,
 	}, {
@ -460,12 +459,12 @@ func TestScaleFromZeroEnabled(t *testing.T) {
 		annotations: map[string]string{
 			"foo":     "bar",
 			cpuKey:    "1",
-			memoryKey: "2",
+			memoryKey: "2Mi",
 		},
 		enabled: true,
 	}} {
 		t.Run(tc.description, func(t *testing.T) {
-			got := scaleFromZeroEnabled(tc.annotations)
+			got := scaleFromZeroAnnotationsEnabled(tc.annotations)
 			if tc.enabled != got {
 				t.Errorf("expected %t, got %t", tc.enabled, got)
 			}
@ -542,20 +541,20 @@ func TestParseMemoryCapacity(t *testing.T) {
 		expectedQuantity: zeroQuantity.DeepCopy(),
 		expectedError:    true,
 	}, {
-		description:      "valid quantity",
-		annotations:      map[string]string{memoryKey: "456"},
+		description:      "quantity as with no unit type",
+		annotations:      map[string]string{memoryKey: "1024"},
+		expectedQuantity: *resource.NewQuantity(1024, resource.DecimalSI),
 		expectedError:    false,
-		expectedQuantity: *resource.NewQuantity(456*units.MiB, resource.DecimalSI),
 	}, {
 		description:      "quantity with unit type (Mi)",
 		annotations:      map[string]string{memoryKey: "456Mi"},
-		expectedError:    true,
-		expectedQuantity: zeroQuantity.DeepCopy(),
+		expectedError:    false,
+		expectedQuantity: *resource.NewQuantity(456*units.MiB, resource.DecimalSI),
 	}, {
 		description:      "quantity with unit type (Gi)",
 		annotations:      map[string]string{memoryKey: "8Gi"},
-		expectedError:    true,
-		expectedQuantity: zeroQuantity.DeepCopy(),
+		expectedError:    false,
+		expectedQuantity: *resource.NewQuantity(8*units.GiB, resource.DecimalSI),
 	}} {
 		t.Run(tc.description, func(t *testing.T) {
 			got, err := parseMemoryCapacity(tc.annotations)
@ -586,17 +585,22 @@ func TestParseGPUCapacity(t *testing.T) {
 		expectedError:    false,
 	}, {
 		description:      "bad quantity",
-		annotations:      map[string]string{gpuKey: "not-a-quantity"},
+		annotations:      map[string]string{gpuCountKey: "not-a-quantity"},
 		expectedQuantity: zeroQuantity.DeepCopy(),
 		expectedError:    true,
 	}, {
 		description:      "valid quantity",
-		annotations:      map[string]string{gpuKey: "13"},
+		annotations:      map[string]string{gpuCountKey: "13"},
 		expectedError:    false,
 		expectedQuantity: resource.MustParse("13"),
+	}, {
+		description:      "valid quantity, bad unit type",
+		annotations:      map[string]string{gpuCountKey: "13Mi"},
+		expectedQuantity: zeroQuantity.DeepCopy(),
+		expectedError:    true,
 	}} {
 		t.Run(tc.description, func(t *testing.T) {
-			got, err := parseGPUCapacity(tc.annotations)
+			got, err := parseGPUCount(tc.annotations)
 			if tc.expectedError && err == nil {
 				t.Fatal("expected an error")
 			}
@ -632,6 +636,11 @@ func TestParseMaxPodsCapacity(t *testing.T) {
 		annotations:      map[string]string{maxPodsKey: "13"},
 		expectedError:    false,
 		expectedQuantity: resource.MustParse("13"),
+	}, {
+		description:      "valid quantity, bad unit type",
+		annotations:      map[string]string{maxPodsKey: "13Mi"},
+		expectedQuantity: zeroQuantity.DeepCopy(),
+		expectedError:    true,
 	}} {
 		t.Run(tc.description, func(t *testing.T) {
 			got, err := parseMaxPodsCapacity(tc.annotations)