Merge pull request #31440 from MikeSpreitzer/note-apf-autoupdate

Catch APF description up with recent developments
2022-02-03 03:05:45 -08:00 · 2022-02-03 03:05:45 -08:00 · 2d6d22ddec
parent d7e1bcaa36 2bfc31833f
commit 2d6d22ddec
2 changed files with 156 additions and 82 deletions
--- a/content/en/docs/concepts/cluster-administration/flow-control.md
+++ b/content/en/docs/concepts/cluster-administration/flow-control.md
@ -42,21 +42,21 @@ Fairness feature enabled.
 ## Enabling/Disabling API Priority and Fairness
 The API Priority and Fairness feature is controlled by a feature gate
-and is enabled by default.  See
+and is enabled by default.  See [Feature
-[Feature Gates](/docs/reference/command-line-tools-reference/feature-gates/)
+Gates](/docs/reference/command-line-tools-reference/feature-gates/)
 for a general explanation of feature gates and how to enable and
 disable them.  The name of the feature gate for APF is
 "APIPriorityAndFairness".  This feature also involves an {{<
 glossary_tooltip term_id="api-group" text="API Group" >}} with: (a) a
-`v1alpha1` version, disabled by default, and (b) a `v1beta1`
+`v1alpha1` version, disabled by default, and (b) `v1beta1` and
-version,  enabled by default.  You can disable the feature
+`v1beta2` versions, enabled by default.  You can disable the feature
-gate and API group v1beta1 version by adding the following
+gate and API group beta versions by adding the following
 command-line flags to your `kube-apiserver` invocation:
 ```shell
 kube-apiserver \
 --feature-gates=APIPriorityAndFairness=false \
--runtime-config=flowcontrol.apiserver.k8s.io/v1beta1=false \
+--runtime-config=flowcontrol.apiserver.k8s.io/v1beta1=false,flowcontrol.apiserver.k8s.io/v1beta2=false \
 # …and other flags as usual
 ```
@ -127,86 +127,13 @@ any of the limitations imposed by this feature. These exemptions prevent an
 improperly-configured flow control configuration from totally disabling an API
 server.
 ## Defaults
 The Priority and Fairness feature ships with a suggested configuration that
 should suffice for experimentation; if your cluster is likely to
 experience heavy load then you should consider what configuration will work
 best. The suggested configuration groups requests into five priority
 classes:
 * The `system` priority level is for requests from the `system:nodes` group,
  i.e. Kubelets, which must be able to contact the API server in order for
  workloads to be able to schedule on them.
 * The `leader-election` priority level is for leader election requests from
  built-in controllers (in particular, requests for `endpoints`, `configmaps`,
  or `leases` coming from the `system:kube-controller-manager` or
  `system:kube-scheduler` users and service accounts in the `kube-system`
  namespace). These are important to isolate from other traffic because failures
  in leader election cause their controllers to fail and restart, which in turn
  causes more expensive traffic as the new controllers sync their informers.
 * The `workload-high` priority level is for other requests from built-in
  controllers.
 * The `workload-low` priority level is for requests from any other service
  account, which will typically include all requests from controllers running in
  Pods.
 * The `global-default` priority level handles all other traffic, e.g.
  interactive `kubectl` commands run by nonprivileged users.
 Additionally, there are two PriorityLevelConfigurations and two FlowSchemas that
 are built in and may not be overwritten:
 * The special `exempt` priority level is used for requests that are not subject
  to flow control at all: they will always be dispatched immediately. The
  special `exempt` FlowSchema classifies all requests from the `system:masters`
  group into this priority level. You may define other FlowSchemas that direct
  other requests to this priority level, if appropriate.
 * The special `catch-all` priority level is used in combination with the special
  `catch-all` FlowSchema to make sure that every request gets some kind of
  classification. Typically you should not rely on this catch-all configuration,
  and should create your own catch-all FlowSchema and PriorityLevelConfiguration
  (or use the `global-default` configuration that is installed by default) as
  appropriate. To help catch configuration errors that miss classifying some
  requests, the mandatory `catch-all` priority level only allows one concurrency
  share and does not queue requests, making it relatively likely that traffic
  that only matches the `catch-all` FlowSchema will be rejected with an HTTP 429
  error.
 ## Health check concurrency exemption
 The suggested configuration gives no special treatment to the health
 check requests on kube-apiservers from their local kubelets --- which
 tend to use the secured port but supply no credentials.  With the
 suggested config, these requests get assigned to the `global-default`
 FlowSchema and the corresponding `global-default` priority level,
 where other traffic can crowd them out.
 If you add the following additional FlowSchema, this exempts those
 requests from rate limiting.
 {{< caution >}}
 Making this change also allows any hostile party to then send
 health-check requests that match this FlowSchema, at any volume they
 like.  If you have a web traffic filter or similar external security
 mechanism to protect your cluster's API server from general internet
 traffic, you can configure rules to block any health check requests
 that originate from outside your cluster.
 {{< /caution >}}
 {{< codenew file="priority-and-fairness/health-for-strangers.yaml" >}}
 ## Resources
 The flow control API involves two kinds of resources.
-[PriorityLevelConfigurations](/docs/reference/generated/kubernetes-api/{{< param "version" >}}/#prioritylevelconfiguration-v1beta1-flowcontrol-apiserver-k8s-io)
+[PriorityLevelConfigurations](/docs/reference/generated/kubernetes-api/{{< param "version" >}}/#prioritylevelconfiguration-v1beta2-flowcontrol-apiserver-k8s-io)
 define the available isolation classes, the share of the available concurrency
 budget that each can handle, and allow for fine-tuning queuing behavior.
-[FlowSchemas](/docs/reference/generated/kubernetes-api/{{< param "version" >}}/#flowschema-v1beta1-flowcontrol-apiserver-k8s-io)
+[FlowSchemas](/docs/reference/generated/kubernetes-api/{{< param "version" >}}/#flowschema-v1beta2-flowcontrol-apiserver-k8s-io)
 are used to classify individual inbound requests, matching each to a
 single PriorityLevelConfiguration.  There is also a `v1alpha1` version
 of the same API group, and it has the same Kinds with the same syntax and
@ -329,6 +256,153 @@ omitted entirely), in which case all requests matched by this FlowSchema will be
 considered part of a single flow. The correct choice for a given FlowSchema
 depends on the resource and your particular environment.
 ## Defaults
 Each kube-apiserver maintains two sorts of APF configuration objects:
 mandatory and suggested.
 ### Mandatory Configuration Objects
 The four mandatory configuration objects reflect fixed built-in
 guardrail behavior.  This is behavior that the servers have before
 those objects exist, and when those objects exist their specs reflect
 this behavior.  The four mandatory objects are as follows.
 * The mandatory `exempt` priority level is used for requests that are
  not subject to flow control at all: they will always be dispatched
  immediately. The mandatory `exempt` FlowSchema classifies all
  requests from the `system:masters` group into this priority
  level. You may define other FlowSchemas that direct other requests
  to this priority level, if appropriate.
 * The mandatory `catch-all` priority level is used in combination with
  the mandatory `catch-all` FlowSchema to make sure that every request
  gets some kind of classification. Typically you should not rely on
  this catch-all configuration, and should create your own catch-all
  FlowSchema and PriorityLevelConfiguration (or use the suggested
  `global-default` priority level that is installed by default) as
  appropriate. Because it is not expected to be used normally, the
  mandatory `catch-all` priority level has a very small concurrency
  share and does not queue requests.
 ### Suggested Configuration Objects
 The suggested FlowSchemas and PriorityLevelConfigurations constitute a
 reasonable default configuration.  You can modify these and/or create
 additional configuration objects if you want.  If your cluster is
 likely to experience heavy load then you should consider what
 configuration will work best.
 The suggested configuration groups requests into six priority levels:
 * The `node-high` priority level is for health updates from nodes.
 * The `system` priority level is for non-health requests from the
  `system:nodes` group, i.e. Kubelets, which must be able to contact
  the API server in order for workloads to be able to schedule on
  them.
 * The `leader-election` priority level is for leader election requests from
  built-in controllers (in particular, requests for `endpoints`, `configmaps`,
  or `leases` coming from the `system:kube-controller-manager` or
  `system:kube-scheduler` users and service accounts in the `kube-system`
  namespace). These are important to isolate from other traffic because failures
  in leader election cause their controllers to fail and restart, which in turn
  causes more expensive traffic as the new controllers sync their informers.
 * The `workload-high` priority level is for other requests from built-in
  controllers.
 * The `workload-low` priority level is for requests from any other service
  account, which will typically include all requests from controllers running in
  Pods.
 * The `global-default` priority level handles all other traffic, e.g.
  interactive `kubectl` commands run by nonprivileged users.
 The suggested FlowSchemas serve to steer requests into the above
 priority levels, and are not enumerated here.
 ### Maintenance of the Mandatory and Suggested Configuration Objects
 Each `kube-apiserver` independently maintains the mandatory and
 suggested configuration objects, using initial and periodic behavior.
 Thus, in a situation with a mixture of servers of different versions
 there may be thrashing as long as different servers have different
 opinions of the proper content of these objects.
 Each `kube-apiserver` makes an inital maintenance pass over the
 mandatory and suggested configuration objects, and after that does
 periodic maintenance (once per minute) of those objects.
 For the mandatory configuration objects, maintenance consists of
 ensuring that the object exists and, if it does, has the proper spec.
 The server refuses to allow a creation or update with a spec that is
 inconsistent with the server's guardrail behavior.
 Maintenance of suggested configuration objects is designed to allow
 their specs to be overridden.  Deletion, on the other hand, is not
 respected: maintenance will restore the object.  If you do not want a
 suggested configuration object then you need to keep it around but set
 its spec to have minimal consequences.  Maintenance of suggested
 objects is also designed to support automatic migration when a new
 version of the `kube-apiserver` is rolled out, albeit potentially with
 thrashing while there is a mixed population of servers.
 Maintenance of a suggested configuration object consists of creating
 it --- with the server's suggested spec --- if the object does not
 exist.  OTOH, if the object already exists, maintenance behavior
 depends on whether the `kube-apiservers` or the users control the
 object.  In the former case, the server ensures that the object's spec
 is what the server suggests; in the latter case, the spec is left
 alone.
 The question of who controls the object is answered by first looking
 for an annotation with key `apf.kubernetes.io/autoupdate-spec`.  If
 there is such an annotation and its value is `true` then the
 kube-apiservers control the object.  If there is such an annotation
 and its value is `false` then the users control the object.  If
 neither of those condtions holds then the `metadata.generation` of the
 object is consulted.  If that is 1 then the kube-apiservers control
 the object.  Otherwise the users control the object.  These rules were
 introduced in release 1.22 and their consideration of
 `metadata.generation` is for the sake of migration from the simpler
 earlier behavior.  Users who wish to control a suggested configuration
 object should set its `apf.kubernetes.io/autoupdate-spec` annotation
 to `false`.
 Maintenance of a mandatory or suggested configuration object also
 includes ensuring that it has an `apf.kubernetes.io/autoupdate-spec`
 annotation that accurately reflects whether the kube-apiservers
 control the object.
 Maintenance also includes deleting objects that are neither mandatory
 nor suggested but are annotated
 `apf.kubernetes.io/autoupdate-spec=true`.
 ## Health check concurrency exemption
 The suggested configuration gives no special treatment to the health
 check requests on kube-apiservers from their local kubelets --- which
 tend to use the secured port but supply no credentials.  With the
 suggested config, these requests get assigned to the `global-default`
 FlowSchema and the corresponding `global-default` priority level,
 where other traffic can crowd them out.
 If you add the following additional FlowSchema, this exempts those
 requests from rate limiting.
 {{< caution >}}
 Making this change also allows any hostile party to then send
 health-check requests that match this FlowSchema, at any volume they
 like.  If you have a web traffic filter or similar external security
 mechanism to protect your cluster's API server from general internet
 traffic, you can configure rules to block any health check requests
 that originate from outside your cluster.
 {{< /caution >}}
 {{< codenew file="priority-and-fairness/health-for-strangers.yaml" >}}
 ## Diagnostics
 Every HTTP response from an API server with the priority and fairness feature
--- a/content/en/examples/priority-and-fairness/health-for-strangers.yaml
+++ b/content/en/examples/priority-and-fairness/health-for-strangers.yaml
@ -1,4 +1,4 @@
-apiVersion: flowcontrol.apiserver.k8s.io/v1beta1
+apiVersion: flowcontrol.apiserver.k8s.io/v1beta2
 kind: FlowSchema
 metadata:
  name: health-for-strangers