linkerd2

Commit Graph

Author	SHA1	Message	Date
Alejandro Pedraza	7428d4aa51	Removed dupe imports (#10049 ) * Removed dupe imports My IDE (vim-gopls) has been complaining for a while, so I decided to take care of it. Found via [staticcheck](https://github.com/dominikh/go-tools) * Add stylecheck to go-lint checks	2023-01-10 14:34:56 -05:00
Steve Jenson	309e8d1210	Validate CNI configurations during pod startup (#9678 ) When users use CNI, we want to ensure that network rewriting inside the pod is setup before allowing linkerd to start. When rewriting isn't happening, we want to exit with a clear error message and enough information in the container log for the administrator to either file a bug report with us or fix their configuration. This change adds a validator initContainer to all injected workloads, when linkerd is installed with "cniEnabled=false". The validator replaces the noop init container, and will prevent pods from starting up if iptables is not configured. Part of #8120 Signed-off-by: Steve Jenson <stevej@buoyant.io>	2022-10-26 11:14:45 +01:00
Alejandro Pedraza	e6fa5a7156	Replace usage of io/ioutil package (#9613 ) `io/ioutil` has been deprecated since go 1.16 and the linter started to complain about it.	2022-10-13 12:10:58 -05:00
Alejandro Pedraza	b65364704b	Add config proxyInit.runAsUser to facilitate 2.11.x->2.12.0 upgrade (#9201 ) In 2.11.x, proxyInit.runAsRoot was true by default, which caused the proxy-init's runAsUser field to be 0. proxyInit.runAsRoot is now defaulted to false in 2.12.0, but runAsUser still isn't configurable, and when following the upgrade instructions here, helm doesn't change runAsUser and so it conflicts with the new value for runAsRoot=false, resulting in the pods erroring with this message: Error: container's runAsUser breaks non-root policy (pod: "linkerd-identity-bc649c5f9-ckqvg_linkerd(fb3416d2-c723-4664-acf1-80a64a734561)", container: linkerd-init) This PR adds a new default for runAsUser to avoid this issue.	2022-08-19 09:07:13 -05:00
Eliza Weisman	f6c6ff965c	inject: fix --default-inbound-policy not setting annotation (#9197 ) Depends on #9195 Currently, `linkerd inject --default-inbound-policy` does not set the `config.linkerd.io/default-inbound-policy` annotation on the injected resource(s). The `inject` command does _try_ to set that annotation if it's set in the `Values` generated by `proxyFlagSet`: `14d1dbb3b7/cli/cmd/inject.go (L485-L487)` ...but, the flag in the proxy `FlagSet` doesn't set `Values.Proxy.DefaultInboundPolicy`, it sets `Values.PolicyController.DefaultAllowPolicy`: `7c5e3aaf40/cli/cmd/options.go (L375-L379)` This is because the flag set is shared across `linkerd inject` and `linkerd install` subcommands, and in `linkerd install`, we want to set the default policy for the whole cluster by configuring the policy controller. In `linkerd inject`, though, we want to add the annotation to the injected pods only. This branch fixes this issue by changing the flag so that it sets the `Values.Proxy.DefaultInboundPolicy` instead of the `Values.PolicyController.DefaultAllowPolicy` value. In `linkerd install`, we then set `Values.PolicyController.DefaultAllowPolicy` based on the value of `Values.Proxy.DefaultInboundPolicy`, while in `inject`, we will now actually add the annotation. This branch is based on PR #9195, which adds validation to reject invalid values for `--default-inbound-policy`, rather than on `main`. This is because the validation code added in that PR had to be moved around a bit, since it now needs to validate the `Values.Proxy.DefaultInboundPolicy` value rather than the `Values.PolicyController.DefaultAllowPolicy` value. I thought using #9195 as a base branch was better than basing this on `main` and then having to resolve merge conflicts later. When that PR merges, this can be rebased onto `main`. Fixes #9168	2022-08-18 17:16:27 -07:00
Kevin Leimkuhler	ddc214acdf	Validate `--default-inbound-policy` values (#9195 ) Closes #9148 With this change, the value of `—default-inbound-policy` is verified to be one of the accepted values. When the value is not an accepted value we now error ```shell $ linkerd install --default-inbound-policy=everybody Error: --default-inbound-policy must be one of: all-authenticated, all-unauthenticated, cluster-authenticated, cluster-unauthenticated, deny (got everybody) Usage: linkerd install [flags] ... ``` A unit test has also been added. Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2022-08-17 19:42:01 -06:00
Dani Baeyens	074f5e6cdf	Allows RSA signed trust anchors on linkerd cli (#7771 ) (#8868 ) * Allows RSA signed trust anchors on linkerd cli (#7771) Linkerd currently forces using an ECDSA P-256 issuer certificate along with a ECDSA trust anchor. Still, it's still cryptographically valid to have an ECDSA P-256 issuer certificate issued by an RSA signed CA. CheckCertAlgoRequirements checks if CA cert uses ECDSA or RSA 2048/4096 signing algorithm. Fixes #7771 Signed-off-by: Baeyens, Daniel <daniel.baeyens@gmail.com> Co-authored-by: Alejandro Pedraza <alejandro@buoyant.io>	2022-08-08 08:04:24 -05:00
Matei David	e4f7788c14	Change default iptables mode to legacy (#9097 ) Some hosts may not have 'nft' modules available. Currently, proxy-init defaults to using 'iptables-nft'; if the host does not have support for nft modules, the init container will crash, blocking all injected workloads from starting up. This change defaults the 'iptablesMode' value to 'legacy'. * Update linkerd-control-plane/values file default * Update proxy-init partial to default to 'legacy' when no mode is specified * Change expected values in 'pkg/charts/linkerd2/values_test.go' and in 'cli/cmd/install_test' * Update golden files Fixes #9053 Signed-off-by: Matei David <matei@buoyant.io>	2022-08-05 10:45:29 -06:00
Kevin Leimkuhler	c6693a5ae3	Add `policyController.probeNetworks` configuration value (#9091 ) Closes #8945 This adds the `policyController.probeNetworks` configuration value so that users can configure the networks from which probes are expected to be performed. By default, we allow all networks (`0.0.0.0/0`). Additionally, this value differs from `clusterNetworks` is that it is a list of networks, and thus we have to join the values in the Helm templating. Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2022-08-05 10:43:22 -06:00
Kevin Leimkuhler	c006f7b4a2	Allow disabling `linkerd-await` on control plane pods (#9059 ) > In some circumstances, the lifecycle.postStart hook can cause the linkerd-proxy > container to get stuck waiting for identity verification. After the > linkerd-await timeout, the container will be restarted and the proxy starts > without further incident. The linkerd-control-plane helm chart currently has a > way to disable the lifecycle hook for injected proxies, but not for proxies on > the control plane pods. > > This commit adds a new value to the linkerd-control-plane chart of > proxy.controlPlaneAwait that can be used to disable the postStart lifecycle hook > on the destination and proxy-injector pods. This is defaulted to true to > maintain current behavior. > > The linkerd-control-plane chart was templated, setting proxy.controlPlaneAwait > to true and false, verifying that the postStart lifecycle hook was either > present or absent depending on the proxy.controlPlaneAwait value. > > Fixes #8738 This continues the now stale #8739 and removes the version bumps that were requested. Signed-off-by: Jacob Lambert [calrisian777@gmail.com](mailto:calrisian777@gmail.com) Co-authored-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2022-08-03 16:09:42 -04:00
Matei David	9dd51d3897	Add `iptablesMode` flag to proxy-init (#8887 ) This change introduces a new value to be used at install (or upgrade) time. The value (`proxyInit.iptablesMode=nft\|legacy`) is responsible for starting the proxy-init container in nft or legacy mode. By default, the init container will use iptables-nft. When the mode is set to `nft`, it will instead use iptables-nft. Most modern Linux distributions support both, but a subset (such as RHEL based families) only support iptables-nft and nf_tables. Signed-off-by: Matei David <matei@buoyant.io>	2022-07-27 21:45:19 -07:00
Alex Leong	df177e67eb	Add HttpRoute CRD (#8675 ) Fixes #8660 We add the HttpRoute CRD to the CRDs installed with `linkerd install --crds` and `linkerd upgrade --crds`. You can use the `--set installHttpRoute=false` to skip installing this CRD. Signed-off-by: Alex Leong <alex@buoyant.io>	2022-06-29 09:50:23 -07:00
Alex Leong	893fa78671	Split HA functionality into multiple configurable values (#8445 ) Some autoscalers, namely Karpenter, don't allow podAntiAffinity and the enablePodAntiAffinity flag is currently overloaded with other HA requirements. This commit splits out the PDB and updateStrategy configuration into separate value inputs. Fixes #8062 Signed-off-by: Alex Leong <alex@buoyant.io> Co-authored-by: Evan Hines <evan@firebolt.io>	2022-05-10 09:49:58 -07:00
Oliver Gould	33c1d610ad	test: Diff structured YAML when possible (#8432 ) When we compare generated manifests against fixtures, we do a simple string comparison to compare output. The diffed data can be pretty hard to understand. This change adds a new test helper, `DiffTestYAML` that parses strings as arbitrary YAML data structures and uses `deep.Equal` to generate a diff of the datastructures. Now, when a test fails, we'll get output like: ``` install_test.go:244: YAML mismatches install_output.golden: slice[32].map[spec].map[template].map[spec].map[containers].slice[3].map[image]: PolicyControllerImageName:PolicyControllerVersion != SomeOtherImage:PolicyControllerVersion ``` While testing this, it became apparent that several of our generated golden files were not actually valid YAML, due to the `LinkerdVersion` value being unset. This has been fixed. Signed-off-by: Oliver Gould <ver@buoyant.io>	2022-05-10 08:40:29 -07:00
Oliver Gould	7d1e4a6953	refactor: Split CRD & Control Plane upgrade logic (#8423 ) This change follows on `4f3c374`, which split the install logic for CRDs and the core control plane, by splitting the upgrade logic for the CRDs and the core control plane. Signed-off-by: Oliver Gould <ver@buoyant.io>	2022-05-04 16:11:48 -07:00
Oliver Gould	4f3c374bb7	refactor: Split CRD and control-plane installation (#8401 ) We currently have singular `install` and `render` functions, each of which takes a `crds` bool that completely alters the behavior of the function. This change splits this behavior into distinct functions so we have `installCRDs`/`renderCRDs` and `installControlPlane`/ `renderControlPlane`. Signed-off-by: Oliver Gould <ver@buoyant.io>	2022-05-03 17:21:27 -07:00
Alex Leong	820fac758c	Fix panic in install --ignore-cluster (#8377 ) Fixes #8364 When `linkerd install` is called with the `--ignore-cluster`, we pass `nil` for the `k8sAPI`. This causes a panic when using this client for validation. We add a conditional so that we skip this validation when the `k8sAPI` is `nil`. Signed-off-by: Alex Leong <alex@buoyant.io>	2022-05-02 12:06:48 -07:00
Alex Leong	6762dd28ac	Add --crds flag to install/upgrade and remove config/control-plane stages (#8251 ) Fixes: #8173 In order to support having custom resources in the default Linkerd installation, it is necessary to add a separate install step to install CRDs before the core install. The Linkerd Helm charts already accomplish this by having CRDs in a separate chart. We add this functionality to the CLI by adding a `--crds` flag to `linkerd install` and `linkerd upgrade` which outputs manifests for the CRDs only and remove the CRD manifests when the `--crds` flag is not set. To avoid a compounding of complexity, we remove the `config` and `control-plane` stages from install/upgrade. The effect of this is that we drop support for splitting up an install by privilege level (cluster admin vs Linkerd admin). The Linkerd install flow is now always a 2-step process where `linkerd install --crds` must be run first to install CRDs only and then `linkerd install` is run to install everything else. This more closely aligns the CLI install flow with the Helm install flow where the CRDs are a separate chart. Attempting to run `linkerd install` before the CRDs are installed will result in a helpful error message. Similarly, upgrade is also a 2-step process of `linkerd upgrade --crds` follow by `linkerd upgrade`. Signed-off-by: Alex Leong <alex@buoyant.io>	2022-04-28 09:36:14 -07:00
Oliver Gould	425a43def5	Enable gocritic linting (#7906 ) [gocritic][gc] helps to enforce some consistency and check for potential errors. This change applies linting changes and enables gocritic via golangci-lint. [gc]: https://github.com/go-critic/go-critic Signed-off-by: Oliver Gould <ver@buoyant.io>	2022-02-17 22:45:25 +00:00
Matei David	0d59864033	Remove usage of controllerImageVersion values field (#7883 ) Remove usage of controllerImageVersion values field This change removes the unused `controllerImageVersion` field, first from the tests, and then from the actual chart values structure. Note that at this point in time, it is impossible to use `--controller-image-version` through Helm, yet it still seems to be working for the CLI. * We configure the charts to use `linkerdVersionValue` instead of `controlPlaneImageVersion` (or default to it where appropriate). * We add the stringslicevar flag (i.e `--set`) to the flagset we use in upgrade tests. This means instead of testing value overrides through a dedicated flag, we can now make use of `--set` in upgrade tests. We first set the linkerdVersionValue in the install option and then override the policy controller image version and the linkerd controller image version to test flags work as expected. * We remove hardcoded values from healthcheck test. * We remove field from chart values struct. Signed-off-by: Matei David <matei@buoyant.io>	2022-02-17 15:19:08 +00:00
Brian Dunnigan	a8dbe4d1e0	Adding support for injecting Webhook CA bundles with cert-manager CA Injector (#7353 ) (#7354 ) * Adding support for injecting Webhook CA bundles with cert-manager CA Injector (#7353) Currently, users need to pass in the caBundle when doing a helm/CLI install. If the user is already using cert-manager to generate webhook certs, they can use the cert-manager CA injector to populate the caBundle for the Webhooks. Adding inectCaFrom and injectCaFromSecret options to every webhook alongside every caBundle option gives users the ability to add the cert-manager.io/inject-ca-from or cert-manager.io/inject-ca-from-secret annotations to the Webhooks specifying the Certificate or Secret to pull the CA from to accomplish ca bundle injection. Signed-off-by: Brian Dunnigan <bdun1013dev@gmail.com> Co-authored-by: Alejandro Pedraza <alejandro@buoyant.io>	2022-01-03 14:28:30 -05:00
Kevin Leimkuhler	d3c950d682	Add install error for runtime container check (#7468 ) When installing Linkerd on a cluster with the Docker container runtime, `proxyInit.runAsRoot` but be set to `true` in order for Linkerd to operate. This is checked two different ways: `linkerd check --pre` and `linkerd check`. #7457 discussed if it's better to emit this as a warning or error, but after some further discussion it makes more sense as a `linkerd install` runtime error so that a user cannot miss this configuration. It still remains as part of `linkerd check` in case more nodes are added that do not satisfy this condition, or Linkerd is installed through Helm. ```sh $ linkerd install there are nodes using the docker container runtime and proxy-init container must run as root user. try installing linkerd via --set proxyInit.runAsRoot=true $ linkerd install --set proxyInit.runAsRoot=false there are nodes using the docker container runtime and proxy-init container must run as root user. try installing linkerd via --set proxyInit.runAsRoot=true $ linkerd install --set proxyInit.runAsRoot="" there are nodes using the docker container runtime and proxy-init container must run as root user. try installing linkerd via --set proxyInit.runAsRoot=true $ linkerd install --set proxyInit.runAsRoot=true ... $ linkerd install --set proxyInit.runAsRoot=1 ... ``` Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2021-12-20 12:08:44 -05:00
Alejandro Pedraza	f9f3ebefa9	Remove namespace from charts and split them into `linkerd-crd` and `linkerd-control-plane` (#6635 ) Fixes #6584 #6620 #7405 # Namespace Removal With this change, the `namespace.yaml` template is rendered only for CLI installs and not Helm, and likewise the `namespace:` entry in the namespace-level objects (using a new `partials.namespace` helper). The `installNamespace` and `namespace` entries in `values.yaml` have been removed. There in the templates where the namespace is required, we moved from `.Values.namespace` to `.Release.Namespace` which is filled-in automatically by Helm. For the CLI, `install.go` now explicitly defines the contents of the `Release` map alongside `Values`. The proxy-injector has a new `linkerd-namespace` argument given the namespace is no longer persisted in the `linkerd-config` ConfigMap, so it has to be passed in. To pass it further down to `injector.Inject()` without modifying the `Handler` signature, a closure was used. ------------ Update: Merged-in #6638: Similar changes for the `linkerd-viz` chart: Stop rendering `namespace.yaml` in the `linkerd-viz` chart. The additional change here is the addition of the `namespace-metadata.yaml` template (and its RBAC), _not_ rendered in CLI installs, which is a Helm `post-install` hook, consisting on a Job that executes a script adding the required annotations and labels to the viz namespace using a PATCH request against kube-api. The script first checks if the namespace doesn't already have an annotations/labels entries, in which case it has to add extra ops in that patch. --------- Update: Merged-in the approved #6643, #6665 and #6669 which address the `linkerd2-cni`, `linkerd-multicluster` and `linkerd-jaeger` charts. Additional changes from what's already mentioned above: - Removes the install-namespace option from `linkerd install-cni`, which isn't found in `linkerd install` nor `linkerd viz install` anyways, and it would add some complexity to support. - Added a dependency on the `partials` chart to the `linkerd-multicluster-link` chart, so that we can tap on the `partials.namespace` helper. - We don't have any more the restriction on having the muticluster objects live in a separate namespace than linkerd. It's still good practice, and that's the default for the CLI install, but I removed that validation. Finally, as a side-effect, the `linkerd mc allow` subcommand was fixed; it has been broken for a while apparently: ```console $ linkerd mc allow --service-account-name foobar Error: template: linkerd-multicluster/templates/remote-access-service-mirror-rbac.yaml:16:7: executing "linkerd-multicluster/templates/remote-access-service-mirror-rbac.yaml" at <include "partials.annotations.created-by" $>: error calling include: template: no template "partials.annotations.created-by" associated with template "gotpl" ``` --------- Update: see helm/helm#5465 describing the current best-practice # Core Helm Charts Split This removes the `linkerd2` chart, and replaces it with the `linkerd-crds` and `linkerd-control-plane` charts. Note that the viz and other extension charts are not concerned by this change. Also note the original `values.yaml` file has been split into both charts accordingly. ### UX ```console $ helm install linkerd-crds --namespace linkerd --create-namespace linkerd/linkerd-crds ... # certs.yaml should contain identityTrustAnchorsPEM and the identity issuer values $ helm install linkerd-control-plane --namespace linkerd -f certs.yaml linkerd/linkerd-control-plane ``` ### Upgrade As explained in #6635, this is a breaking change. Users will have to uninstall the `linkerd2` chart and install these two, and eventually rollout the proxies (they should continue to work during the transition anyway). ### CLI The CLI install/upgrade code was updated to be able to pick the templates from these new charts, but the CLI UX remains identical as before. ### Other changes - The `linkerd-crds` and `linkerd-control-plane` charts now carry a version scheme independent of linkerd's own versioning, as explained in #7405. - These charts are Helm v3, which is reflected in the `Chart.yaml` entries and in the removal of the `requirements.yaml` files. - In the integration tests, replaced the `helm-chart` arg with `helm-charts` containing the path `./charts`, used to build the paths for both charts. ### Followups - Now it's possible to add a `ServiceProfile` instance for Destination in the `linkerd-control-plane` chart.	2021-12-10 15:53:08 -05:00
Tarun Pothulapati	92421d047a	core: use serviceAccountToken volume for pod authentication (#7117 ) Fixes #3260 ## Summary Currently, Linkerd uses a service Account token to validate a pod during the `Certify` request with identity, through which identity is established on the proxy. This works well and good, as Kubernetes attaches the `default` service account token of a namespace as a volume (unless overridden with a specific service account by the user). Catch here being that this token is aimed at the application to talk to the kubernetes API and not specifically for Linkerd. This means that there are [controls outside of Linkerd](https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/#use-the-default-service-account-to-access-the-api-server), to manage this service token, which users might want to use, [causing problems with Linkerd](https://github.com/linkerd/linkerd2/issues/3183) as Linkerd might expect it to be present. To have a more granular control over the token, and not rely on the service token that can be managed externally, [Bound Service Tokens](https://github.com/kubernetes/enhancements/tree/master/keps/sig-auth/1205-bound-service-account-tokens) can be used to generate tokens that are specifically for Linkerd, that are bound to a specific pod, along with an expiry. ## Background on Bounded Service Tokens This feature has been GA’ed in Kubernetes 1.20, and is enabled by default in most cloud provider distributions. Using this feature, Kubernetes can be asked to issue specific tokens for linkerd usage (through audience bound configuration), with a specific expiry time (as the validation happens every 24 hours when establishing identity, we can follow the same), bounded to a specific pod (meaning verification fails if the pod object isn’t available). Because of all these bounds, and not being able to use this token for anything else, This feels like the right thing to rely on to validate a pod to issue a certificate. ### Pod Identity Name We still use the same service account name as the pod identity (used with metrics, etc) as these tokens are all generated from the same base service account attached to the pod (could be defualt, or the user overriden one). This can be verified by looking at the `user` field in the `TokenReview` response. <details> <summary>Sample TokenReview response</summary> Here, The new token was created for the vault audience for a pod which had a serviceAccount token volume projection and was using the `mine` serviceAccount in the default namespace. ```json "kind": "TokenReview", "apiVersion": "authentication.k8s.io/v1", "metadata": { "creationTimestamp": null, "managedFields": [ { "manager": "curl", "operation": "Update", "apiVersion": "authentication.k8s.io/v1", "time": "2021-10-19T19:21:40Z", "fieldsType": "FieldsV1", "fieldsV1": {"f:spec":{"f:audiences":{},"f:token":{}}} } ] }, "spec": { "token": "....", "audiences": [ "vault" ] }, "status": { "authenticated": true, "user": { "username": "system:serviceaccount:default:mine", "uid": "889a81bd-e31c-4423-b542-98ddca89bfd9", "groups": [ "system:serviceaccounts", "system:serviceaccounts:default", "system:authenticated" ], "extra": { "authentication.kubernetes.io/pod-name": [ "nginx" ], "authentication.kubernetes.io/pod-uid": [ "ebf36f80-40ee-48ee-a75b-96dcc21466a6" ] } }, "audiences": [ "vault" ] } ``` </details> ## Changes - Update `proxy-injector` and install scripts to include the new projected Volume and VolumeMount. - Update the `identity` pod to validate the token with the linkerd audience key. - Added `identity.serviceAccountTokenProjection` to disable this feature. - Updated err'ing logic with `autoMountServiceAccount: false` to fail only when this feature is disabled. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-11-03 02:03:39 +05:30
Krzysztof Dryś	80bc61aab7	feat: remove cert expiry date in helm charts (#7056 ) Expiry date was not used anywhere in the code and yet it was required on install. All occurrences of `crtExpiry` (template variable) and `identity-issuer-expiry` (annotation) were removed. ## Validation. It seems that `identity-issuer-expiry` was only set and never read. After this change there is no mentions of `identity-issuer-expiry` (rg "identity-issuer-expiry"). There are occurrences of `crtExpiry`, but they are not relevant: ``` > rg crtExpiry pkg/tls/cred.go 99: if crtExpiryError(err) { 234:func crtExpiryError(err error) bool { ``` ## Backward compatibility Helm accepts "unknown" values. This change will not break existing pipelines installing/upgrading Linkerd using Helm. When someone specifies `identity.issuer.crtExpiry` (`--set identity.issuer.crtExpiry=$(date -v+8760H +"%Y-%m-%dT%H:%M:%SZ"`) it will be "just" ignored. Fixes #7024 Signed-off-by: Krzysztof Dryś <krzysztofdrys@gmail.com>	2021-10-08 11:32:05 -05:00
Krzysztof Dryś	d2e441dfe1	feat(linkerd): adding priority to linkerd (priorityclass) (#7005 ) New field: priorityClassName is added to Helm and cli command. Closes #6858 Signed-off-by: Krzysztof Dryś <krzysztofdrys@gmail.com>	2021-10-08 09:04:07 -05:00
Alejandro Pedraza	90f8c9ddf5	Remove `omitWebhookSideEffects` flag/setting (#6942 ) * Remove `omitWebhookSideEffects` flag/setting This was introduced back in #2963 to support k8s with versions before 1.12 that didn't support the `sideEffects` property in webhooks. It's been a while we no longer support 1.12, so we can safely drop this.	2021-09-22 17:03:26 -05:00
Alex Leong	8f15683177	Fix up docker build scripts (#6781 ) A few small improvements to our docker build scripts: * Centralized the list of docker images to a DOCKER_IMAGES variable defined in _docker.sh * Build scripts now honor the TAG variable, if defined * Unused docker-images script has been removed We also update the `--control-plane-version` Linkerd install flag to affect the policy controller version as well. Taken together, this enables the following workflow for building and deploying changes to individual Linkerd components. For example, suppose you wish to deploy changes which only affect the controller image: ```console # Begin by building all images at main with a dev tag > TAG=alex-dev bin/docker-build # OR begin by retagging all images from a recent release > bin/docker-retag-all edge-21.8.4 alex-dev # Make changes and then rebuild specific component > TAG=alex-dev bin/docker-build-controller # Load images into kind > TAG=alex-dev bin/image-load --kind --cluster alex # Install Linkerd > bin/linkerd install --control-plane-version alex-dev --proxy-version alex-dev \| k apply -f - ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2021-09-01 09:37:56 -07:00
Alex Leong	2851254966	Add admission controller to policy controller (#6696 ) We add a validating admission controller to the policy controller which validates `Server` resources. When a `Server` admission request is received, we look at all existing `Server` resources in the cluster and ensure that no other `Server` has an identical selector and port. Signed-off-by: Alex Leong <alex@buoyant.io> Co-authored-by: Oliver Gould <ver@buoyant.io>	2021-08-27 11:26:23 -07:00
Oliver Gould	b98c86700f	Import the linkerd-policy-controller (#6485 ) We've implemented a new controller--in Rust!--that implements discovery APIs for inbound server policies. This change imports this code from linkerd/polixy@25af9b5e. This policy controller watches nodes, pods, and the recently-introduced `policy.linkerd.io` CRD resources. It indexes these resources and serves a gRPC API that will be used by proxies to configure the inbound proxy for policy enforcement. This change introduces a new policy-controller container image and adds a container to the `Linkerd-destination` pod along with a `linkerd-policy` service to be used by proxies. This change adds a `policyController` object to the Helm `values.yaml` that supports configuring the policy controller at runtime. Proxies are not currently configured to use the policy controller at runtime. This will change in an upcoming proxy release.	2021-08-11 12:56:12 -07:00
Alejandro Pedraza	61f443ad05	Schedule heartbeat 10 mins after install (#5973 ) * Schedule heartbeat 10 mins after install ... for the Helm installation method, thus aligning it with the CLI installation method, to reduce the midnight peak on the receiving end. The logic added into the chart is now reused by the CLI as well. Also, set `concurrencyPolicy=Replace` so that when a job fails and it's retried, the retries get canceled when the next scheduled job is triggered. Finally, the go client only failed when the connection failed; successful connections with a non 200 response status were considered successful and thus the job wasn't retried. Fixed that as well.	2021-03-31 07:49:36 -05:00
Tarun Pothulapati	5c1a375a51	destination: pass opaque-ports through cmd flag (#5829 ) * destination: pass opaque-ports through cmd flag Fixes #5817 Currently, Default opaque ports are stored at two places i.e `Values.yaml` and also at `opaqueports/defaults.go`. As these ports are used only in destination, We can instead pass these values as a cmd flag for destination component from Values.yaml and remove defaultPorts in `defaults.go`. This means that users if they override `Values.yaml`'s opauePorts field, That change is propogated both for injection and also discovery like expected. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-03-01 16:00:20 +05:30
Alejandro Pedraza	b53dc3b400	Removed "do-not-edit" entries from values.yaml files (#5758 ) Fixes #5574 and supersedes #5660 - Removed from all the `values.yaml` files all those "do not edit" entries for annotation/label names, hard-coding them in the templates instead. - The `values.go` files got simplified as a result. - The `created-by` annotation was also refactored into a reusable partial. This means we had to add a `partials` dependency to multicluster.	2021-02-19 09:17:45 -05:00
Tarun Pothulapati	a393c42536	values: removal of .global field (#5699 ) * values: removal of .global field Fixes #5425 With the new extension model, We no longer need `Global` field as we don't rely on chart dependencies anymore. This helps us further cleanup Values, and make configuration more simpler. To make upgrades and the usage of new CLI with older config work, We add a new method called `config.RemoveGlobalFieldIfPresent` that is used in the upgrade and `FetchCurrentConfiguration` paths to remove global field and attach its child nodes if global is present. This is verified by the `TestFetchCurrentConfiguration`'s older test that has the global field. We also don't yet remove .global in some helm stable-upgrade tests for the initial install to work. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-02-11 23:38:34 +05:30
Tarun Pothulapati	d0d2e0ea7a	cli: add helm customization flags to core install (#5507 ) * cli: add helm customization flags to core install Fixes #5506 This branch adds helm way of customization through `set`, `set-string`, `values`, `set-files` flags for `linkerd install` cmd along with unit tests. For this to work, the helm v3 engine rendering helpers had to be used instead of our own wrapper type. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-01-21 22:49:50 +05:30
Eugene Formanenko	535a36af7c	Add log-format flag to control plane components (#5537 ) Fixes #5536 Signed-off-by: Eugene Formanenko <mo4islona@gmail.com>	2021-01-15 10:51:32 -05:00
Tarun Pothulapati	836c077898	viz: add render golden tests (#5433 ) * viz: add render golden tests This branch adds golden tests for the viz install. This would be useful to track changes in render as more changes are added. This also moves the common code that is used across extensions to generate diffs into `testutil` to be able to be used widely. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-01-12 11:59:16 +05:30
Tarun Pothulapati	2087c95dd8	viz: move some components into linkerd-viz (#5340 ) * viz: move some components into linkerd-viz This branch moves the grafana,prometheus,web, tap components into a new viz chart, following the same extension model that multi-cluster and jaeger follow. The components in viz are not injected during install time, and will go through the injector. The `viz install` does not have any cli flags to customize the install directly but instead follow the Helm way of customization by using flags such as `set`, `set-string`, `values`, `set-files`. Changes Include - Move `grafana`, `prometheus`, `web`, `tap` templates into viz extension. - Remove all add-on related charts, logic and tests w.r.t CLI & Helm. - Clean up `linkerd2/values.go` & `linkerd2/values.yaml` to not contain fields related to viz components. - Update `linkerd check` Healthchecks to not check for viz components. - Create a new top level `viz` directory with CLI logic and Helm charts. - Clean fields in the `viz/Values.yaml` to be in the `<component>.<property>` model. Ex: `prometheus.resources`, `dashboard.image.tag`, etc so that it is consistent everywhere. Testing ```bash # Install the Core Linkerd Installation ./bin/linkerd install \| k apply -f - # Wait for the proxy-injector to be ready # Install the Viz Extension ./bin/linkerd cli viz install \| k apply -f - # Customized Install ./bin/linkerd cli viz install --set prometheus.enabled=false \| k apply -f - ``` What is not included in this PR: - Move of Controller from core install into the viz extension. - Simplification and refactoring of the core chart i.e removing `.global`, etc. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-12-23 20:17:31 +05:30
Alejandro Pedraza	d661054795	Fix CLI install/upgrade overriding settings in HA (#5399 ) Fixes #5385 ## The problems - `linkerd install --ha` isn't honoring flags - `linkerd upgrade --ha` is overridding existing configs silently or failing with an error - Upgrading HA instances from before 2.9 to version 2.9.1 results in configs being overridden silently, or the upgrade fails with an error ## The cause The change in #5358 attempted to fix `linkerd install --ha` that was only applying some of the `values-ha.yaml` defaults, by calling `charts.NewValues(true)` and merging that with the values built from `values.yaml` overriden by the flags. It turns out the `charts.NewValues()` implementation was by itself merging against `values.yaml` and as a result any flag was getting overridden by its default. This also happened when doing `linkerd upgrade --ha` on an existing instance, which could result in silently overriding settings, or it could also fail loudly like for example when upgrading set up that has an external issuer (in this case the issuer cert won't be able to be read during upgrade and an error would occur as described in #5385). Finally, when doing `linkerd upgrade` (no --ha flag) on an HA install from before 2.9 results in configs getting overridden as well (silently or with an error) because in order to generate the `linkerd-config-overrides` secret, the original install flags are retrieved from `linkerd-config` via the `loadStoredValuesLegacy()` function which then effectively ends up performing a `linkerd upgrade` with all the flags used for `linkerd install` and falls into the same trap as above. ## The fix In `values.go` the faulting merging logic is not used anymore, so now `NewValues()` only returns the default values from `values.yaml` and doesn't require an argument anymore. It calls `readDefaults()` which now only returns the appropriate values depending on whether we're on HA or not. There's a new function `MergeHAValues()` that merges `values-ha.yaml` into the current values (it doesn't look into `values.yaml` anymore), which is only used when processing the `--ha` flag in `options.go`. ## How to test To replicate the issue try setting a custom setting and check it's not applied: ```bash linkerd install --ha --controller-log level debug \| grep log.level - -log-level=info ``` ## Followup This wasn't caught because we don't have HA integration tests. Now that our test infra is based on k3d, it should be easy to make such a test using a cluster with multiple nodes. Either that or issuing `linkerd install --ha` with additional configs and compare against a golden file.	2020-12-18 12:11:52 -05:00
Alex Leong	cdc57d1af0	Use linkerd-jaeger extension for control plane tracing (#5299 ) Now that tracing has been split out of the main control plane and into the linkerd-jaeger extension, we remove references to tracing from the main control plane including: * removing the tracing components from the main control plane chart * removing the tracing injection logic from the main proxy injector and inject CLI (these will be added back into the new injector in the linkerd-jaeger extension) * removing tracing related checks (these will be added back into `linkerd jaeger check`) * removing related tests We also update the `--control-plane-tracing` flag to configure the control plane components to send traces to the linkerd-jaeger extension. To make sure this works even when the linkerd-jaeger extension is installed in a non-default namespace, we also add a `--control-plane-tracing-namespace` flag which can be used to change the namespace that the control plane components send traces to. Note that for now, only the control plane components send traces; the proxies in the control plane do not. This is because the linkerd-jaeger injector is not yet available. However, this change adds the appropriate namespace annotations to the control plane namespace to configure the proxies to send traces to the linkerd-jaeger extension once the linkerd-jaeger injector is available. I tested this by doing the following: 1. bin/linkerd install \| kubectl apply -f - 1. bin/helm install jaeger jaeger/charts/jaeger 1. bin/linkerd upgrade --control-plane-tracing=true \| kubectl apply -f - 1. kubectl -n linkerd-jaeger port-forward svc/jaeger 16686 1. open http://localhost:16686 1. see traces from the linkerd control plane Signed-off-by: Alex Leong <alex@buoyant.io>	2020-12-08 14:34:26 -08:00
hodbn	92eb174e06	Add safe accessor for Global in linkerd-config (#5269 ) CLI crashes if linkerd-config contains unexpected values. Add a safe accessor that initializes an empty Global on the first access. Refactor all accesses to use the newly introduced accessor using gopls. Add test for linkerd-config data without Global. Fixes #5215 Co-authored-by: Itai Schwartz <yitai27@gmail.com> Signed-off-by: Hod Bin Noon <bin.noon.hod@gmail.com>	2020-11-23 12:45:58 -08:00
Tarun Pothulapati	b389054d53	cli: Don't check for SAN in root and intermediate certs (#5237 ) As discussed in #5228, it is not correct for root and intermediate certs to have SAN. This PR updates the check to not verify the intermediate issuer cert with the identity dns name (which checks with SAN and not CN as the the `verify` func is used to verify leaf certs and not root and intermediate certs). This PR also avoids setting a SAN field when generating certs in the `install` command. Fixes #5228	2020-11-18 15:30:39 -08:00
Tarun Pothulapati	262d5e041c	charts: Do not store .component in linkerd-config (#5144 ) * charts: Do not store .component in linkerd-config This removes the `.component` fields from `Values.go` and also prevents them from being emitted into `linkerd-config` by attaching them into a temporary variable during injection. This also simplies inbound and outbound Skip ports helm logic and adds quotes to them. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-11-02 20:41:37 +05:30
Alejandro Pedraza	177669b377	Remove code refs to controllerImageVersion (#5119 ) Followup to #5100 We had both `controllerImageVersion` and `global.controllerImageVersion` configs, but only the latter was taken into account in the chart templates, so this change removes all of its references.	2020-10-21 13:40:25 -05:00
Oliver Gould	84b1a826bd	Replace global.proxy.destinationGetNetworks with global.clusterNetworks (#5110 ) There is no longer a proxy config `DESTINATION_GET_NETWORKS`. Instead of reflecting this implementation in our values.yaml, this changes this variable to the more general `clusterNetworks` to emphasize its similarity to `clusterDomain` for the purposes of discovery.	2020-10-20 19:05:31 -07:00
Oliver Gould	f0820bdfbf	inject: Use 'quote' function in proxy template (#5107 ) As described in #5105, it's not currently possible to set the proxy log level to `off`. The proxy injector's template does not quote the log level value, and so the `off` value is handled as `false`. Thanks, YAML. This change updates the proxy template to use helm's `quote` function throughout, replacing manually quoted values and fixing the quoting for the log level value. We also remove the default logFormat value, as the default is specified in values.yaml.	2020-10-20 15:36:10 -07:00
Oliver Gould	c5d3b281be	Add 100.64.0.0/10 to the set of discoverable networks (#5099 ) It appears that Amazon can use the `100.64.0.0/10` network, which is technically private, for a cluster's Pod network. Wikipedia describes the network as: > Shared address space for communications between a service provider > and its subscribers when using a carrier-grade NAT. In order to avoid requiring additional configuration on EKS clusters, we should permit discovery for this network by default.	2020-10-19 12:59:44 -07:00
Oliver Gould	222c11400b	tests: Set proxy log to linkerd=debug (#5081 ) The proxy log level `linkerd2_proxy=debug` only enables logging from a few proxy modules. We should instead use the more general `linkerd=debug`.	2020-10-14 15:31:03 -07:00
Alex Leong	41c1fc65b0	Upgrade using config overrides (#5005 ) This is a major refactor of the install/upgrade code which removes the config protobuf and replaces it with a config overrides secret which stores overrides to the values struct. Further background on this change can be found here: https://github.com/linkerd/linkerd2/discussions/4966 Note: as-is this PR breaks injection. There is work to move injection onto a Values-based config which must land before this can be merged. A summary of the high level changes: * the install, global, and proxy fields of linkerd-config ConfigMap are no longer populated * the CLI install flow now follows these simple steps: * load default Values from the chart * update the Values based on the provided CLI flags * render the chart with these values * also render a Secret/linkerd-config-overrides which describes the values which have been changed from their defaults * the CLI upgrade flow now follows these simple stesp: * load the default Values from the chart * if Secret/linkerd-config-overrides exists, apply the overrides onto the values * otherwise load the legacy ConfigMap/linkerd-config and use it to updates the values * further update the values based on the provided CLI flags * render the chart and the Secret/linkerd-config-overrides as above * Helm install and upgrade is unchanged Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-12 14:23:14 -07:00
Alex Leong	530d8beccc	Add podLabels and podAnnotations to Values struct (#5056 ) PR https://github.com/linkerd/linkerd2/pull/5027 added `podLabels` and `podAnnotations` to `values.yaml` to allow setting labels and annotations on pods in the Helm template. However, these fields were not added to the `Values` struct in `Values.go`. This means that these fields were not serialized out to the `linkerd-config` or to the `linkerd-config-overrides`. Furthermore, in PR #5005 which moves to using the `Values` struct more authoritatively, the `podLabels` and `podAnnotations` fields would not take effect at all. Add these fields to the `Values` struct and update all test fixtures accordingly. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-09 09:27:28 -07:00

1 2 3

148 Commits