linkerd2

Commit Graph

Author	SHA1	Message	Date
Tarun Pothulapati	faf77798f0	Update check to use new linkerd-config.values (#5023 ) This branch updates the check functionality to read the new `linkerd-config.values` which contains the full Values struct showing the current state of the Linkerd installation. (being added in #5020 ) This is done by adding a new `FetchCurrentConfiguraiton` which first tries to get the latest, if not falls back to the older `linkerd-config` protobuf format.` Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-10-01 11:19:25 -07:00
Alex Leong	6452fbbdfa	Add values to linkerd-config (#5020 ) Fixes #5008 We add a `values` file to the `ConfigMap/linkerd-config` resource. This file holds the full Values which were used to render the chart except that private data such as the identity issuer key are redacted. This file is currently unused but will eventually be used by CLI commands such as `check` and `inject` which need to load Linkerd's configuration (as described in #5009). This is one step in a larger effort to eventually get rid of the other files in `ConfigMap/linkerd-config`. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-09-30 11:37:25 -07:00
Alex Leong	788479b7b0	Fix upgrade test (#5021 ) A conflict between #4911 and #4737 caused unit test to be broken. #4737 added a new test to `upgrade_test.go` and the changes in #4911 updated all of these test to ignore differences in the config overrides secret. Since these two PRs merged in parallel, the new test was missing this update. Update the new test to also ignore differences in the config overrides secret as the other ones do. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-09-29 12:41:42 -07:00
Alex Leong	1784f0643e	Add linkerd-config-overrides secret (#4911 ) This PR adds a new secret to the output of `linkerd install` called `linkerd-config-overrides`. This is the first step towards simplifying the configuration of the linkerd install and upgrade flow through the CLI. This secret contains the subset of the values.yaml which have been overridden. In other words, the subset of values which differ from their default values. The idea is that this will give us a simpler way to produce the `linkerd upgrade` output while still persisting options set during install. This will eventually replace the `linkerd-config` configmap entirely. This PR only adds and populates the new secret. The secret is not yet read or used anywhere. Subsequent PRs will update individual control plane components to accept their configuration through flags and will update the `linkerd upgrade` flow to use this secret instead of the `linkerd-config` configmap. This secret is only generated by the CLI and is not present or required when installing or upgrading with Helm. Here are sample contents of the secret, base64 decoded. Note that identity tls context is saved as an override so that it can be persisted across updates. Since these fields contain private key material, this object must be a secret. This secret is only used for upgrades and thus only the CLI needs to be able to read it. We will not create any RBAC bindings to grant service accounts access to this secret. ``` global: identityTrustAnchorsPEM: \| -----BEGIN CERTIFICATE----- MIIBhDCCASmgAwIBAgIBATAKBggqhkjOPQQDAjApMScwJQYDVQQDEx5pZGVudGl0 eS5saW5rZXJkLmNsdXN0ZXIubG9jYWwwHhcNMjAwODI1MjMzMTU3WhcNMjEwODI1 MjMzMjE3WjApMScwJQYDVQQDEx5pZGVudGl0eS5saW5rZXJkLmNsdXN0ZXIubG9j YWwwWTATBgcqhkjOPQIBBggqhkjOPQMBBwNCAAQ0e7IPBlVZ03TL8UVlODllbh8b 2pcM5mbtSGgpX9z0l3n5M70oHn715xu2szh63oBjPl2ZfOA5Bd43cJIksONQo0Iw QDAOBgNVHQ8BAf8EBAMCAQYwHQYDVR0lBBYwFAYIKwYBBQUHAwEGCCsGAQUFBwMC MA8GA1UdEwEB/wQFMAMBAf8wCgYIKoZIzj0EAwIDSQAwRgIhAI7Sy8P+3TYCJBlK pIJSZD4lGTUyXPD4Chl/FwWdFfvyAiEA6AgCPbNCx1dOZ8RpjsN2icMRA8vwPtTx oSfEG/rBb68= -----END CERTIFICATE----- heartbeatSchedule: '42 23 * * * ' identity: issuer: crtExpiry: "2021-08-25T23:32:17Z" tls: crtPEM: \| -----BEGIN CERTIFICATE----- MIIBhDCCASmgAwIBAgIBATAKBggqhkjOPQQDAjApMScwJQYDVQQDEx5pZGVudGl0 eS5saW5rZXJkLmNsdXN0ZXIubG9jYWwwHhcNMjAwODI1MjMzMTU3WhcNMjEwODI1 MjMzMjE3WjApMScwJQYDVQQDEx5pZGVudGl0eS5saW5rZXJkLmNsdXN0ZXIubG9j YWwwWTATBgcqhkjOPQIBBggqhkjOPQMBBwNCAAQ0e7IPBlVZ03TL8UVlODllbh8b 2pcM5mbtSGgpX9z0l3n5M70oHn715xu2szh63oBjPl2ZfOA5Bd43cJIksONQo0Iw QDAOBgNVHQ8BAf8EBAMCAQYwHQYDVR0lBBYwFAYIKwYBBQUHAwEGCCsGAQUFBwMC MA8GA1UdEwEB/wQFMAMBAf8wCgYIKoZIzj0EAwIDSQAwRgIhAI7Sy8P+3TYCJBlK pIJSZD4lGTUyXPD4Chl/FwWdFfvyAiEA6AgCPbNCx1dOZ8RpjsN2icMRA8vwPtTx oSfEG/rBb68= -----END CERTIFICATE----- keyPEM: \| -----BEGIN EC PRIVATE KEY----- MHcCAQEEIJaqjoDnqkKSsTqJMGeo3/1VMfJTBsMEuMWYzdJVxIhToAoGCCqGSM49 AwEHoUQDQgAENHuyDwZVWdN0y/FFZTg5ZW4fG9qXDOZm7UhoKV/c9Jd5+TO9KB5+ 9ecbtrM4et6AYz5dmXzgOQXeN3CSJLDjUA== -----END EC PRIVATE KEY----- ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2020-09-29 08:01:36 -07:00
Lutz Behnke	de098cd52d	make api service secrets compatible to cert manager (#4737 ) Currently the secrets for the proxy-injector, sp-validator webhooks and tap API service are using the Opaque secret type and linkerd-specific field names. This makes it impossible to use cert-manager (https://github.com/jetstack/cert-manager) to provisions and rotate the secrets for these services. This change converts the secrets defined in the linkerd2 helm charts and the controller use the kubernetes.io/tls format instead. This format is used for secrets containing the generated secrets by cert-manager. Signed-off-by: Lutz Behnke <lutz.behnke@finleap.com>	2020-09-29 09:17:09 -05:00
Tarun Pothulapati	d0caaa86c4	Bump k8s client-go to v0.19.2 (#5002 ) Fixes #4191 #4993 This bumps Kubernetes client-go to the latest v0.19.2 (We had to switch directly to 1.19 because of this issue). Bumping to v0.19.2 required upgrading to smi-sdk-go v0.4.1. This also depends on linkerd/stern#5 This consists of the following changes: - Fix ./bin/update-codegen.sh by adding the template path to the gen commands, as it is needed after we moved to GOMOD. - Bump all k8s related dependencies to v0.19.2 - Generate CRD types, client code using the latest k8s.io/code-generator - Use context.Context as the first argument, in all code paths that touch the k8s client-go interface Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-28 12:45:18 -05:00
Kevin Leimkuhler	2ec5245d67	Add configuration for opaque ports (#4972 ) ## Motivation Closes #4950 ## Solution Add the `config.linkerd.io/opaque-ports` annotation to either a namespace or pod spec to set the proxy `LINKERD2_PROXY_INBOUND_PORTS_DISABLE_PROTOCOL_DETECTION` environment variable. Currently this environment variable is not used by the proxy, but will be addressed by #4938. ## Valid values Ports: `config.linkerd.io/opaque-ports: 4322,3306` Port ranges: `config.linkerd.io/opaque-ports: 4320-4325` Mixed ports and port ranges: `config.linkerd.io/opaque-ports: 4320-4325` If the pod has named ports such as: ``` - name: nginx image: nginx:latest ports: - name: nginx-port containerPort: 80 protocol: TCP ``` The name can also be used as a value: `config.linkerd.io/opaque-ports: nginx-port` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-09-25 15:36:12 -04:00
Nil	69ca673682	Introduce support for authenticated docker registries using imagePullSecrets, Fixes #4413 (#4898 ) * Introduce support for authenticated docker registries using imagePullSecrets Problem: Private Docker Registries are not supported for the moment as detailed in issue #4413 Solution: Every Service Account of linkerd subcomponents are Attached with imagePullSecrets, which in turn can then pulls the docker images from authenticated private registries using them. The imagePullSecret is configured in global.imagePullSecret parameter of values.yaml like imagePullSecret: - name: <name-of-private-registry-secret-resource> Fixes #4413 Signed-off-by: Nilakhya <nilakhya@hotmail.com>	2020-09-23 08:49:35 -05:00
Tarun Pothulapati	c328de902b	CNI: Use skip ports configuration in CNI (#4974 ) * CNI: Use skip ports configuration in CNI This PR updates the install and `cmdAdd` workflow (which is called for each new Pod creation) to retrieve and set the configured Skip Ports. This also updates the `cmdAdd` workflow to check if the new pod is a control plane Pod, and adds `443` to OutBoundSkipPort so that 443 (used with k8s API) is skipped as it was causing errors because a resolve lookup was happening for them which is not intended.	2020-09-23 13:00:22 +05:30
OlivierB	f599bf9b10	Helm chart - linkerd2-collector : enable jaeger receiver (#4783 ) Fixes #4778 Signed-off-by: Olivier Boudet <o.boudet@gmail.com>	2020-09-21 12:17:04 -07:00
Tarun Pothulapati	5998728158	Add `dest-cni-bin-dir` flag in install-cni (#4968 ) Currently, This field has to be configured to make CNI work in GKE clusters as thats where the binaries have to be stored. This was configurable through Helm, but the same can be allowed through the CLI too Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-15 17:13:12 -05:00
Tarun Pothulapati	f75b9fe374	tracing: Move default values into addon-chart (#4951 ) * tracing: Move default values into chart This branch updates the tracing add-on's values into their own chart's values.yaml (just like grafana and prometheus). This prevents them from being saved into `linkerd-config-addons` where only the overridden values are stored. Thus allowing us to change the defaults. This also - Updates the check command to fall back to default values, if there are no overridden name fields. - Updates jaeger to `1.19.2` Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-15 15:19:25 -05:00
Alejandro Pedraza	ccf027c051	Push docker images to ghcr.io instead of gcr.io (#4953 ) * Push docker images to ghcr.io instead of gcr.io The `cloud_integration.yml` and `release.yml` workflows were modified to log into ghcr.io, and remove the `Configure gcloud` step which is no longer necessary. Note that besides the changes to cloud_integration.yml and release.yml, there was a change to the upgrade-stable integration test so that we do linkerd upgrade --addon-overwrite to reset the addons settings because in stable-2.8.1 the Grafana image was pegged to gcr.io/linkerd-io/grafana in linkerd-config-addons. This will need to be mentioned in the 2.9 upgrade notes. Also the egress integration test has a debug container that now is pegged to the edge-20.9.2 tag. Besides that, the other changes are just a global search and replace (s/gcr.io\/linkerd-io/ghcr.io\/linkerd/).	2020-09-10 15:16:24 -05:00
Oliver Gould	7ee638bb0c	inject: Configure the proxy to discover profiles for unnamed services (#4960 ) The proxy performs endpoint discovery for unnamed services, but not service profiles. The destination controller and proxy have been updated to support lookups for unnamed services in linkerd/linkerd2#4727 and linkerd/linkerd2-proxy#626, respectively. This change modifies the injection template so that the `proxy.destinationGetNetworks` configuration enables profile discovery for all networks on which endpoint discovery is permitted.	2020-09-10 12:44:00 -07:00
Zahari Dichev	084bb678c7	Perform TLS checks on injector, sp validator and tap (#4924 ) * Check sp-validator,proxy-injector and tap certs Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-09-10 11:21:23 -05:00
Tarun Pothulapati	c4f8ba270d	Generate Identity certs with alternate domain names (#4920 ) Updating only the go 1.15 version, makes the upgrades fail from older versions, as the identity certs do not have that setting and go 1.15 expects them. This PR upgrades the cert generation code to have that field, allowing us to move to go 1.15 in later versions of Linkerd.	2020-09-03 22:33:10 +05:30
Zahari Dichev	77c88419b8	Make destination and identity services headless (#4923 ) * Make destination and identity svcs headless Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-09-02 14:53:38 -05:00
Zhou Hao	55689044cb	add os.RemoveAll err verification (#4885 ) Signed-off-by: Zhou Hao <zhouhao@cn.fujitsu.com>	2020-08-31 13:58:13 -07:00
Ali Ariff	5186383c81	Add ARM64 Integration Test (#4897 ) * Add ARM64 Integration Test Signed-off-by: Ali Ariff <ali.ariff12@gmail.com>	2020-08-28 10:38:40 -07:00
Tarun Pothulapati	c9c5d97405	Remove SMI-Metrics charts and commands (#4843 ) Fixes #4790 This PR removes both the SMI-Metrics templates along with the experimental sub-commands. This also removes pkg `smi-metrics` as there is no direct use of it without the commands. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-08-24 14:35:33 -07:00
Matei David	7ed904f31d	Enable endpoint slices when upgrading through CLI (#4864 ) ## What/How @adleong pointed out in #4780 that when enabling slices during an upgrade, the new value does not persist in the `linkerd-config` ConfigMap. I took a closer look and it seems that we were never overwriting the values in case they were different. * To fix this, I added an if block when validating and building the upgrade options -- if the current flag value differs from what we have in the ConfigMap, then change the ConfigMap value. * When doing so, I made sure to check that if the cluster does not support `EndpointSlices` yet the flag is set to true, we will error out. This is done similarly (copy&paste similarily) to what's in the install part. * Additionally, I have noticed that the helm ConfigMap template stored the flag value under `enableEndpointSlices` field name. I assume this was not changed in the initial PR to reflect the changes made in the protocol buffer. The API (and thus the CLI) uses the field name `endpointSliceEnabled` instead. I have changed the config template so that helm installations will use the same field, which can then be used in the destination service or other components that may implement slice support in the future. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-08-24 14:34:50 -07:00
Matei David	f797ab1e65	service topologies: topology-aware service routing (#4780 ) [Link to RFC](https://github.com/linkerd/rfc/pull/23) ### What --- * PR that puts together all past pieces of the puzzle to deliver topology-aware service routing, as specified in the [Kubernetes docs](https://kubernetes.io/docs/concepts/services-networking/service-topology/) but with a much better load balancing algorithm and all the coolness of linkerd :) * The first piece of this PR is focused on adding topology metadata: topology preference for services and topology `<k,v>` pairs for endpoints. * The second piece of this PR puts together the new context format and fetching the source node topology metadata in order to allow for endpoints filtering. * The final part is doing the filtering -- passing all of the metadata to the listener and on every `Add` filtering endpoints based on the topology preference of the service, topology `<k,v>` pairs of endpoints and topology of the source (again `<k,v>` pairs). ### How --- * Collecting metadata: - Services do not have values for topology keys -- the topological keys defined in a service's spec are only there to dictate locality preference for routing; as such, I decided to store them in an array, they will be taken exactly as they are found in the service spec, this ensures we respect the preference order. - For EndpointSlices, we are using a map -- an EndpointSlice has locality information in the form of `<k,v>` pair, where the key is a topological key (similar to what's listed in the service) and the value is the locality information -- e.g `hostname: minikube`. For each address we now have a map of topology values which gets populated when we translate the endpoints to an address set. Because normal Endpoints do not have any topology information, we create each address with an empty map which is subsequently populated ONLY for slices in the `endpointSliceToAddressSet` function. * Filtering endpoints: - This was a tricky part and filled me with doubts. I think there are a few ways to do this, but this is how I "envisioned" it. First, the `endpoint_translator.go` should be the one to do the filtering; this means that on subscription, we need to feed all of the relevant metadata to the listener. To do this, I created a new function `AddTopologyFilter` as part of the listener interface. - To complement the `AddTopologyFilter` function, I created a new `TopologyFilter` struct in `endpoints_watcher.go`. I then embedded this structure in all listeners that implement the interface. The structure holds the source topology (source node), a boolean to tell if slices are activated in case we need to double check (or write tests for the function) and the service preference. We create the filter on Subscription -- we have access to the k8s client here as well as the service, so it's the best point to collect all of this data together. Addresses all have their own topology added to them so they do not have to be collected by the filter. - When we add a new set of addresses, we check to see if slices are enabled -- chances are if slices are enabled, service topology might be too. This lets us skip this step if the latest version is not adopted. Prior to sending an `Add` we filter the endpoints -- if the preference is registered by the filter we strictly enforce it, otherwise nothing changes. And that's pretty much it. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-08-18 11:11:09 -07:00
Josh Soref	72aadb540f	Spelling (#4872 ) This PR corrects misspellings identified by the [check-spelling action](https://github.com/marketplace/actions/check-spelling). The misspellings have been reported at `aaf440489e (commitcomment-41423663)` The action reports that the changes in this PR would make it happy: `5b82c6c5ca` Note: this PR does not include the action. If you're interested in running a spell check on every PR and push, that can be offered separately. Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-08-12 21:59:50 -07:00
Alex Leong	c2d8c16509	Add multicluster uninstall (#4840 ) Fixes #4708 Adds a `linkerd multicluster uninstall` command which outputs the manifests required to uninstall the mutlicluster components. This command first checks that no links exist and advises that any links must be removed with `linkerd multicluster unlink` before proceeding. Typical usage is: ``` linkerd multicluster uninstall \| kubectl delete -f - ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2020-08-11 17:04:04 -07:00
Alex Leong	ae55a1aded	Turn on multicluster checks when --multicluster flag is used (#4817 ) When the Link CRD does not exist, multicluster checks in `linkerd check` will be skipped. The `--multicluster` flag is intended to force these checks on, but was being ignored. We update the options to force the multicluster checks on when the `--multicluster` flag is used, as intended. Now when `linkerd check --multicluster` is run on a cluster without the multicluster support installed, it gives the following output: ``` linkerd-multicluster -------------------- × Link CRD exists multicluster.linkerd.io/Link CRD is missing: the server could not find the requested resource see https://linkerd.io/checks/#l5d-multicluster-link-crd-exists for hints Status check results are × ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2020-08-11 17:02:52 -07:00
Alejandro Pedraza	4876a94ed0	Update proxy-init version to v1.3.6 (#4850 ) Supersedes #4846 Bump proxy-init to v1.3.6, containing CNI fixes and support for multi-arch builds. #4846 included this in v1.3.5 but proxy.golang.org refused to update the modified SHA	2020-08-11 11:54:00 -05:00
Alex Leong	f4000afaf3	Refactor upgrade tests to remove use of golden files (#4860 ) The upgrade tests were failing due to hardcoded certificates which had expired. Additionally, these tests contained large swaths of yaml that made it very difficult to understand the semantics of each test case and even more difficult to maintain. We greatly improve the readability and maintainability of these tests by using a slightly different approach. Each test follows this basic structure: * Render an install manifest * Initialize a fake k8s client with the install manifest (and sometimes additional manifests) * Render an upgrade manifest * Parse the manifests as yaml tree structures * Perform a structured diff on the yaml tree structured and look for expected and unexpected differences The install manifests are generated dynamically using the regular install flow. This means that we no longer need large sections of hardcoded yaml in the tests themselves. Additionally, we now asses the output by doing a structured diff against the install manifest. This means that we no longer need golden files with explicit expected output. All test cases were preserved except for the following: * Any test cases related to multiphase install (config/control plane) were not replicated. This flow doesn't follow the same pattern as the tests above because the install and upgrade manifests are not expected to be the same or similar. I also felt that these tests were lower priority because the multiphase install/upgrade feature does not seem to be very popular and is a potential candidate for deprecation. * Any tests involving upgrading from a very old config were not replicated. The code to generate these old style configs is no longer present in the codebase so in order to test this case, we would need to resort to hardcoded install manifests. These tests also seemed low priority to me because Linkerd versions that used the old config are now over 1 year old so it may no longer be critical that we support upgrading from them. We generally recommend that users upgrading from an old version of Linkerd do so by upgrading through each major version rather than directly to the latest. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-08-11 09:22:29 -07:00
Ali Ariff	ae8bb0e26e	Release ARM CLI artifacts (#4841 ) * When releasing, build and upload the amd64, arm64 and arm architectures builds for the CLI * Refactored `Dockerfile-bin` so it has separate stages for single and multi arch builds. The latter stage is only used for releases. Signed-off-by: Ali Ariff <ali.ariff12@gmail.com>	2020-08-11 09:25:58 -05:00
Tarun Pothulapati	7e5804d1cf	grafana: move default values into values file (#4755 ) This PR moves default values into add-on specific values.yaml thus allowing us to update default values as they would not be present in linkerd-config-addons cm. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-08-06 13:57:28 -07:00
cpretzer	d69974a70c	Change log output for failed namespace lookup (#4824 ) * Change log output for failed namespace lookup Signed-off-by: Charles Pretzer <charles@buoyant.io>	2020-08-05 21:03:54 -07:00
Paul Balogh	62d54838b8	Ensure and update debug image during upgrade (#4823 ) Some installations upgrading from versions prior to 2.7.x may have missing debug image name and version. This fix ensures that the default values are in place for this scenario and additionally upgrades the version of debug image with the control plane version. Signed-off-by: Paul Balogh <javaducky@gmail.com>	2020-08-05 11:39:29 -07:00
Ali Ariff	61d7dedd98	Build ARM docker images (#4794 ) Build ARM docker images in the release workflow. # Changes: - Add a new env key `DOCKER_MULTIARCH` and `DOCKER_PUSH`. When set, it will build multi-arch images and push them to the registry. See https://github.com/docker/buildx/issues/59 for why it must be pushed to the registry. - Usage of `crazy-max/ghaction-docker-buildx ` is necessary as it already configured with the ability to perform cross-compilation (using QEMU) so we can just use it, instead of manually set up it. - Usage of `buildx` now make default global arguments. (See: https://docs.docker.com/engine/reference/builder/#automatic-platform-args-in-the-global-scope) # Follow-up: - Releasing the CLI binary file in ARM architecture. The docker images resulting from these changes already build in the ARM arch. Still, we need to make another adjustment like how to retrieve those binaries and to name it correctly as part of Github Release artifacts. Signed-off-by: Ali Ariff <ali.ariff12@gmail.com>	2020-08-05 11:14:01 -07:00
Alex Leong	381f237f69	Add multicluster unlink command (#4802 ) Fixes #4707 In order to remove a multicluster link, we add a `linkerd multicluster unlink` command which produces the yaml necessary to delete all of the resources associated with a `linkerd multicluster link`. These are: * the link resource * the service mirror controller deployment * the service mirror controller's RBAC * the probe gateway mirror for this link * all mirror services for this link This command follows the same pattern as the `linkerd uninstall` command in that its output is expected to be piped to `kubectl delete`. The typical usage of this command is: ``` linkerd --context=source multicluster unlink --cluster-name=foo \| kubectl --context=source delete -f - ``` This change also fixes the shutdown lifecycle of the service mirror controller by properly having it listen for the shutdown signal and exit its main loop. A few alternative designs were considered: I investigated using owner references as suggested [here](https://github.com/linkerd/linkerd2/issues/4707#issuecomment-653494591) but it turns out that owner references must refer to resources in the same namespace (or to cluster scoped resources). This was not feasible here because a service mirror controller can create mirror services in many different namespaces. I also considered having the service mirror controller delete the mirror services that it created during its own shutdown. However, this could lead to scenarios where the controller is killed before it finishes deleting the services that it created. It seemed more reliable to have all the deletions happen from `kubectl delete`. Since this is the case, we avoid having the service mirror controller delete mirror services, even when the link is deleted, to avoid the race condition where the controller and CLI both attempt to delete the same mirror services and one of them fails with a potentially alarming error message. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-08-04 16:21:59 -07:00
Rajat Jindal	cbcc305b78	Fixes unit tests when default kubeconfig namespace is not "default" (#4825 ) Use defaultNamespace from kubeconfig context Fixes #4779 Signed-off-by: Rajat Jindal <rajatjindal83@gmail.com>	2020-08-03 11:27:02 -07:00
cpretzer	670caaf8ff	Update to proxy-init v1.3.4 (#4815 ) Signed-off-by: Charles Pretzer <charles@buoyant.io>	2020-07-30 15:58:58 -05:00
Alex Leong	a1543b33e3	Add support for service-mirror selectors (#4795 ) * Add selector support Signed-off-by: Alex Leong <alex@buoyant.io> * Removed unused labels Signed-off-by: Alex Leong <alex@buoyant.io>	2020-07-30 10:07:14 -07:00
Tarun Pothulapati	6307868f3d	bump prometheus to the latest v2.19.3 (#4811 ) * bump prometheus to the latest v2.19.3 latest prometheus version shows a lot of decrease in the memory usage and other benefits Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-07-30 12:06:59 -05:00
Alexander Berger	4ffea3ba08	CNI add support for priorityClassName (#4742 ) * CNI add support for priorityClassName As requested in #2981 one should be able to optionally define a priorityClassName for the linkerd2 pods. With this commit support for priorityClassName is added to the CNI plugin helm chart as well as to the cli command for installing the CNI plugin. Also added an `installNamespace` Helm option for the CNI installation. Implements part of #2981. Signed-off-by: alex.berger@nexiot.ch <alex.berger@nexiot.ch>	2020-07-30 10:43:06 -05:00
Naseem	96f662dfac	replace linkerd.io/helm-release-version annotation (#4645 ) Replace mechanism to automatically roll deployments when secret content changes. As per https://helm.sh/docs/howto/charts_tips_and_tricks/\#automatically-roll-deployments this is a recommended approach to dealing with config changes and rolling deployments upon them. Signed-off-by: Naseem <naseem@transit.app>	2020-07-28 21:48:17 -05:00
Tarun Pothulapati	c68ab23ab2	Add global.prometheusUrl field for byop use-case (#4390 ) This pr adds `globa.prometheusUrl` field which will be used to configure publlic-api, hearbeat, grafana, etc (i,e query path) to use a external Prometheus.	2020-07-28 12:26:34 +05:30
Matt Miller	fc33b9b9aa	support overriding inbound and outbound connect timeouts. (#4759 ) * support overriding inbound and outbound connect timeouts. * add validation on user provided TCP connect timeouts * convert valid time values into ms Signed-off-by: Matt Miller <mamiller@rosettastone.com>	2020-07-27 13:56:21 -07:00
memory	d2f547d812	Add sidecar container support for linkerd-prometheus helm chart (#4761 ) * Add sidecar container support for linkerd-prometheus Adds a new setting to the Prometheus' Helm config, allowing adding any kind of sidecar containers to the main container. The specific use case that inspired this was for exporting data from Prometheus to external systems (e.g. cloudwatch, stackdriver, datadog) using a process that watches the prometheus write-ahead log (WAL). Signed-off-by: Nathan J. Mehl <n@oden.io>	2020-07-27 14:26:37 -05:00
Matei David	1c197b14e7	Change destination context token format (#4771 ) Add a new structure on the destination controller side to keep track of contextual information. The token format has been changed from ns:<namespace> to a JSON format so that more variables can be encdoed in the token. As part of this PR, a new field 'nodeName' has been added to help with service topologies. Fixes #4498 Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-27 09:49:48 -07:00
Alex Leong	d540e16c8b	Make service mirror controller per target cluster (#4710 ) This PR removes the service mirror controller from `linkerd mc install` to `linkerd mc link`, as described in https://github.com/linkerd/rfc/pull/31. For fuller context, please see that RFC. Basic multicluster functionality works here including: * `linkerd mc install` installs the Link CRD but not any service mirror controllers * `linkerd mc link` creates a Link resource and installs a service mirror controller which uses that Link * The service mirror controller creates and manages mirror services, a gateway mirror, and their endpoints. * The `linkerd mc gateways` command lists all linked target clusters, their liveliness, and probe latences. * The `linkerd check` multicluster checks have been updated for the new architecture. Several checks have been rendered obsolete by the new architecture and have been removed. The following are known issues requiring further work: * the service mirror controller uses the existing `mirror.linkerd.io/gateway-name` and `mirror.linkerd.io/gateway-ns` annotations to select which services to mirror. it does not yet support configuring a label selector. * an unlink command is needed for removing multicluster links: see https://github.com/linkerd/linkerd2/issues/4707 * an mc uninstall command is needed for uninstalling the multicluster addon: see https://github.com/linkerd/linkerd2/issues/4708 Signed-off-by: Alex Leong <alex@buoyant.io>	2020-07-23 14:32:50 -07:00
Alejandro Pedraza	5e789ba152	Migrate CI to docker buildx and other improvements (#4765 ) * Migrate CI to docker buildx and other improvements ## Motivation - Improve build times in forks. Specially when rerunning builds because of some flaky test. - Start using `docker buildx` to pave the way for multiplatform builds. ## Performance improvements These timings were taken for the `kind_integration.yml` workflow when we merged and rerun the lodash bump PR (#4762) Before these improvements: - when merging: `24:18` - when rerunning after merge (docker cache warm): `19:00` - when running the same changes in a fork (no docker cache): `32:15` After these improvements: - when merging: `25:38` - when rerunning after merge (docker cache warm): `19:25` - when running the same changes in a fork (docker cache warm): `19:25` As explained below, non-forks and forks now use the same cache, so the important take is that forks will always start with a warm cache and we'll no longer see long build times like the `32:15` above. The downside is a slight increase in the build times for non-forks (up to a little more than a minute, depending on the case). ## Build containers in parallel The `docker_build` job in the `kind_integration.yml`, `cloud_integration.yml` and `release.yml` workflows relied on running `bin/docker-build` which builds all the containers in sequence. Now each container is built in parallel using a matrix strategy. ## New caching strategy CI now uses `docker buildx` for building the container images, which allows using an external cache source for builds, a location in the filesystem in this case. That location gets cached using actions/cache, using the key `{{ runner.os }}-buildx-${{ matrix.target }}-${{ env.TAG }}` and the restore key `${{ runner.os }}-buildx-${{ matrix.target }}-`. For example when building the `web` container, its image and all the intermediary layers get cached under the key `Linux-buildx-web-git-abc0123`. When that has been cached in the `main` branch, that cache will be available to all the child branches, including forks. If a new branch in a fork asks for a key like `Linux-buildx-web-git-def456`, the key won't be found during the first CI run, but the system falls back to the key `Linux-buildx-web-git-abc0123` from `main` and so the build will start with a warm cache (more info about how keys are matched in the [actions/cache docs](https://docs.github.com/en/actions/configuring-and-managing-workflows/caching-dependencies-to-speed-up-workflows#matching-a-cache-key)). ## Packet host no longer needed To benefit from the warm caches both in non-forks and forks like just explained, we're required to ditch doing the builds in Packet and now everything runs in the github runners VMs. As a result there's no longer separate logic for non-forks and forks in the workflow files; `kind_integration.yml` was greatly simplified but `cloud_integration.yml` and `release.yml` got a little bigger in order to use the actions artifacts as a repository for the images built. This bloat will be fixed when support for [composite actions](https://github.com/actions/runner/blob/users/ethanchewy/compositeADR/docs/adrs/0549-composite-run-steps.md) lands in github. ## Local builds You still are able to run `bin/docker-build` or any of the `docker-build.*` scripts. And to make use of buildx, run those same scripts after having set the env var `DOCKER_BUILDKIT=1`. Using buildx supposes you have installed it, as instructed [here](https://github.com/docker/buildx). ## Other - A new script `bin/docker-cache-prune` is used to remove unused images from the cache. Without that the cache grows constantly and we can rapidly hit the 5GB limit (when the limit is attained the oldest entries get evicted). - The `go-deps` dockerfile base image was changed from `golang:1.14.2` (ubuntu based) to `golang-1:14.2-alpine` also to conserve cache space. # Addressed separately in #4875: Got rid of the `go-deps` image and instead added something similar on top of all the Dockerfiles dealing with `go`, as a first stage for those Dockerfiles. That continues to serve as a way to pre-populate go's build cache, which speeds up the builds in the subsequent stages. That build should in theory be rebuilt automatically only when `go.mod` or `go.sum` change, and now we don't require running `bin/update-go-deps-shas`. That script was removed along with all the logic elsewhere that used it, including the `go_dependencies` job in the `static_checks.yml` github workflow. The list of modules preinstalled was moved from `Dockerfile-go-deps` to a new script `bin/install-deps`. I couldn't find a way to generate that list dynamically, so whenever a slow-to-compile dependency is found, we have to make sure it's included in that list. Although this simplifies the dev workflow, note that the real motivation behind this was a limitation in buildx's `docker-container` driver that forbids us from depending on images that haven't been pushed to a registry, so we have to resort to building the dependencies as a first stage in the Dockerfiles.	2020-07-22 14:27:45 -05:00
Wei Lun	85a042c151	add fish shell completion (#4751 ) fixes #4208 Signed-off-by: Wei Lun <weilun_95@hotmail.com>	2020-07-20 15:46:30 -07:00
Matei David	8b85716eb8	Introduce install flag for EndpointSlices (#4740 ) EndpointSlices have been made opt-in due to their experimental nature. This PR introduces a new install flag 'enableEndpointSlices' that will allow adopters to specify in their cli install or helm install step whether they would like to use endpointslices as a resource in the destination service, instead of the endpoints k8s resource. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-15 09:53:04 -07:00
Tarun Pothulapati	2a099cb496	Move Prometheus as an Add-On (#4362 ) This moves Prometheus as a add-on, thus making it optional but enabled by default. The also make `linkerd-prometheus` more configurable, and allow it to have its own life-cycle for upgrades, configuration, etc. This work will be followed by documentation that help users configure existing Prometheus to work with Linkerd. Changes Include: - moving prometheus manifests into a separate chart at `charts/add-ons/prometheus`, and adding it as a dependency to `linkerd2` - implement the `addOn` interface to support the same with CLI. - include configuration in `linkerd-config-addons` User Facing Changes: The default install experience does not change much but for users who have already configured Prometheus differently, would need to apply the same using the new configuration fields present in chart README	2020-07-09 23:29:03 +05:30
cpretzer	d3553c59fd	Add volume and volumeMount for buster-based proxy-init (#4692 ) * Add volume and volumeMount for buster-based proxy-init Signed-off-by: Charles Pretzer <charles@buoyant.io>	2020-07-09 09:55:07 -07:00
Zahari Dichev	a2363f4051	Fix `splitStringListToPorts` port range object rendering (#4688 ) The splitStringListToPorts helm function is currently incorrectly formating a list of ports as an array of Port objects that look ike {"port" : 555}. The config map protobuf representation however expects that the ignoreOutboundPorts and ignoreInboundPorts fields are are list of PortRange objects ({"portRange" : 555}). This was causing the injector to return an empty string when trying to parse a PortRange object resulting in the ports not getting set correctly when injecting workloads. Note that this is happening only with helm installations as this is when we are actually using a helm template for outputting the config map. To fix that the splitStringListToPorts helm function is changed to format the objects as the json representation of PortRange and is renamed to splitStringListToPortRanges Fix: #4679 Signed-off-by: Zahari Dichev zaharidichev@gmail.com	2020-07-09 14:23:12 +03:00

1 2 3 4 5 ...

773 Commits