linkerd2

Commit Graph

Author	SHA1	Message	Date
Tarun Pothulapati	4c106e9c08	cli: make check return SkipError when there is no prometheus configured (#5150 ) Fixes #5143 The availability of prometheus is useful for some calls in public-api that the check uses. This change updates the ListPods in public-api to still return the pods even when prometheus is not configured. For a test that exclusively checks for prometheus metrics, we have a gate which checks if a prometheus is configured and skips it othervise. Signed-off-by: Tarun Pothulapati tarunpothulapati@outlook.com	2020-10-29 19:57:11 +05:30
Tarun Pothulapati	3a16baa141	Use errors.Is instead of checking underlying err messages (#5140 ) * Use errors.Is instead of checking underlying err messages Fixes #5132 This PR replaces the usage of `strings.hasSuffix` with `errors.Is` wherever error messages are being checked. So, that the code is not effected by changes in the underlying message. Also adds a string const for http2 response body closed error Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-10-28 21:33:17 +05:30
Tarun Pothulapati	39e7f84773	cli: fix and update timeout warnings in profile cmd (#5122 ) Fixes #5121 * cli: skip emitting warnings in Profile Whenever the tapDuration gets completed, there is a warning occured which we do not emit. This looks like it has been changed in the latest versions of the dependency. * Use context.withDeadline instead of client.timeout The usage of `client.Timeout` is not working correctly causing `W1022 17:20:12.372780 19049 transport.go:260] Unable to cancel request for promhttp.RoundTripperFunc` to be emitted by the Kubernetes Client. This is fixed by using context.WithDeadline and passing that into the http Request. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-10-27 22:08:21 +05:30
Alex Leong	b7c5bd07ae	Add 'linkerd.io/inject: ingress' mode (#5130 ) Fixes #5118 This PR adds a new supported value for the `linkerd.io/inject` annotation. In addition to `enabled` and `disabled`, this annotation may now be set to `ingress`. This functions identically to `enabled` but it also causes the `LINKERD2_PROXY_INGRESS_MODE="true"` environment variable to be set on the proxy. This causes the proxy to operate in ingress mode as described in #5118 With this set, ingresses are able to properly load service profiles based on the l5d-dst-override header. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-26 14:32:19 -07:00
Alejandro Pedraza	177669b377	Remove code refs to controllerImageVersion (#5119 ) Followup to #5100 We had both `controllerImageVersion` and `global.controllerImageVersion` configs, but only the latter was taken into account in the chart templates, so this change removes all of its references.	2020-10-21 13:40:25 -05:00
Oliver Gould	25e49433fd	Do not permit cluster networks to be overridden per-pod (#5111 ) In #5110 the `global.proxy.destinationGetNetworks` configuration is renamed to `global.clusterNetworks` to better reflect its purpose. The `config.linkerd.io/proxy-destination-get-networks` annotation allows this configuration to be overridden per-workload, but there's no real use case for this. I don't think we want to support this value differing between pods in a cluster. No good can come of it. This change removes support for the `proxy-destination-get-networks` annotation.	2020-10-21 09:34:13 -07:00
Oliver Gould	84b1a826bd	Replace global.proxy.destinationGetNetworks with global.clusterNetworks (#5110 ) There is no longer a proxy config `DESTINATION_GET_NETWORKS`. Instead of reflecting this implementation in our values.yaml, this changes this variable to the more general `clusterNetworks` to emphasize its similarity to `clusterDomain` for the purposes of discovery.	2020-10-20 19:05:31 -07:00
Oliver Gould	c5d3b281be	Add 100.64.0.0/10 to the set of discoverable networks (#5099 ) It appears that Amazon can use the `100.64.0.0/10` network, which is technically private, for a cluster's Pod network. Wikipedia describes the network as: > Shared address space for communications between a service provider > and its subscribers when using a carrier-grade NAT. In order to avoid requiring additional configuration on EKS clusters, we should permit discovery for this network by default.	2020-10-19 12:59:44 -07:00
Oliver Gould	4f16a234aa	Add a default set of ports to bypass the proxy (#5093 ) The proxy has a default, hardcoded set of ports on which it doesn't do protocol detection (25, 587, 3306 -- all of which are server-first protocols). In a recent change, this default set was removed from the outbound proxy, since there was no way to configure it to anything other than the default set. I had thought that there was a default set applied to proxy-init, but this appears to not be the case. This change adds these ports to the default Helm values to restore the prior behavior. I have also elected to include 443 in this set, as it is generally our recommendation to avoid proxying HTTPS traffic, since the proxy provides very little value on these connections today. Additionally, the memcached port 11211 is skipped by default, as clients do not issue any sort of preamble that is immediately detectable. These defaults may change in the future, but seem like good choices for the 2.9 release.	2020-10-16 11:53:41 -07:00
Alex Leong	9701f1944e	Stop rendering addon config (#5078 ) The linkerd-addon-config is no longer used and can be safely removed. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-16 11:07:51 -07:00
Oliver Gould	222c11400b	tests: Set proxy log to linkerd=debug (#5081 ) The proxy log level `linkerd2_proxy=debug` only enables logging from a few proxy modules. We should instead use the more general `linkerd=debug`.	2020-10-14 15:31:03 -07:00
Alex Leong	500c1cc2d7	Expose namespaceSelector for admission webhooks in helm chart (#5074 ) Closes (#5026) Signed-off-by: Alex Leong <alex@buoyant.io> Co-authored-by: Raphael Taylor-Davies <r.taylordavies@googlemail.com>	2020-10-13 16:08:56 -07:00
Tarun Pothulapati	2a5e7dba62	Handle grafana add-on config repair (#5059 ) * Handle grafana add-on config repair Fixes #5014 In Grafana Add-On, Default fields i.e `grafana.image.name`, `grafana.name` have been removed from `linkerd-config-addons` after `2.8.1`. Only overriden values are stored in `linkerd-config-addons` as of now. Hence, `grafana.image.name` has to be removed from `linkerd-config-addons` unless they are overriden so that updates to it can take place especially the move from `gcr` to `ghcr`. This also removes `grafana.name` field if they are set to default, as its removed. This problem will not occur again even if we update default values, as default values are not stored in `linekrd-config-addons` anymore for all add-ons. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-10-13 13:12:49 -07:00
Alex Leong	41c1fc65b0	Upgrade using config overrides (#5005 ) This is a major refactor of the install/upgrade code which removes the config protobuf and replaces it with a config overrides secret which stores overrides to the values struct. Further background on this change can be found here: https://github.com/linkerd/linkerd2/discussions/4966 Note: as-is this PR breaks injection. There is work to move injection onto a Values-based config which must land before this can be merged. A summary of the high level changes: * the install, global, and proxy fields of linkerd-config ConfigMap are no longer populated * the CLI install flow now follows these simple steps: * load default Values from the chart * update the Values based on the provided CLI flags * render the chart with these values * also render a Secret/linkerd-config-overrides which describes the values which have been changed from their defaults * the CLI upgrade flow now follows these simple stesp: * load the default Values from the chart * if Secret/linkerd-config-overrides exists, apply the overrides onto the values * otherwise load the legacy ConfigMap/linkerd-config and use it to updates the values * further update the values based on the provided CLI flags * render the chart and the Secret/linkerd-config-overrides as above * Helm install and upgrade is unchanged Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-12 14:23:14 -07:00
Alex Leong	530d8beccc	Add podLabels and podAnnotations to Values struct (#5056 ) PR https://github.com/linkerd/linkerd2/pull/5027 added `podLabels` and `podAnnotations` to `values.yaml` to allow setting labels and annotations on pods in the Helm template. However, these fields were not added to the `Values` struct in `Values.go`. This means that these fields were not serialized out to the `linkerd-config` or to the `linkerd-config-overrides`. Furthermore, in PR #5005 which moves to using the `Values` struct more authoritatively, the `podLabels` and `podAnnotations` fields would not take effect at all. Add these fields to the `Values` struct and update all test fixtures accordingly. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-09 09:27:28 -07:00
Tarun Pothulapati	1e7bb1217d	Update Injection to use new linkerd-config.values (#5036 ) This PR Updates the Injection Logic (both CLI and proxy-injector) to use `Values` struct instead of protobuf Config, part of our move in removing the protobuf. This does not touch any of the flags, install related code. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> Co-authored-by: Alex Leong <alex@buoyant.io>	2020-10-07 09:54:34 -07:00
Tarun Pothulapati	faf77798f0	Update check to use new linkerd-config.values (#5023 ) This branch updates the check functionality to read the new `linkerd-config.values` which contains the full Values struct showing the current state of the Linkerd installation. (being added in #5020 ) This is done by adding a new `FetchCurrentConfiguraiton` which first tries to get the latest, if not falls back to the older `linkerd-config` protobuf format.` Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-10-01 11:19:25 -07:00
Alex Leong	1784f0643e	Add linkerd-config-overrides secret (#4911 ) This PR adds a new secret to the output of `linkerd install` called `linkerd-config-overrides`. This is the first step towards simplifying the configuration of the linkerd install and upgrade flow through the CLI. This secret contains the subset of the values.yaml which have been overridden. In other words, the subset of values which differ from their default values. The idea is that this will give us a simpler way to produce the `linkerd upgrade` output while still persisting options set during install. This will eventually replace the `linkerd-config` configmap entirely. This PR only adds and populates the new secret. The secret is not yet read or used anywhere. Subsequent PRs will update individual control plane components to accept their configuration through flags and will update the `linkerd upgrade` flow to use this secret instead of the `linkerd-config` configmap. This secret is only generated by the CLI and is not present or required when installing or upgrading with Helm. Here are sample contents of the secret, base64 decoded. Note that identity tls context is saved as an override so that it can be persisted across updates. Since these fields contain private key material, this object must be a secret. This secret is only used for upgrades and thus only the CLI needs to be able to read it. We will not create any RBAC bindings to grant service accounts access to this secret. ``` global: identityTrustAnchorsPEM: \| -----BEGIN CERTIFICATE----- MIIBhDCCASmgAwIBAgIBATAKBggqhkjOPQQDAjApMScwJQYDVQQDEx5pZGVudGl0 eS5saW5rZXJkLmNsdXN0ZXIubG9jYWwwHhcNMjAwODI1MjMzMTU3WhcNMjEwODI1 MjMzMjE3WjApMScwJQYDVQQDEx5pZGVudGl0eS5saW5rZXJkLmNsdXN0ZXIubG9j YWwwWTATBgcqhkjOPQIBBggqhkjOPQMBBwNCAAQ0e7IPBlVZ03TL8UVlODllbh8b 2pcM5mbtSGgpX9z0l3n5M70oHn715xu2szh63oBjPl2ZfOA5Bd43cJIksONQo0Iw QDAOBgNVHQ8BAf8EBAMCAQYwHQYDVR0lBBYwFAYIKwYBBQUHAwEGCCsGAQUFBwMC MA8GA1UdEwEB/wQFMAMBAf8wCgYIKoZIzj0EAwIDSQAwRgIhAI7Sy8P+3TYCJBlK pIJSZD4lGTUyXPD4Chl/FwWdFfvyAiEA6AgCPbNCx1dOZ8RpjsN2icMRA8vwPtTx oSfEG/rBb68= -----END CERTIFICATE----- heartbeatSchedule: '42 23 * * * ' identity: issuer: crtExpiry: "2021-08-25T23:32:17Z" tls: crtPEM: \| -----BEGIN CERTIFICATE----- MIIBhDCCASmgAwIBAgIBATAKBggqhkjOPQQDAjApMScwJQYDVQQDEx5pZGVudGl0 eS5saW5rZXJkLmNsdXN0ZXIubG9jYWwwHhcNMjAwODI1MjMzMTU3WhcNMjEwODI1 MjMzMjE3WjApMScwJQYDVQQDEx5pZGVudGl0eS5saW5rZXJkLmNsdXN0ZXIubG9j YWwwWTATBgcqhkjOPQIBBggqhkjOPQMBBwNCAAQ0e7IPBlVZ03TL8UVlODllbh8b 2pcM5mbtSGgpX9z0l3n5M70oHn715xu2szh63oBjPl2ZfOA5Bd43cJIksONQo0Iw QDAOBgNVHQ8BAf8EBAMCAQYwHQYDVR0lBBYwFAYIKwYBBQUHAwEGCCsGAQUFBwMC MA8GA1UdEwEB/wQFMAMBAf8wCgYIKoZIzj0EAwIDSQAwRgIhAI7Sy8P+3TYCJBlK pIJSZD4lGTUyXPD4Chl/FwWdFfvyAiEA6AgCPbNCx1dOZ8RpjsN2icMRA8vwPtTx oSfEG/rBb68= -----END CERTIFICATE----- keyPEM: \| -----BEGIN EC PRIVATE KEY----- MHcCAQEEIJaqjoDnqkKSsTqJMGeo3/1VMfJTBsMEuMWYzdJVxIhToAoGCCqGSM49 AwEHoUQDQgAENHuyDwZVWdN0y/FFZTg5ZW4fG9qXDOZm7UhoKV/c9Jd5+TO9KB5+ 9ecbtrM4et6AYz5dmXzgOQXeN3CSJLDjUA== -----END EC PRIVATE KEY----- ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2020-09-29 08:01:36 -07:00
Lutz Behnke	de098cd52d	make api service secrets compatible to cert manager (#4737 ) Currently the secrets for the proxy-injector, sp-validator webhooks and tap API service are using the Opaque secret type and linkerd-specific field names. This makes it impossible to use cert-manager (https://github.com/jetstack/cert-manager) to provisions and rotate the secrets for these services. This change converts the secrets defined in the linkerd2 helm charts and the controller use the kubernetes.io/tls format instead. This format is used for secrets containing the generated secrets by cert-manager. Signed-off-by: Lutz Behnke <lutz.behnke@finleap.com>	2020-09-29 09:17:09 -05:00
Tarun Pothulapati	d0caaa86c4	Bump k8s client-go to v0.19.2 (#5002 ) Fixes #4191 #4993 This bumps Kubernetes client-go to the latest v0.19.2 (We had to switch directly to 1.19 because of this issue). Bumping to v0.19.2 required upgrading to smi-sdk-go v0.4.1. This also depends on linkerd/stern#5 This consists of the following changes: - Fix ./bin/update-codegen.sh by adding the template path to the gen commands, as it is needed after we moved to GOMOD. - Bump all k8s related dependencies to v0.19.2 - Generate CRD types, client code using the latest k8s.io/code-generator - Use context.Context as the first argument, in all code paths that touch the k8s client-go interface Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-28 12:45:18 -05:00
Kevin Leimkuhler	2ec5245d67	Add configuration for opaque ports (#4972 ) ## Motivation Closes #4950 ## Solution Add the `config.linkerd.io/opaque-ports` annotation to either a namespace or pod spec to set the proxy `LINKERD2_PROXY_INBOUND_PORTS_DISABLE_PROTOCOL_DETECTION` environment variable. Currently this environment variable is not used by the proxy, but will be addressed by #4938. ## Valid values Ports: `config.linkerd.io/opaque-ports: 4322,3306` Port ranges: `config.linkerd.io/opaque-ports: 4320-4325` Mixed ports and port ranges: `config.linkerd.io/opaque-ports: 4320-4325` If the pod has named ports such as: ``` - name: nginx image: nginx:latest ports: - name: nginx-port containerPort: 80 protocol: TCP ``` The name can also be used as a value: `config.linkerd.io/opaque-ports: nginx-port` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-09-25 15:36:12 -04:00
Tarun Pothulapati	ecce5b91f6	tests: Add Calico CNI deep integration tests (#4952 ) * tests: Add new CNI deep integration tests Fixes #3944 This PR adds a new test, called cni-calico-deep which installs the Linkerd CNI plugin on top of a cluster with Calico and performs the current integration tests on top, thus validating various Linkerd features when CNI is enabled. For Calico to work, special config is required for kind which is at `cni-calico.yaml` This is different from the CNI integration tests that we run in cloud integration which performs the CNI level integration tests. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-23 19:58:28 +05:30
Nil	69ca673682	Introduce support for authenticated docker registries using imagePullSecrets, Fixes #4413 (#4898 ) * Introduce support for authenticated docker registries using imagePullSecrets Problem: Private Docker Registries are not supported for the moment as detailed in issue #4413 Solution: Every Service Account of linkerd subcomponents are Attached with imagePullSecrets, which in turn can then pulls the docker images from authenticated private registries using them. The imagePullSecret is configured in global.imagePullSecret parameter of values.yaml like imagePullSecret: - name: <name-of-private-registry-secret-resource> Fixes #4413 Signed-off-by: Nilakhya <nilakhya@hotmail.com>	2020-09-23 08:49:35 -05:00
Tarun Pothulapati	f75b9fe374	tracing: Move default values into addon-chart (#4951 ) * tracing: Move default values into chart This branch updates the tracing add-on's values into their own chart's values.yaml (just like grafana and prometheus). This prevents them from being saved into `linkerd-config-addons` where only the overridden values are stored. Thus allowing us to change the defaults. This also - Updates the check command to fall back to default values, if there are no overridden name fields. - Updates jaeger to `1.19.2` Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-15 15:19:25 -05:00
Alejandro Pedraza	ccf027c051	Push docker images to ghcr.io instead of gcr.io (#4953 ) * Push docker images to ghcr.io instead of gcr.io The `cloud_integration.yml` and `release.yml` workflows were modified to log into ghcr.io, and remove the `Configure gcloud` step which is no longer necessary. Note that besides the changes to cloud_integration.yml and release.yml, there was a change to the upgrade-stable integration test so that we do linkerd upgrade --addon-overwrite to reset the addons settings because in stable-2.8.1 the Grafana image was pegged to gcr.io/linkerd-io/grafana in linkerd-config-addons. This will need to be mentioned in the 2.9 upgrade notes. Also the egress integration test has a debug container that now is pegged to the edge-20.9.2 tag. Besides that, the other changes are just a global search and replace (s/gcr.io\/linkerd-io/ghcr.io\/linkerd/).	2020-09-10 15:16:24 -05:00
Zahari Dichev	084bb678c7	Perform TLS checks on injector, sp validator and tap (#4924 ) * Check sp-validator,proxy-injector and tap certs Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-09-10 11:21:23 -05:00
Tarun Pothulapati	c4f8ba270d	Generate Identity certs with alternate domain names (#4920 ) Updating only the go 1.15 version, makes the upgrades fail from older versions, as the identity certs do not have that setting and go 1.15 expects them. This PR upgrades the cert generation code to have that field, allowing us to move to go 1.15 in later versions of Linkerd.	2020-09-03 22:33:10 +05:30
Alex Leong	33ddd4e357	Use correct component name in multicluster checks (#4921 ) The multicluster checks make sure that the correct resources exist for each service mirror controller. When looking up these resources, it uses the `linkerd.io/control-plane-component=linkerd-service-mirror` label selector. However, these resources have the label `linkerd.io/control-plane-component=service-mirror`. This causes the resource lookup to fail to find the resource and the check spuriously fails. ``` × service mirror controller has required permissions missing ServiceAccounts: linkerd-service-mirror-self missing ClusterRoles: linkerd-service-mirror-access-local-resources-self missing ClusterRoleBindings: linkerd-service-mirror-access-local-resources-self missing Roles: linkerd-service-mirror-read-remote-creds-self missing RoleBindings: linkerd-service-mirror-read-remote-creds-self see https://linkerd.io/checks/#l5d-multicluster-source-rbac-correct for hints \| * no service mirror controller deployment for Link self ``` Instead, use the correct label selector when looking up these resources. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-08-31 13:40:53 -07:00
Hu Shuai	b1c953d20d	Fix a verb tense error (#4930 ) Signed-off-by: Hu Shuai <hus.fnst@cn.fujitsu.com>	2020-08-31 09:34:03 -05:00
Tarun Pothulapati	c9c5d97405	Remove SMI-Metrics charts and commands (#4843 ) Fixes #4790 This PR removes both the SMI-Metrics templates along with the experimental sub-commands. This also removes pkg `smi-metrics` as there is no direct use of it without the commands. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-08-24 14:35:33 -07:00
Zhou Hao	803511d77b	Add some unit test (#4853 ) Add unit tests for parsing IP addresses. Signed-off-by: Zhou Hao <zhouhao@cn.fujitsu.com>	2020-08-18 16:10:13 -07:00
Zahari Dichev	c25f0a3af5	Triger kube-system HA check based on webhook failure policy (#4861 ) This PR changes the HA check that verifies that the `config.linkerd.io/admission-webhooks=disabled` is present on kube-system to be enabled only when the failure policy for the proxy injector webhook is set to `Fail`. This allows users to skip this check in cases when the label is removed because the namespace is managed by the cloud provider like in the case described in #4754 Fix #4754 Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-08-17 13:56:03 +03:00
Josh Soref	72aadb540f	Spelling (#4872 ) This PR corrects misspellings identified by the [check-spelling action](https://github.com/marketplace/actions/check-spelling). The misspellings have been reported at `aaf440489e (commitcomment-41423663)` The action reports that the changes in this PR would make it happy: `5b82c6c5ca` Note: this PR does not include the action. If you're interested in running a spell check on every PR and push, that can be offered separately. Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-08-12 21:59:50 -07:00
Alejandro Pedraza	4876a94ed0	Update proxy-init version to v1.3.6 (#4850 ) Supersedes #4846 Bump proxy-init to v1.3.6, containing CNI fixes and support for multi-arch builds. #4846 included this in v1.3.5 but proxy.golang.org refused to update the modified SHA	2020-08-11 11:54:00 -05:00
Tarun Pothulapati	7e5804d1cf	grafana: move default values into values file (#4755 ) This PR moves default values into add-on specific values.yaml thus allowing us to update default values as they would not be present in linkerd-config-addons cm. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-08-06 13:57:28 -07:00
Alex Leong	024a35a3d3	Move multicluster API connectivity checks earlier (#4819 ) Fixes #4774 When a service mirror controller is unable to connect to the target cluster's API, the service mirror controller crashes with the error that it has failed to sync caches. This error lacks the necessary detail to debug the situation. Unfortunately, client-go does not surface more useful information about why the caches failed to sync. To make this more debuggable we do a couple things: 1. When creating the target cluster api client, we eagerly issue a server version check to test the connection. If the connection fails, the service-mirror-controller logs now look like this: ``` time="2020-07-30T23:53:31Z" level=info msg="Got updated link broken: {Name:broken Namespace:linkerd-multicluster TargetClusterName:broken TargetClusterDomain:cluster.local TargetClusterLinkerdNamespace:linkerd ClusterCredentialsSecret:cluster-credentials-broken GatewayAddress:35.230.81.215 GatewayPort:4143 GatewayIdentity:linkerd-gateway.linkerd-multicluster.serviceaccount.identity.linkerd.cluster.local ProbeSpec:ProbeSpec: {path: /health, port: 4181, period: 3s} Selector:{MatchLabels:map[] MatchExpressions:[{Key:mirror.linkerd.io/exported Operator:Exists Values:[]}]}}" time="2020-07-30T23:54:01Z" level=error msg="Unable to create cluster watcher: cannot connect to api for target cluster remote: Get \"https://36.199.152.138/version?timeout=32s\": dial tcp 36.199.152.138:443: i/o timeout" ``` This error also no longer causes the service mirror controller to crash. Updating the Link resource will cause the service mirror controller to reload the credentials and try again. 2. We rearrange the checks in `linkerd check --multicluster` to perform the target API connectivity checks before the service mirror controller checks. This means that we can validate the target cluster API connection even if the service mirror controller is not healthy. We also add a server version check here to quickly determine if the connection is healthy. Sample check output: ``` linkerd-multicluster -------------------- √ Link CRD exists √ Link resources are valid * broken W0730 16:52:05.620806 36735 transport.go:243] Unable to cancel request for promhttp.RoundTripperFunc × remote cluster access credentials are valid * failed to connect to API for cluster: [broken]: Get "https://36.199.152.138/version?timeout=30s": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers) see https://linkerd.io/checks/#l5d-smc-target-clusters-access for hints W0730 16:52:35.645499 36735 transport.go:243] Unable to cancel request for promhttp.RoundTripperFunc × clusters share trust anchors Problematic clusters: * broken: unable to fetch anchors: Get "https://36.199.152.138/api/v1/namespaces/linkerd/configmaps/linkerd-config?timeout=30s": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers) see https://linkerd.io/checks/#l5d-multicluster-clusters-share-anchors for hints √ service mirror controller has required permissions * broken √ service mirror controllers are running * broken × all gateway mirrors are healthy wrong number of (0) gateway metrics entries for probe-gateway-broken.linkerd-multicluster see https://linkerd.io/checks/#l5d-multicluster-gateways-endpoints for hints √ all mirror services have endpoints ‼ all mirror services are part of a Link mirror service voting-svc-gke.emojivoto is not part of any Link see https://linkerd.io/checks/#l5d-multicluster-orphaned-services for hints ``` Some logs from the underlying go network libraries sneak into the output which is kinda gross but I don't think it interferes too much with being able to understand what's going on. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-08-05 11:48:23 -07:00
cpretzer	670caaf8ff	Update to proxy-init v1.3.4 (#4815 ) Signed-off-by: Charles Pretzer <charles@buoyant.io>	2020-07-30 15:58:58 -05:00
Alex Leong	a1543b33e3	Add support for service-mirror selectors (#4795 ) * Add selector support Signed-off-by: Alex Leong <alex@buoyant.io> * Removed unused labels Signed-off-by: Alex Leong <alex@buoyant.io>	2020-07-30 10:07:14 -07:00
Alexander Berger	4ffea3ba08	CNI add support for priorityClassName (#4742 ) * CNI add support for priorityClassName As requested in #2981 one should be able to optionally define a priorityClassName for the linkerd2 pods. With this commit support for priorityClassName is added to the CNI plugin helm chart as well as to the cli command for installing the CNI plugin. Also added an `installNamespace` Helm option for the CNI installation. Implements part of #2981. Signed-off-by: alex.berger@nexiot.ch <alex.berger@nexiot.ch>	2020-07-30 10:43:06 -05:00
Tarun Pothulapati	c68ab23ab2	Add global.prometheusUrl field for byop use-case (#4390 ) This pr adds `globa.prometheusUrl` field which will be used to configure publlic-api, hearbeat, grafana, etc (i,e query path) to use a external Prometheus.	2020-07-28 12:26:34 +05:30
Matt Miller	fc33b9b9aa	support overriding inbound and outbound connect timeouts. (#4759 ) * support overriding inbound and outbound connect timeouts. * add validation on user provided TCP connect timeouts * convert valid time values into ms Signed-off-by: Matt Miller <mamiller@rosettastone.com>	2020-07-27 13:56:21 -07:00
Alejandro Pedraza	2aea2221ed	Fixed `linkerd check` not finding Prometheus (#4797 ) * Fixed `linkerd check` not finding Prometheus ## The Problem `linkerd check` run right after install is failing because it can't find the Prometheus Pod. ## The Cause The "control plane pods are ready" check used to verify the existence of all the control plane pods, blocking until all the pods were ready. Since #4724, Prometheus is no longer included in that check because it's checked separately as an add-on. An unintended consequence is that when the ensuing "control plane self-check" is triggered, Prometheus might not be ready yet and the check fails because it doesn't do retries. ## The Fix The "control plane self-check" uses a gRPC call (it's the only check that does that) and those weren't designed with retries in mind. This PR adds retry functionality to the `runCheckRPC()` function, making sure the final output remains the same It also temporarily disables the `upgrade-edge` integration test because after installing edge-20.7.4 `linkerd check` will fail because of this.	2020-07-27 11:54:03 -05:00
Alex Leong	d540e16c8b	Make service mirror controller per target cluster (#4710 ) This PR removes the service mirror controller from `linkerd mc install` to `linkerd mc link`, as described in https://github.com/linkerd/rfc/pull/31. For fuller context, please see that RFC. Basic multicluster functionality works here including: * `linkerd mc install` installs the Link CRD but not any service mirror controllers * `linkerd mc link` creates a Link resource and installs a service mirror controller which uses that Link * The service mirror controller creates and manages mirror services, a gateway mirror, and their endpoints. * The `linkerd mc gateways` command lists all linked target clusters, their liveliness, and probe latences. * The `linkerd check` multicluster checks have been updated for the new architecture. Several checks have been rendered obsolete by the new architecture and have been removed. The following are known issues requiring further work: * the service mirror controller uses the existing `mirror.linkerd.io/gateway-name` and `mirror.linkerd.io/gateway-ns` annotations to select which services to mirror. it does not yet support configuring a label selector. * an unlink command is needed for removing multicluster links: see https://github.com/linkerd/linkerd2/issues/4707 * an mc uninstall command is needed for uninstalling the multicluster addon: see https://github.com/linkerd/linkerd2/issues/4708 Signed-off-by: Alex Leong <alex@buoyant.io>	2020-07-23 14:32:50 -07:00
ZouYu	e75b1ca13c	Add unit test for pkg/version/channelversion.go (#4784 ) * Add unit test for pkg/version/channelversion.go Signed-off-by: zouyu <zouy.fnst@cn.fujitsu.com>	2020-07-23 10:29:30 -07:00
Tarun Pothulapati	986e0d4627	prometheus: add add-on checks (#4756 ) As linkerd-prometheus is optional now, the checks are also separated and should only work when the prometheus add-on is installed. This is done by re-using the add-on check code.	2020-07-23 18:03:24 +05:30
ZouYu	46d22f8b04	Add unit test for pkg/util/http.go (#4770 ) Signed-off-by: zouyu <zouy.fnst@cn.fujitsu.com>	2020-07-21 14:08:53 -07:00
Matei David	146c593cd5	Uncomment EndpointSliceAccess function (#4760 ) * Small PR that uncomments the `EndpointSliceAcess` method and cleans up left over todos in the destination service. * Based on the past three PRs related to `EndpointSlices` (#4663 #4696 #4740); they should now be functional (albeit prone to bugs) and ready to use. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-20 14:50:43 -07:00
Tarun Pothulapati	b7e9507174	Remove/Relax prometheus related checks (#4724 ) * Removes/Relaxes prometheus related checks Now that prometheus is an add-on, There can be cases where prometheus is disabled at which the check should show a warning but not fail. This decouples the tight depedency. This changes the following checks: - Removes serviceAccount and pod checks in the CLI. - Relaxes `linkerd-api` checks to only check for prometheus access when the URL is not empty. This should work seamlessly with external prometheus as that URL will be passed and it performs the same check. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-07-20 14:24:00 -07:00
Matei David	8b85716eb8	Introduce install flag for EndpointSlices (#4740 ) EndpointSlices have been made opt-in due to their experimental nature. This PR introduces a new install flag 'enableEndpointSlices' that will allow adopters to specify in their cli install or helm install step whether they would like to use endpointslices as a resource in the destination service, instead of the endpoints k8s resource. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-15 09:53:04 -07:00
Tarun Pothulapati	2a099cb496	Move Prometheus as an Add-On (#4362 ) This moves Prometheus as a add-on, thus making it optional but enabled by default. The also make `linkerd-prometheus` more configurable, and allow it to have its own life-cycle for upgrades, configuration, etc. This work will be followed by documentation that help users configure existing Prometheus to work with Linkerd. Changes Include: - moving prometheus manifests into a separate chart at `charts/add-ons/prometheus`, and adding it as a dependency to `linkerd2` - implement the `addOn` interface to support the same with CLI. - include configuration in `linkerd-config-addons` User Facing Changes: The default install experience does not change much but for users who have already configured Prometheus differently, would need to apply the same using the new configuration fields present in chart README	2020-07-09 23:29:03 +05:30

1 2 3 4 5 ...

462 Commits