linkerd2

Commit Graph

Author	SHA1	Message	Date
Kevin Leimkuhler	147d85dc70	Update `GetProfile` clients with policy server updates (#7388 ) ### What `GetProfile` clients do not receive destinatin profiles that consider Server protocol fields the way that `Get` clients do. If a Server exists for a `GetProfile` destination that specifies the protocol for that destination is `opaque`, this information is not passed back to the client. #7184 added this for `Get` by subscribing clients to Endpoint/EndpointSlice updates. When there is an update, or there is a Server update, the endpoints watcher passes this information back to the endpoint translator which handles sending the update back to the client. For `GetProfile` the situation is different. As with `Get`, we only consider Servers when dealing with Pod IPs, but this only occurs in two situations for `GetProfile`. 1. The destination is a Pod IP and port 2. The destionation is an Instance ID and port In both of these cases, we need to check if a already Server selects the endpoint and we need to subscribe for Server updates incase one is added or deleted which selects the endpoint. ### How First we check if there is already a Server which selects the endpoint. This is so that when the first destionation profile is returned, the client knows if the destination is `opaque` or not. After sending that first update, we then subscribe the client for any future updates which will come from a Server being added or deleted. This is handled by the new `ServerWatcher` which watches for Server updates on the cluster; when an update occurs it sends that to the `endpointProfileTranslator` which translates the protcol update into a DestinationProfile. By introducing the `endpointProfileTranslator` which only handles protocol updates, we're able to decouple the endpoint logic from `profileTranslator`—it's `endpoint` field has been removed now that it only handles updates for ServiceProfiles for Services. ### Testing A unit test has been added and below are some manual testing instructions to see how it interacts with Server updates: <details> <summary>app.yaml</summary> ```yaml apiVersion: v1 kind: Pod metadata: name: pod labels: app: pod spec: containers: - name: app image: nginx ports: - name: http containerPort: 80 --- apiVersion: policy.linkerd.io/v1beta1 kind: Server metadata: name: srv labels: policy: srv spec: podSelector: matchLabels: app: pod port: 80 proxyProtocol: opaque ``` </details> ```shell $ go run ./controller/cmd/main.go destination ``` ```shell $ linkerd inject app.yaml \|kubectl apply -f - ... $ kubectl get pods -o wide NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES pod 2/2 Running 0 53m 10.42.0.34 k3d-k3s-default-server-0 <none> <none> $ go run ./controller/script/destination-client/main.go -method getProfile -path 10.42.0.34:80 ... ``` You can add/delete `srv` as well as edit its `proxyProtocol` field to observe the correct DestinationProfile updates. Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2021-12-08 12:26:27 -07:00
Tarun Pothulapati	4170b49b33	smi: remove default functionality in linkerd (#7334 ) Now, that SMI functionality is fully being moved into the [linkerd-smi](www.github.com/linkerd/linkerd-smi) extension, we can stop supporting its functionality by default. This means that the `destination` component will stop reacting to the `TrafficSplit` objects. When `linkerd-smi` is installed, It does the conversion of `TrafficSplit` objects to `ServiceProfiles` that destination components can understand, and will react accordingly. Also, Whenever a `ServiceProfile` with traffic splitting is associated with a service, the same information (i.e splits and weights) is also surfaced through the `UI` (in the new `services` tab) and the `viz cmd`. So, We are not really loosing any UI functionality here. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-12-03 12:07:30 +05:30
Krzysztof Dryś	f92e77f7f0	Remove legacy upgrade and it's references (#7309 ) With [linkerd2#5008](https://github.com/linkerd/linkerd2/issues/5008) and associated PRs, we changed the way configuration is handled by storing a helm values struct inside of the configmap. Now that we have had one stable release with new configuration, were no longer use and need to maintain the legacy config. This commit removes all the associated logic, protobuf files, and references. Changes Include: - Removed [`proto/config/config.proto`](https://github.com/linkerd/linkerd2/blob/main/proto/config/config.proto) - Changed [`bin/protoc-go.sh`](https://github.com/linkerd/linkerd2/blob/main/bin/protoc-go.sh) to not include `config.proto` - Changed [`FetchLinkerdConfigMap()`](`741fde679b/pkg/healthcheck/healthcheck.go (L1768)`) in `healthcheck.go` to return only the configmap, with the pb type. - Changed [`FetchCurrentConfiguration()`](`741fde679b/pkg/healthcheck/healthcheck.go (L1647)`) only unmarshal and use helm value struct from configmap (as a follow-up to the todo above; note that there's already a todo here to refactor the function once value struct is the default, which has already happened) - Removed [`upgrade_legacy.go`](https://github.com/linkerd/linkerd2/blob/main/cli/cmd/upgrade_legacy.go) Signed-off-by: Krzysztof Dryś <krzysztofdrys@gmail.com>	2021-11-29 20:08:58 +05:30
Kevin Leimkuhler	01cbe616f1	Honor Server `proxyProtocol` in destination service `Get` with policy CRD APIs (#7184 ) This change ensures that if a Server exists with `proxyProtocol: opaque` that selects an endpoint backed by a pod, that destination requests for that pod reflect the fact that it handles opaque traffic. Currently, the only way that opaque traffic is honored in the destination service is if the pod has the `config.linkerd.io/opaque-ports` annotation. With the introduction of Servers though, users can set `server.Spec.ProxyProtocol: opaque` to indicate that if a Server selects a pod, then traffic to that pod's `server.Spec.Port` should be opaque. Currently, the destination service does not take this into account. There is an existing change up that _also_ adds this functionality; it takes a different approach by creating a policy server client for each endpoint that a destination has. For `Get` requests on a service, the number of clients scales with the number of endpoints that back that service. This change fixes that issue by instead creating a Server watch in the endpoint watcher and sending updates through to the endpoint translator. The two primary scenarios to consider are ### A `Get` request for some service is streaming when a Server is created/updated/deleted When a Server is created or updated, the endpoint watcher iterates through its endpoint watches (`servicePublisher` -> `portPublisher`) and if it selects any of those endpoints, the port publisher sends an update if the Server has marked that port as opaque. When a Server is deleted, the endpoint watcher once again iterates through its endpoint watches and deletes the address set's `OpaquePodPorts` field—ensuring that updates have been cleared of Server overrides. ### A `Get` request for some service happens after a Server is created When a `Get` request occurs (or new endpoints are added—they both take the same path), we must check if any of those endpoints are selected by some existing Server. If so, we have to take that into account when creating the address set. This part of the change gives me a little concern as we first must get all the Servers on the cluster and then create a set of _all_ the pod-backed endpoints that they select in order to determine if any of these _new_ endpoints are selected. ## Testing Right now this can be tested by starting up the destination service locally and running `Get` requests on a service that has endpoints selected by a Server app.yaml ```yaml apiVersion: v1 kind: Pod metadata: name: pod labels: app: pod spec: containers: - name: app image: nginx ports: - containerPort: 80 --- apiVersion: v1 kind: Service metadata: name: svc spec: selector: app: pod ports: - name: http port: 80 --- apiVersion: policy.linkerd.io/v1alpha1 kind: Server metadata: name: srv labels: policy: srv spec: podSelector: matchLabels: app: pod port: 80 proxyProtocol: HTTP/1 ``` ```bash $ go run controller/script/destination-client/main.go -path svc.default.svc.cluster.local:80 ``` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-11-23 20:35:53 -07:00
Matei David	690bc09c35	Stop using deprecated `beta.kubernetes.io/node` label (#7310 ) In our chart values and (some) integration tests, we're using a deprecated label for node selection. According to the warning messages we get during installation, the label has been deprecated since k8s `v1.14`: ``` Warning: spec.template.spec.nodeSelector[beta.kubernetes.io/os]: deprecated since v1.14; use "kubernetes.io/os" instead Warning: spec.jobTemplate.spec.template.spec.nodeSelector[beta.kubernetes.io/os]: deprecated since v1.14; use "kubernetes.io/os" instead ``` This PR changes all occurrences of `beta.kubernetes.io/node` with `kubernetes.io/node`. Fixes #7225	2021-11-19 09:50:15 -08:00
Alex Leong	ea5461f674	Fix identity overrides for endpoint slices (#7243 ) When the `mirror.linkerd.io/remote-gateway-identity` and `mirror.linkerd.io/remote-svc-fq-name` annotations are set on an EndpointSlice object, the destination controller does not return the correct identity hints for that endpoint. We fix an incorrect assignment to fix this. We also fix some logic that can result in a nil pointer dereference instead of comparing empty strings. We add a test case to exercise these. Signed-off-by: Alex Leong <alex@buoyant.io>	2021-11-09 16:38:46 -08:00
Kevin Leimkuhler	ebb1ee8c4c	Deprecate `topologyKeys` and add support for endpoint slices `Hints`. (#6698 ) ## background In order to upgrade `client-go` and other related libraries to `v0.22.0`, we had to address the deprecation of the service's `TopologyKeys` field. This field and it's related feature have been deprecated and superseded by [Topology Aware Hints](https://github.com/kubernetes/enhancements/blob/master/keps/sig-network/2433-topology-aware-hints/README.md). The goal of topology aware hints is to to provide a simpler way for users to prefer endpoints by basing decisions soely off the node's `topology.kubernetes.io/zone` label. If a node is in `zone-a`, then it should prefer endpoints that _should_ be consumed by clients in `zone-a`. kube-proxy (and now the destination controller) know that an endpoint _should_ be consumed by clients in certain zones if its `Hints.ForZones` field is set with a zone value that matches that of the client. For example, the endpoint slice controller may add the following hint to an endpoint: ``` - addresses: ["1.1.1.1"] zone: "zone-a" hints: zone: "zone-b" ``` The above endpoint is an endpoint that is located in `zone-a` but should be consumed by clients in `zone-b`. ## changes Now that topological preference is not a concept, we can remove it from the `servicePublisher` and `portPublisher` structs. The fields were only there so that it could be populated down to individual addresses. The `Hints` field is only present on endpoints that belong to an `EndpointSlice`, so use of this field is limited to the `endpointSliceToAddresses` function. When endpoint slices are translated to an `AddressSet` now, for each address (endpoint) we make sure to copy the `Hints.ForZones` field if it is present. This field is only present if it's set by the endpoint slice controller and it has [several safeguards](https://kubernetes.io/docs/concepts/services-networking/topology-aware-hints/#safeguards). After `endpointSliceToAddresses` has translated an endpoint slice into an `AddressSet` and updated the endpoint translator's `availableEndpoints`, filtering takes place and is the crux of this change. For each potential address that we have to consider in `availableEndpoints`, we make sure to only return a set of addresses who's consumption zone (zones in `forZones` field) match that of the node's zone. That way, we only communicate with endpoints that have been labeled by the endpoint slice controller for the current node we're on. This allows us to remove the ordering/hierarchy of topological region and considering the `*` value. ## testing I've added a unit test which creates an endpoint translator tied to a node in `west-1a` and asserts that it only handles updates for addresses that should be consumed by clients in `west-1a`. Closes #6637 Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-11-08 12:21:31 -07:00
Bart Peeters	6bf507ad32	Change log level for watchers in destination svc Make destination info logs clearer by changing log level of watchers log messages 'Establishing watch', 'Starting watch' and 'Stopping watch' from info to debug (#6917) Signed-off-by: Bart Peeters <birtpeeters@hotmail.com>	2021-09-20 09:44:32 +01:00
LiuDui	61d4fa80fc	remove unused constants (#6630 ) Signed-off-by: liudui <1693291525@qq.com>	2021-08-09 16:23:48 +01:00
Alex Leong	24792cfd1c	Remove core dependency on viz (#6497 ) Fixes #5589 The core control plane has a dependency on the viz package in order to use the `BuildResource` function. This "backwards" dependency means that the viz source code needs to be included in core docker-builds and is bad for code hygiene. We move the `BuildResource` function into the viz package. In `cli/cmd/metrics.go` we replace a call to `BuildResource` with a call directly to `CanonicalResourceNameFromFriendlyName`. Signed-off-by: Alex Leong <alex@buoyant.io>	2021-07-19 14:28:45 -07:00
dependabot[bot]	789aeea561	Fix gRPC servers (#6510 ) Bump github.com/linkerd/linkerd2-proxy-api from 0.1.18 to 0.2.0 Bumps [github.com/linkerd/linkerd2-proxy-api](https://github.com/linkerd/linkerd2-proxy-api) from 0.1.18 to 0.2.0. - [Release notes](https://github.com/linkerd/linkerd2-proxy-api/releases) - [Changelog](https://github.com/linkerd/linkerd2-proxy-api/blob/main/CHANGES.md) - [Commits](https://github.com/linkerd/linkerd2-proxy-api/compare/v0.1.18...v0.2.0) --- updated-dependencies: - dependency-name: github.com/linkerd/linkerd2-proxy-api dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: Oliver Gould <olix0r@gmail.com> Co-authored-by: Oliver Gould <ver@buoyant.io> Co-authored-by: Oliver Gould <olix0r@gmail.com>	2021-07-19 10:24:23 -05:00
Matei David	06ef634a9b	Add endpoint in profile response for requests on pod DNS. (#6260 ) Closes #6253 ### What --- When we send a profile request with a pod IP, we get back an endpoint as part of the response. This has two advantages: we avoid building a load balancer and we can treat endpoint failure differently (with more of a fail fast approach). At the moment, when we use a pod DNS as the target of the profile lookup, we don't have an endpoint returned in the response. Through this change, the behaviour will be consistent. Whenever we look up a pod (either through IP or DNS name) we will get an endpoint back. The change also attempts to simplify some of the logic in GetProfile. ### How --- We already have a way to build an endpoint and return it back to the client; I sought to re-use most of the code in an effort to also simplify `GetProfile()`. I extracted most of the code that would have been duplicated into a separate method that is responsible for building the address, looking at annotations for opaque ports and for sending the response back. In addition, to support a pod DNS fqn I've expanded on the `else` branch of the topmost if statement -- if our host is not an IP, we parse the host to get the k8s fqn. If the parsing function returns an instance ID along with the ServiceID, then we know we are dealing directly with a pod -- if we do, we fetch the pod using the core informer and then return an endpoint for it. ### Tests --- I've tested this mostly with the destination client script. For the tests, I used the following pods: ``` ❯ kgp -n emojivoto -o wide NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES voting-ff4c54b8d-zbqc4 2/2 Running 0 3m58s 10.42.0.53 k3d-west-server-0 <none> <none> web-0 2/2 Running 0 3m58s 10.42.0.55 k3d-west-server-0 <none> <none> vote-bot-7d89964475-tfq7j 2/2 Running 0 3m58s 10.42.0.54 k3d-west-server-0 <none> <none> emoji-79cc56f589-57tsh 2/2 Running 0 3m58s 10.42.0.52 k3d-west-server-0 <none> <none> # emoji pod has an opaque port set to 8080. # web-svc is a headless service and it backs a statefulset (which is why we have web-0). # without a headless service we can't lookup based on pod DNS. ``` `Responses before the change`: ``` # request on IP, this is how things work at the moment. I included this because there shouldn't be # any diff between the response given here and the response we get with the change. # note: this corresponds to the emoji pod which has opaque ports set to 8080. ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.52:8080 INFO[0000] opaque_protocol:true retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} endpoint:{addr:{ip:{ipv4:170524724} port:8080} weight:10000 metric_labels:{key:"control_plane_ns" value:"linkerd"} metric_labels:{key:"deployment" value:"emoji"} metric_labels:{key:"namespace" value:"emojivoto"} metric_labels:{key:"pod" value:"emoji-79cc56f589-57tsh"} metric_labels:{key:"pod_template_hash" value:"79cc56f589"} metric_labels:{key:"serviceaccount" value:"emoji"} tls_identity:{dns_like_identity:{name:"emoji.emojivoto.serviceaccount.identity.linkerd.cluster.local"}} protocol_hint:{h2:{} opaque_transport:{inbound_port:4143}}} INFO[0000] # request web-0 by IP # there shouldn't be any diff with the response we get after the change ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.55:8080 INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} endpoint:{addr:{ip:{ipv4:170524727} port:8080} weight:10000 metric_labels:{key:"control_plane_ns" value:"linkerd"} metric_labels:{key:"namespace" value:"emojivoto"} metric_labels:{key:"pod" value:"web-0"} metric_labels:{key:"serviceaccount" value:"web"} metric_labels:{key:"statefulset" value:"web"} tls_identity:{dns_like_identity:{name:"web.emojivoto.serviceaccount.identity.linkerd.cluster.local"}} protocol_hint:{h2:{}}} INFO[0000] # request web-0 by DNS name -- will not work. ❯ go run controller/script/destination-client/main.go -method getProfile -path web-0.web-svc.emojivoto.svc.cluster.loc al:8080 INFO[0000] fully_qualified_name:"web-0.web-svc.emojivoto.svc.cluster.local" retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"web-svc.emojivoto.svc.cluster.local.:8080" weight:10000} INFO[0000] INFO[0000] fully_qualified_name:"web-0.web-svc.emojivoto.svc.cluster.local" retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"web-svc.emojivoto.svc.cluster.local.:8080" weight:10000} INFO[0000] # ^ # \| # --> no endpoint in the response ``` `Responses after the change`: ``` # request profile for emoji, we see opaque transport being set on the endpoint. ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.52:8080 INFO[0000] opaque_protocol:true retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} endpoint:{addr:{ip:{ipv4:170524724} port:8080} weight:10000 metric_labels:{key:"control_plane_ns" value:"linkerd"} metric_labels:{key:"deployment" value:"emoji"} metric_labels:{key:"namespace" value:"emojivoto"} metric_labels:{key:"pod" value:"emoji-79cc56f589-57tsh"} metric_labels:{key:"pod_template_hash" value:"79cc56f589"} metric_labels:{key:"serviceaccount" value:"emoji"} tls_identity:{dns_like_identity:{name:"emoji.emojivoto.serviceaccount.identity.linkerd.cluster.local"}} protocol_hint:{h2:{} opaque_transport:{inbound_port:4143}}} INFO[0000] # request profile for web-0 with IP. ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.55:8080 INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} endpoint:{addr:{ip:{ipv4:170524727} port:8080} weight:10000 metric_labels:{key:"control_plane_ns" value:"linkerd"} metric_labels:{key:"namespace" value:"emojivoto"} metric_labels:{key:"pod" value:"web-0"} metric_labels:{key:"serviceaccount" value:"web"} metric_labels:{key:"statefulset" value:"web"} tls_identity:{dns_like_identity:{name:"web.emojivoto.serviceaccount.identity.linkerd.cluster.local"}} protocol_hint:{h2:{}}} INFO[0000] # request profile for web-0 with pod DNS, resp contains endpoint. ❯ go run controller/script/destination-client/main.go -method getProfile -path web-0.web-svc.emojivoto.svc.cluster.local:8080 INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} endpoint:{addr:{ip:{ipv4:170524727} port:8080} weight:10000 metric_labels:{key:"control_plane_ns" value:"linkerd"} metric_labels:{key:"namespace" value:"emojivoto"} metric_labels:{key:"pod" value:"web-0"} metric_labels:{key:"serviceaccount" value:"web"} metric_labels:{key:"statefulset" value:"web"} tls_identity:{dns_like_identity:{name:"web.emojivoto.serviceaccount.identity.linkerd.cluster.local"}} protocol_hint:{h2:{}}} INFO[0000] ``` Signed-off-by: Matei David <matei@buoyant.io>	2021-06-22 16:51:29 -06:00
Josh Soref	0be792fadc	Spelling (#6215 ) This PR corrects misspellings identified by the [check-spelling action](https://github.com/marketplace/actions/check-spelling). The misspellings have been reported at `0d56327e6f (commitcomment-51603624)` The action reports that the changes in this PR would make it happy: `03a9c310aa` Note: this PR does not include the action. If you're interested in running a spell check on every PR and push, that can be offered separately. Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2021-06-07 15:16:59 -06:00
wangchenglong01	9ea66c8f73	Condition is always 'false' because 'err' is always 'nil' (#6218 ) Remove unnecessary err check Signed-off-by: Cookie Wang <wangchl01@inspur.com>	2021-06-04 14:57:00 +05:30
Tarun Pothulapati	432f90c4cf	destination: prefer `sp.dstOverrides` over `trafficsplit` (#6156 ) This updates the destination to prefer `serviceprofiles.dstOverrides` over `trafficsplits`. This is useful as it is important for ServiceProfile to take preference over TrafficSplits when both are present. This also makes integration testing the `smi-adaptor` easier. This also adds unit tests in the `traffic_split_adaptor` to check for the same. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-05-27 17:34:54 +05:30
Kevin Leimkuhler	a12e2226b4	Separate protocol hint setting from H2 upgrades (#6150 ) While uncommon, if H2 upgrades are disabled it's possible for an opaque workload to not have it's hint.OpaqueTransport field set in it's destination profile response. This changes the H2 upgrade enabled check to be specific for setting the hint.Protocol while allowing hint.OpaqueTransport to be set independent of that value. Signed-off-by: Kevin Leimkuhler kevin@kleimkuhler.com Co-authored-by: Oliver Gould <ver@buoyant.io>	2021-05-24 13:15:54 -07:00
Oliver Gould	f2eb3162d1	destination: Check port bounds (#6143 ) CodeQL analysis flags that our use of `strconv.Atoi` is potentially incorrect. More information [here][1]. This change addresses this by explicitly checking the bounds of the port integer before casting it from `int` to `uint32`. [1]: https://codeql.github.com/codeql-query-help/go/go-incorrect-integer-conversion/	2021-05-24 10:20:47 -07:00
Tarun Pothulapati	5f2d969cfa	Support Traffic Splitting through `ServiceProfile.dstOverrides` (#6060 ) * Support Traffic Splitting through `ServiceProfile.dstOverrides` This commit - updates the destination logic to prevent the override of `ServiceProfiles.dstOverrides` when a `TS` is absent and no `dstOverrides` set. This means that any `serviceprofiles.dstOverrides` set by the user are correctly propagated to the proxy and thus making Traffic Splitting through service profiles work. - This also updates the `profile_translator.toDstOverrides` to use the port from `GetProfiles` when there is no port in `dstOverrides.authority` After these changes, The following `ServiceProfile` to perform traffic splitting should work. ```yaml apiVersion: linkerd.io/v1alpha2 kind: ServiceProfile metadata: name: backend-svc.linkerd-trafficsplit-test-sp.svc.cluster.local spec: dstOverrides: - authority: "backend-svc.linkerd-trafficsplit-test-sp.svc.cluster.local.:8080" weight: 500m - authority: "failing-svc.linkerd-trafficsplit-test-sp.svc.cluster.local.:8080" weight: 500m ``` This PR also adds an integration test i.e `TestTrafficSplitCliWithSP` which checks for Traffic Splitting through Service Profiles. This integration test follows the same pattern of other traffic split integration tests but Instead, we perform `linkerd viz stat` on the client and check if there are expected server objects to be there as `linkerd viz stat ts` does not _yet_ work with Service Profiles. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-05-03 23:39:56 +05:30
Tarun Pothulapati	fac28ff8a7	destination: Remove support for IP Queries in `Get` API (#6018 ) * destination: Remove support for IP Queries in `Get` API Fixes #5246 This PR updates the destination to report an error when `Get` is called for IP Queries. As the issue mentions, The proxies are not using this API anymore and it helps to simplify and remove unnecessary logic. This removes the relevant `IPWatcher` logic, along with unit tests Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-04-21 12:40:40 +05:30
Alejandro Pedraza	6980e45e1d	Remove the `linkerd-controller` pod (#6039 ) * Remove the `linkerd-controller` pod Now that we got rid of the `Version` API (#6000) and the destination API forwarding business in `linkerd-controller` (#5993), we can get rid of the `linkerd-controller` pod. ## Removals - Deleted everything under `/controller/api/public` and `/controller/cmd/public-api`. - Moved `/controller/api/public/test_helper.go` to `/controller/api/destination/test_helper.go` because those are really utils for destination testing. I also extracted from there the prometheus mock structs and put that under `/pkg/prometheus/test_helper.go`, which is now by both the `linkerd diagnostics endpoints` and the `metrics-api` tests, removing some duplication. - Deleted the `controller.yaml` and `controller-rbac.yaml` helm templates along with the `publicAPIResources` and `publicAPIProxyResources` helm values. ## Health checks - Removed the `can initialize the client` check given such client is no longer needed. The `linkerd-api` section was left with only the check `control pods are ready`, so I moved that under the `linkerd-existence` section and got rid of the `linkerd-api` section altogether. - In that same `linkerd-existence` section, got rid of the `controller pod is running` check. ## Other changes - Fixed the Control Plane section of the dashboard, taking account the disappearance of `linkerd-controller` and previously, of `linkerd-sp-validator`.	2021-04-19 09:57:45 -05:00
Dennis Adjei-Baah	78363ca894	Disable protocol and TLS hints on skipped ports (#6022 ) When a pod is configured with `skip-inbound-ports` annotation, a client proxy trying to connect to that pod tries to connect to it via H2 and also tries to initiate a TLS connection. This issue is caused by the destination controller when it sends protocol and TLS hints to the client proxy for that skipped port. This change fixes the destination controller so that it no longer sends protocol and TLS identity hints to outbound proxies resolving a `podIP:port` that is on a skipped inbound port. I've included a test that exhibits this error prior to this fix but you can also test the prior behavior by: ```bash curl https://run.linkerd.io/booksapp.yml > booksapp.yaml # edit either the books or authors service to: 1: Configure a failure rate of 0.0 2: add the `skip-inbound-ports` config annotation bin/linkerd viz stat pods webapp There should be no successful requests on the webapp deployment ``` Fixes #5995 Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2021-04-16 12:44:17 -04:00
Alejandro Pedraza	c24585e6ea	Removed `Version` API from the public-api (#6000 ) * Removed `Version` API from the public-api This is a sibling PR to #5993, and it's the second step towards removing the `linkerd-controller` pod. This one deals with a replacement for the `Version` API, fetching instead the `linkerd-config` CM and retrieving the `LinkerdVersion` value. ## Changes to the public-api - Removal of the `publicPb.ApiClient` entry from the `Client` interface - Removal of the `publicPb.ApiServer` entry from the `Server` interface - Removal of the `Version` and related methods from `client.go`, `grpc_server.go` and `http_server.go` ## Changes to `linkerd version` - Removal of all references to the public API. - Call `healthcheck.GetServerVersion` to retrieve the version ## Changes to `linkerd check` - Removal of the "can query the control API" check from the "linkerd-api" section - Addition of a new "can retrieve the control plane version" check under the "control-plane-version" section ## Changes to `linkerd-web` - The version is now retrieved from the `linkerd-config` CM instead of a public-API call. - Removal of all references to the public API. - Removal of the `data-go-version` global attribute on the dashboard, which wasn't being used. ## Other changes - Added `ValuesFromConfigMap` function in `values.go` to convert the `linkerd-config` CM into a `*Values` struct instance - Removal of the `public` protobuf - Refactor 'linkerd repair' to use the refactored 'healthcheck.GetServerVersion()' function	2021-04-16 11:23:55 -05:00
Alejandro Pedraza	b22b5849e3	Removed Destination's `Get` API from the public-api (#5993 ) * Removed Destination's `Get` API from the public-api This is the first step towards removing the `linkerd-controller` pod. It deals with removing the Destination `Get` http and gRPC endpoint it exposes, that only the `linkerd diagnostics endpoints` is consuming. Removed all references to Destination in the public-api, including all the gRPC-to-http-to-gRPC forwardings: - Removed the `Get` method from the public-api gRPC server that forwarded the connection from the controller pod to the destination pod. Clients should now connect directly to the destination service via gRPC. - Likewise, removed the destination boilerplate in the public-api http server (and its `main.go`) that served the `Get` destination endpoint and forwarded it into the gRPC server. - Finally, removed the destination boilerplate from the public-api's `client.go` that created a client connecting to the http API.	2021-04-16 09:04:17 -05:00
Bruce Chen Wenliang	b84b2077d3	Ignore pods in "Terminating" when watching IP addresses. (#5940 ) Fixes #5939 Some CNIs reasssign the IP of a terminating pod to a new pod, which leads to duplicate IPs in the cluster. It eventually triggers #5939. This commit will make the IPWatcher, when given an IP, filter out the terminating pods (when a pod is given a deletionTimestamp). The issue is hard reproduce because we are not able to assign a particular IP to a pod manually. Signed-off-by: Bruce <wenliang.chen@personio.de> Co-authored-by: Bruce <wenliang.chen@personio.de>	2021-03-24 18:21:42 +05:30
Kevin Leimkuhler	3f72c998b3	Handle pod lookups for pods that map to a host IP and host port (#5904 ) This fixes an issue where pod lookups by host IP and host port fail even though the cluster has a matching pod. Usually these manifested as `FailedPrecondition` errors, but the messages were too long and resulted in http/2 errors. This change depends on #5893 which fixes that separate issue. This changes how often those `FailedPrecondition` errors actually occur. The destination service now considers pod host IPs and should reduce the frequency of those errors. Closes #5881 --- Lookups like this happen when a pod is created with a host IP and host port set in its spec. It still has a pod IP when running, but requests to `hostIP:hostPort` will also be redirected to the pod. Combinations of host IP and host Port are unique to the cluster and enforced by Kubernetes. Currently, the destination services fails to find pods in this scenario because we only keep an index with pod and their pod IPs, not pods and their host IPs. To fix this, we now also keep an index of pods and their host IPs—if and only if they have the host IP set. Now when doing a pod lookup, we consider both the IP and the port. We perform the following steps: 1. Do a lookup by IP in the pod podIP index - If only one pod is found then return it 2. 0 or more than 1 pods have the same pod IP 3. Do a lookup by IP in the pod hostIP index - If any number of pods were found, we know that IP maps to a node IP. Therefore, we search for a pod with a matching host Port. If one exists then return it; if not then there is no pod that matches `hostIP:port` 4. The IP does not map to a host IP 5. If multiple pods were found in `1`, then we know there are pods with conflicting podIPs and an error is returned 6. If no pounds were found in `1` then there is no pod that matches `IP:port` --- Aside from the additional IP watcher test being added, this can be tested with the following steps: 1. Create a kind cluster. kind is required because it's pods in `kube-system` have the same pod IPs; this not the case with k3d: `bin/kind create cluster` 2. Install Linkerd with `4445` marked as opaque: `linkerd install --set proxy.opaquePorts="4445" \|kubectl apply -f -` 2. Get the node IP: `kubectl get -o wide nodes` 3. Pull my fork of `tcp-echo`: ``` $ git clone https://github.com/kleimkuhler/tcp-echo ... $ git checkout --track kleimkuhler/host-pod-repro ``` 5. `helm package .` 7. Install `tcp-echo` with the server not injected and correct host IP: `helm install tcp-echo tcp-echo-0.1.0.tgz --set server.linkerdInject="false" --set hostIP="..."` 8. Looking at the client's proxy logs, you should not observe any errors or protocol detection timeouts. 9. Looking at the server logs, you should see all the requests coming through correctly. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-03-18 13:29:43 -04:00
Riccardo Freixo	66b89f55e7	Fix named port resolution mid roll out (#5912 ) (#5911 ) # Problem While rolling out often not all pods will be ready in all the same set of ports, leading the Kubernetes Endpoints API to return multiple subsets, each covering a different set of ports, with the end result that the same address gets repeated across subsets. The old code for endpointsToAddresses would loop through all subsets, and the later occurrences of an address would overwrite previous ones, with the last one prevailing. If the last subset happened to be for an irrelevant port, and the port to be resolved is named, resolveTargetPort would resolve to port 0, which would return port 0 to clients, ultimately leading linkerd-proxy to forward connections to port 0. This only happens if the pods selected by a service expose > 1 port, the service maps to > 1 of these ports, and at least one of these ports is named. # Solution Never write an address to set of addresses if resolved port is 0, which indicates named port resolution failed. # Validation Added a test case. Signed-off-by: Riccardo Freixo <riccardofreixo@gmail.com>	2021-03-17 17:40:11 -04:00
Kevin Leimkuhler	1544d90150	dest: Reduce possible response size in destination service errors (#5893 ) This reduces the possible HTTP response size from the destination service when it encounters an error during a profile lookup. If multiple objects on a cluster share the same IP (such as pods in `kube-system`), the destination service will return an error with the two conflicting pod yamls. In certain cases, these pod yamls can be too large for the HTTP response and the destination pod's proxy will indicate that with the following error: ``` hyper::proto::h2::server: send response error: user error: header too big ``` From the app pod's proxy, this results in the following error: ``` poll_profile: linkerd_service_profiles::client: Could not fetch profile error=status: Unknown, message: "http2 error: protocol error: unexpected internal error encountered" ``` We now only return the conflicting pods (or services) names. This reduces the size of the returned error and fixes these warnings from occurring. Example response error: ``` poll_profile: linkerd_service_profiles::client: Could not fetch profile error=status: FailedPrecondition, message: "Pod IP address conflict: kube-system/kindnet-wsflq, kube-system/kube-scheduler-kind-control-plane", details: [], metadata: MetadataMap { headers: {"content-type": "application/grpc", "date": "Fri, 12 Mar 2021 19:54:09 GMT"} } ``` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-03-16 13:28:05 -04:00
Tarun Pothulapati	5c1a375a51	destination: pass opaque-ports through cmd flag (#5829 ) * destination: pass opaque-ports through cmd flag Fixes #5817 Currently, Default opaque ports are stored at two places i.e `Values.yaml` and also at `opaqueports/defaults.go`. As these ports are used only in destination, We can instead pass these values as a cmd flag for destination component from Values.yaml and remove defaultPorts in `defaults.go`. This means that users if they override `Values.yaml`'s opauePorts field, That change is propogated both for injection and also discovery like expected. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-03-01 16:00:20 +05:30
Kevin Leimkuhler	51a965e228	Return default opaque ports in the destination service (#5814 ) This changes the destination service to always use a default set of opaque ports for pods and services. This is so that after Linkerd is installed onto a cluster, users can benefit from common opaque ports without having to annotate the workloads that serve the applications. After #5810 merges, the proxy containers will be have the default opaque ports `25,443,587,3306,5432,11211`. This value on the proxy container does not affect traffic though; it only configures the proxy. In order for clients and servers to detect opaque protocols and determine opaque transports, the pods and services need to have these annotations. The ports `25,443,587,3306,5432,11211` are now handled opaquely when a pod or service does not have the opaque ports annotation. If the annotation is present with a different value, this is used instead of the default. If the annotation is present but is an empty string, there are no opaque ports for the workload. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-24 14:55:31 -05:00
Kevin Leimkuhler	5bd5db6524	Revert "Rename multicluster annotation prefix and move when possible (#5771 )" (#5813 ) This reverts commit `f9ab867cbc` which renamed the multicluster label name from `mirror.linkerd.io` to `multicluster.linkerd.io`. While this change was made to follow similar namings in other extensions, it complicates the multicluster upgrade process due to the secret creation. `mirror.linkerd.io` is not that important of a label to change and this will allow a smoother upgrade process for `stable-2.10.x` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-24 12:54:52 -05:00
Kevin Leimkuhler	ff93d2d317	Mirror opaque port annotations on services (#5770 ) This change introduces an opaque ports annotation watcher that will send destination profile updates when a service has its opaque ports annotation change. The user facing change introduced by this is that the opaque ports annotation is now required on services when using the multicluster extension. This is because the service mirror will create mirrored services in the source cluster, and destination lookups in the source cluster need to discover that the workloads in the target cluster are opaque protocols. ### Why Closes #5650 ### How The destination server now has a new opaque ports annotation watcher. When a client subscribes to updates for a service name or cluster IP, the `GetProfile` method creates a profile translator stack that passes updates through resource adaptors such as: traffic split adaptor, service profile adaptor, and now opaque ports adaptor. When the annotation on a service changes, the update is passed through to the client where the `opaque_protocol` field will either be set to true or false. A few scenarios to consider are: - If the annotation is removed from the service, the client should receive an update with no opaque ports set. - If the service is deleted, the stream stays open so the client should receive an update with no opaque ports set. - If the service has the annotation added, the client should receive that update. ### Testing Unit test have been added to the watcher as well as the destination server. An integration test has been added that tests the opaque port annotation on a service. For manual testing, using the destination server scripts is easiest: ``` # install Linkerd # start the destination server $ go run controller/cmd/main.go destination -kubeconfig ~/.kube/config # Create a service or namespace with the annotation and inject it # get the destination profile for that service and observe the opaque protocol field $ go run controller/script/destination-client/main.go -method getProfile -path test-svc.default.svc.cluster.local:8080 INFO[0000] fully_qualified_name:"terminus-svc.default.svc.cluster.local" opaque_protocol:true retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"terminus-svc.default.svc.cluster.local.:8080" weight:10000} INFO[0000] INFO[0000] fully_qualified_name:"terminus-svc.default.svc.cluster.local" opaque_protocol:true retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"terminus-svc.default.svc.cluster.local.:8080" weight:10000} INFO[0000] ``` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-23 13:36:17 -05:00
Kevin Leimkuhler	f9ab867cbc	Rename multicluster annotation prefix and move when possible (#5771 ) This renames the multicluster annotation prefix from `mirror.linkerd.io` to `multicluster.linkerd.io` in order to reflect other extension naming patterns. Additionally, it moves labels only used in the Multicluster extension into their own labels file—again to reflect other extensions. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-18 17:10:33 -05:00
Oliver Gould	6dc7efd704	docker: Access container images via cr.l5d.io (#5756 ) We've created a custom domain, `cr.l5d.io`, that redirects to `ghcr.io` (using `scarf.sh`). This custom domain allows us to swap the underlying container registry without impacting users. It also provides us with important metrics about container usage, without collecting PII like IP addresses. This change updates our Helm charts and CLIs to reference this custom domain. The integration test workflow now refers to the new domain, while the release workflow continues to use the `ghcr.io/linkerd` registry for the purpose of publishing images.	2021-02-17 14:31:54 -08:00
Kevin Leimkuhler	5dc662ae97	Remove namespace inheritance of opaque ports annotation (#5739 ) This change removes the namespace inheritance of the opaque ports annotation. Now when setting opaque port related fields in destination profile responses, we only look at the pod annotations. This prepares for #5736 where the proxy-injector will add the annotation from the namespace if the pod does not have it already. Closes #5735 Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-15 10:21:20 -05:00
Filip Petkovski	73f9fb3518	Use a shared informer when getting node topology (#5722 ) Getting information about node topology queries the k8s api directly. In an environment with high traffic and high number of pods, the k8s api server can become overwhelmed or start throttling requests. This MR introduces a node informer to resolve the bottleneck and fetch node information asynchronously. Fixes #5684 Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>	2021-02-12 11:05:38 -05:00
Kevin Leimkuhler	75fcc9d623	Move tap from core into Viz extension (#5651 ) Closes #5545. This change moves all tap and tap-injector code into the viz directory. The tap and tap-injector components now also use a new tap image—separating these components from the controller image that they are currently part of. This means the controller image has removed all its build dependencies related to tap. Finally, the tap Protobuf has been separated from the metrics-api and moved into it's own `.proto` file and gen directory. This introduces a clear split between metrics-api and tap Protobuf. There is no change in behavior for the `viz tap` command. ### Reviewing #### Docker images All the bin directory scripts should be updated to build and load the tap image. All the CI workflows should be updated to build and push the tap image. #### Controller and pkg directories This is primarily deletions. Most of the deleted code in this directory is now in the tap directory of the Viz extension. #### viz/tap This is the location that all the tap related code now lives in. New files are mostly moved from the controller and pkg directories. Imports have all been updated to point at the right locations and Protobuf. The Protobuf here is taken from metrics-api and contains all tap-related Protobuf. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-09 12:43:21 -05:00
Alejandro Pedraza	8ac5360041	Extract from public-api all the Prometheus dependencies, and moves things into a new viz component 'linkerd-metrics-api' (#5560 ) * Protobuf changes: - Moved `healthcheck.proto` back from viz to `proto/common` as it remains being used by the main `healthcheck.go` library (it was moved to viz by #5510). - Extracted from `viz.proto` the IP-related types and put them in `/controller/gen/common/net` to be used by both the public and the viz APIs. * Added chart templates for new viz linkerd-metrics-api pod * Spin-off viz healthcheck: - Created `viz/pkg/healthcheck/healthcheck.go` that wraps the original `pkg/healthcheck/healthcheck.go` while adding the `vizNamespace` and `vizAPIClient` fields which were removed from the core `healthcheck`. That way the core healthcheck doesn't have any dependencies on viz, and viz' healthcheck can now be used to retrieve viz api clients. - The core and viz healthcheck libs are now abstracted out via the new `healthcheck.Runner` interface. - Refactored the data plane checks so they don't rely on calling `ListPods` - The checks in `viz/cmd/check.go` have been moved to `viz/pkg/healthcheck/healthcheck.go` as well, so `check.go`'s sole responsibility is dealing with command business. This command also now retrieves its viz api client through viz' healthcheck. * Removed linkerd-controller dependency on Prometheus: - Removed the `global.prometheusUrl` config in the core values.yml. - Leave the Heartbeat's `-prometheus` flag hard-coded temporarily. TO-DO: have it automatically discover viz and pull Prometheus' endpoint (#5352). * Moved observability gRPC from linkerd-controller to viz: - Created a new gRPC server under `viz/metrics-api` moving prometheus-dependent functions out of the core gRPC server and into it (same thing for the accompaigning http server). - Did the same for the `PublicAPIClient` (now called just `Client`) interface. The `VizAPIClient` interface disappears as it's enough to just rely on the viz `ApiClient` protobuf type. - Moved the other files implementing the rest of the gRPC functions from `controller/api/public` to `viz/metrics-api` (`edge.go`, `stat_summary.go`, etc.). - Also simplified some type names to avoid stuttering. * Added linkerd-metrics-api bootstrap files. At the same time, we strip out of the public-api's `main.go` file the prometheus parameters and other no longer relevant bits. * linkerd-web updates: it requires connecting with both the public-api and the viz api, so both addresses (and the viz namespace) are now provided as parameters to the container. * CLI updates and other minor things: - Changes to command files under `cli/cmd`: - Updated `endpoints.go` according to new API interface name. - Updated `version.go`, `dashboard` and `uninstall.go` to pull the viz namespace dynamically. - Changes to command files under `viz/cmd`: - `edges.go`, `routes.go`, `stat.go` and `top.go`: point to dependencies that were moved from public-api to viz. - Other changes to have tests pass: - Added `metrics-api` to list of docker images to build in actions workflows. - In `bin/fmt` exclude protobuf generated files instead of entire directories because directories could contain both generated and non-generated code (case in point: `viz/metrics-api`). * Add retry to 'tap API service is running' check * mc check shouldn't err when viz is not available. Also properly set the log in multicluster/cmd/root.go so that it properly displays messages when --verbose is used	2021-01-21 18:26:38 -05:00
Oleh Ozimok	c416e78261	destination: Fix crash when EndpointSlices are enabled (#5543 ) The Destination controller can panic due to a nil-deref when the EndpointSlices API is enabled. This change updates the controller to properly initialize values to avoid this segmentation fault. Fixes #5521 Signed-off-by: Oleg Ozimok <oleg.ozimok@corp.kismia.com>	2021-01-15 12:52:11 -08:00
Alejandro Pedraza	f3b1ebfa99	Separate observability API (#5510 ) * Separate observability API Closes #5312 This is a preliminary step towards moving all the observability API into `/viz`, by first moving its protobuf into `viz/metrics-api`. This should facilitate review as the go files are not moved yet, which will happen in a followup PR. There are no user-facing changes here. - Moved `proto/common/healthcheck.proto` to `viz/metrics-api/proto/healthcheck.prot` - Moved the contents of `proto/public.proto` to `viz/metrics-api/proto/viz.proto` except for the `Version` Stuff. - Merged `proto/controller/tap.proto` into `viz/metrics-api/proto/viz.proto` - `grpc_server.go` now temporarily exposes `PublicAPIServer` and `VizAPIServer` interfaces to separate both APIs. This will get properly split in a followup. - The web server provides handlers for both interfaces. - `cli/cmd/public_api.go` and `pkg/healthcheck/healthcheck.go` temporarily now have methods to access both APIs. - Most of the CLI commands will use the Viz API, except for `version`. The other changes in the go files are just changes in the imports to point to the new protobufs. Other minor changes: - Removed `git add controller/gen` from `bin/protoc-go.sh`	2021-01-13 14:34:54 -05:00
Filip Petkovski	40192e258a	Ignore pods with status.phase=Succeeded when watching IP addresses (#5412 ) Ignore pods with status.phase=Succeeded when watching IP addresses When a pod terminates successfully, some CNIs will assign its IP address to newly created pods. This can lead to duplicate pod IPs in the same Kubernetes cluster. Filter out pods which are in a Succeeded phase since they are not routable anymore. Fixes #5394 Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>	2021-01-12 12:25:37 -05:00
Tarun Pothulapati	e04647fb8d	remove prom check for public-api self-check (#5436 ) Currently, public-api is part of the core control-plane where the prom check fails when ran before the viz extension is installed. This change comments out that check, Once metrics api is moved into viz, maybe this check can be part of it instead or directly part of `linkerd viz check`. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> Co-authored-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-01-05 17:22:39 -05:00
Alejandro Pedraza	d3d7f4e2e2	Destination should return `OpaqueTransport` hint when annotation matches resolved target port (#5458 ) The destination service now returns `OpaqueTransport` hint when the annotation matches the resolve target port. This is different from the current behavior which always sets the hint when a proxy is present. Closes #5421 This happens by changing the endpoint watcher to set a pod's opaque port annotation in certain cases. If the pod already has an annotation, then its value is used. If the pod has no annotation, then it checks the namespace that the endpoint belongs to; if it finds an annotation on the namespace then it overrides the pod's annotation value with that. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-01-05 14:54:55 -05:00
Kevin Leimkuhler	b830efdad7	Add OpaqueTransport field to destination protocol hints (#5421 ) ## What When the destination service returns a destination profile for an endpoint, indicate if the endpoint can receive opaque traffic. ## Why Closes #5400 ## How When translating a pod address to a destination profile, the destination service checks if the pod is controlled by any linkerd control plane. If it is, it can set a protocol hint where we indicate that it supports H2 and opaque traffic. If the pod supports opaque traffic, we need to get the port that it expects inbound traffic on. We do this by getting the proxy container and reading it's `LINKERD2_PROXY_INBOUND_LISTEN_ADDR` environment variable. If we successfully parse that into a port, we can set the opaque transport field in the destination profile. ## Testing A test has been added to the destination server where a pod has a `linkerd-proxy` container. We can expect the `OpaqueTransport` field to be set in the returned destination profile's protocol hint. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-12-23 11:06:39 -05:00
Kevin Leimkuhler	7c0843a823	Add opaque ports to destination service updates (#5294 ) ## Summary This changes the destination service to start indicating whether a profile is an opaque protocol or not. Currently, profiles returned by the destination service are built by chaining together updates coming from watching Profile and Traffic Split updates. With this change, we now also watch updates to Opaque Port annotations on pods and namespaces; if an update occurs this is now included in building a profile update and is sent to the client. ## Details Watching updates to Profiles and Traffic Splits is straightforward--we watch those resources and if an update occurs on one associated to a service we care about then the update is passed through. For Opaque Ports this is a little different because it is an annotation on pods or namespaces. To account for this, we watch the endpoints that we should care about. ### When host is a Pod IP When getting the profile for a Pod IP, we check for the opaque ports annotation on the pod and the pod's namespace. If one is found, we'll indicate if the profile is an opaque protocol if the requested port is in the annotation. We do not subscribe for updates to this pod IP. The only update we really care about is if the pod is deleted and this is already handled by the proxy. ### When host is a Service When getting the profile for a Service, we subscribe for updates to the endpoints of that service. For any ports set in the opaque ports annotation on any of the pods, we check if the requested port is present. Since the endpoints for a service can be added and removed, we do subscribe for updates to the endpoints of the service. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-12-18 12:38:59 -05:00
Kevin Leimkuhler	ca86a31816	Add destination service tests for the IP path (#5266 ) This adds additional tests for the destination service that assert `GetProfile` behavior when the path is an IP address. 1. Assert that when the path is a cluster IP, the configured service profile is returned. 2. Assert that when the path a pod IP, the endpoint field is populated in the service profile returned. 3. Assert that when the path is not a cluster or pod IP, the default service profile is returned. 4. Assert that when path is a pod IP with or without the controller annotation, the endpoint has or does not have a protocol hint Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-11-23 13:17:05 -05:00
Kevin Leimkuhler	92f9387997	Check correct label value when setting protocl hint (#5267 ) This fixes an issue where the protocol hint is always set on endpoint responses. We now check the right value which determines if the pod has the required label. A test for this has been added to #5266. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-11-20 13:32:50 -08:00
Oliver Gould	375ffd782f	proxy: v2.121.0 (#5253 ) This release changes error handling to teardown the server-side connection when an unexpected error is encountered. Additionally, the outbound TCP routing stack can now skip redundant service discovery lookups when profile responses include endpoint information. Finally, the cache implementation has been updated to reduce latency by removing unnecessary buffers. --- * h2: enable HTTP/2 keepalive PING frames (linkerd/linkerd2-proxy#737) * actions: Add timeouts to GitHub actions (linkerd/linkerd2-proxy#738) * outbound: Skip endpoint resolution on profile hint (linkerd/linkerd2-proxy#736) * Add a FromStr for dns::Name (linkerd/linkerd2-proxy#746) * outbound: Avoid redundant TCP endpoint resolution (linkerd/linkerd2-proxy#742) * cache: Make the cache cloneable with RwLock (linkerd/linkerd2-proxy#743) * http: Teardown serverside connections on error (linkerd/linkerd2-proxy#747)	2020-11-18 16:55:53 -08:00
Kevin Leimkuhler	e65f216d52	Add endpoint to GetProfile response (#5227 ) Context: #5209 This updates the destination service to set the `Endpoint` field in `GetProfile` responses. The `Endpoint` field is only set if the IP maps to a Pod--not a Service. Additionally in this scenario, the default Service Profile is used as the base profile so no other significant fields are set. ### Examples ``` # GetProfile for an IP that maps to a Service ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.43.222.0:9090 INFO[0000] fully_qualified_name:"linkerd-prometheus.linkerd.svc.cluster.local" retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"linkerd-prometheus.linkerd.svc.cluster.local.:9090" weight:10000} ``` Before: ``` # GetProfile for an IP that maps to a Pod ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.20 INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} ``` After: ``` # GetProfile for an IP that maps to a Pod ❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.20 INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} endpoint:{addr:{ip:{ipv4:170524692}} weight:10000 metric_labels:{key:"control_plane_ns" value:"linkerd"} metric_labels:{key:"deployment" value:"fast-1"} metric_labels:{key:"pod" value:"fast-1-5cc87f64bc-9hx7h"} metric_labels:{key:"pod_template_hash" value:"5cc87f64bc"} metric_labels:{key:"serviceaccount" value:"default"} tls_identity:{dns_like_identity:{name:"default.default.serviceaccount.identity.linkerd.cluster.local"}} protocol_hint:{h2:{}}} ``` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-11-18 15:41:25 -05:00
Tarun Pothulapati	4c106e9c08	cli: make check return SkipError when there is no prometheus configured (#5150 ) Fixes #5143 The availability of prometheus is useful for some calls in public-api that the check uses. This change updates the ListPods in public-api to still return the pods even when prometheus is not configured. For a test that exclusively checks for prometheus metrics, we have a gate which checks if a prometheus is configured and skips it othervise. Signed-off-by: Tarun Pothulapati tarunpothulapati@outlook.com	2020-10-29 19:57:11 +05:30
Alex Leong	4f34ce8e2f	Empty the stored addresses when the endpoint translator gets a NoEndpoints message (#5126 ) Signed-off-by: Alex Leong <alex@buoyant.io>	2020-10-22 17:01:03 -07:00

1 2 3 4 5 ...

308 Commits