Commit Graph

90 Commits

Author SHA1 Message Date
Bruce Chen Wenliang b84b2077d3
Ignore pods in "Terminating" when watching IP addresses. (#5940)
Fixes #5939

Some CNIs reassign the IP of a terminating pod to a new pod, which
leads to duplicate IPs in the cluster.

It eventually triggers #5939.

This commit makes the IPWatcher filter out terminating pods (pods that
have a deletionTimestamp set) when resolving an IP.

The issue is hard to reproduce because we cannot manually assign a
particular IP to a pod.
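
For illustration, a minimal sketch of the filter, using a hypothetical `podTerminating` helper and standard client-go types rather than the actual IPWatcher code:

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// podTerminating reports whether a pod has been marked for deletion.
// Such pods are skipped when indexing IP addresses, because some CNIs
// reassign their IPs to new pods before the old pods fully exit.
func podTerminating(pod *corev1.Pod) bool {
	return pod.DeletionTimestamp != nil
}

func main() {
	now := metav1.Now()
	running := &corev1.Pod{}
	terminating := &corev1.Pod{ObjectMeta: metav1.ObjectMeta{DeletionTimestamp: &now}}
	fmt.Println(podTerminating(running), podTerminating(terminating)) // false true
}
```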

Signed-off-by: Bruce <wenliang.chen@personio.de>

Co-authored-by: Bruce <wenliang.chen@personio.de>
2021-03-24 18:21:42 +05:30
Kevin Leimkuhler 3f72c998b3
Handle pod lookups for pods that map to a host IP and host port (#5904)
This fixes an issue where pod lookups by host IP and host port fail even though
the cluster has a matching pod.

Usually these manifested as `FailedPrecondition` errors, but the messages were
too long and resulted in http/2 errors. This change depends on #5893 which fixes
that separate issue.

This change also reduces how often those `FailedPrecondition` errors occur in
the first place: the destination service now considers pod host IPs, which
should lower their frequency.

Closes #5881 

---

Lookups like this happen when a pod is created with a host IP and host port set
in its spec. The pod still has a pod IP when running, but requests to
`hostIP:hostPort` are also redirected to the pod. Combinations of host IP and
host port are unique within the cluster and enforced by Kubernetes.

Currently, the destination service fails to find pods in this scenario because
we only keep an index of pods by their pod IPs, not by their host IPs. To fix
this, we now also keep an index of pods by their host IPs, if and only if they
have a host IP set.

Now when doing a pod lookup, we consider both the IP and the port. We perform
the following steps (a sketch of this flow follows the list):

1. Look up the IP in the pod podIP index
  - If exactly one pod is found, return it
2. Otherwise, zero or more than one pod shares that pod IP
3. Look up the IP in the pod hostIP index
  - If any pods are found, we know the IP maps to a node IP. We then search
    those pods for one with a matching host port; if one exists, return it, and
    if not, there is no pod that matches `hostIP:port`
4. Otherwise, the IP does not map to a host IP
5. If multiple pods were found in step `1`, then we know there are pods with
   conflicting pod IPs and an error is returned
6. If no pods were found in step `1`, then there is no pod that matches `IP:port`
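
For illustration, a rough sketch of that decision flow, using hypothetical `podIPIndex` and `hostIPIndex` maps in place of the real indexers:

```go
package lookup

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
)

// findPod sketches the lookup order described above: prefer a unique pod IP
// match, then fall back to the host IP index and match on host port.
func findPod(ip string, port uint32, podIPIndex, hostIPIndex map[string][]*corev1.Pod) (*corev1.Pod, error) {
	byPodIP := podIPIndex[ip]
	if len(byPodIP) == 1 {
		return byPodIP[0], nil // unique pod IP match
	}

	if byHostIP := hostIPIndex[ip]; len(byHostIP) > 0 {
		// The IP is a node IP: look for a container whose hostPort matches.
		for _, pod := range byHostIP {
			for _, c := range pod.Spec.Containers {
				for _, p := range c.Ports {
					if uint32(p.HostPort) == port {
						return pod, nil
					}
				}
			}
		}
		return nil, fmt.Errorf("no pod found for %s:%d", ip, port)
	}

	if len(byPodIP) > 1 {
		return nil, fmt.Errorf("conflicting pod IPs for %s", ip)
	}
	return nil, fmt.Errorf("no pod found for %s:%d", ip, port)
}
```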

---

Aside from the additional IP watcher test being added, this can be tested with
the following steps:

1. Create a kind cluster. kind is required because its pods in `kube-system`
   have the same pod IPs; this is not the case with k3d: `bin/kind create cluster`
2. Install Linkerd with `4445` marked as opaque: `linkerd install --set
   proxy.opaquePorts="4445" | kubectl apply -f -`
3. Get the node IP: `kubectl get -o wide nodes`
4. Pull my fork of `tcp-echo`:

```
$ git clone https://github.com/kleimkuhler/tcp-echo
...
$ git checkout --track kleimkuhler/host-pod-repro
```

5. `helm package .`
6. Install `tcp-echo` with the server not injected and the correct host IP: `helm
   install tcp-echo tcp-echo-0.1.0.tgz --set server.linkerdInject="false" --set
   hostIP="..."`
7. Looking at the client's proxy logs, you should not observe any errors or
   protocol detection timeouts.
8. Looking at the server's logs, you should see all the requests coming through
   correctly.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-03-18 13:29:43 -04:00
Riccardo Freixo 66b89f55e7
Fix named port resolution mid roll out (#5912) (#5911)
# Problem

While rolling out, often not all pods are ready on the same set of ports,
leading the Kubernetes Endpoints API to return multiple subsets, each covering
a different set of ports, with the end result that the same address gets
repeated across subsets.

The old code for endpointsToAddresses would loop through all subsets, and
later occurrences of an address would overwrite earlier ones, with the last
one prevailing.

If the last subset happened to be for an irrelevant port, and the port to
be resolved is named, resolveTargetPort would resolve to port 0, which would
return port 0 to clients, ultimately leading linkerd-proxy to forward
connections to port 0.

This only happens if the pods selected by a service expose > 1 port, the
service maps to > 1 of these ports, and at least one of these ports is named.

# Solution

Never write an address to the set of addresses if the resolved port is 0, which
indicates that named port resolution failed.
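
A minimal illustration of the guard, with a hypothetical address map keyed by `ip:port` rather than the actual endpointsToAddresses code:

```go
package endpoints

import "fmt"

// Address is a simplified stand-in for the watcher's address type.
type Address struct {
	IP   string
	Port uint32
}

// addAddress skips endpoints whose named port failed to resolve (port 0), so
// a later, irrelevant subset can no longer overwrite a good address with
// port 0.
func addAddress(addrs map[string]Address, ip string, port uint32) {
	if port == 0 {
		return // named port did not resolve; never publish port 0
	}
	addrs[fmt.Sprintf("%s:%d", ip, port)] = Address{IP: ip, Port: port}
}
```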

# Validation

Added a test case.

Signed-off-by: Riccardo Freixo <riccardofreixo@gmail.com>
2021-03-17 17:40:11 -04:00
Kevin Leimkuhler 1544d90150
dest: Reduce possible response size in destination service errors (#5893)
This reduces the possible HTTP response size from the destination service when
it encounters an error during a profile lookup.

If multiple objects on a cluster share the same IP (such as pods in
`kube-system`), the destination service will return an error containing the
conflicting pod YAMLs.

In certain cases, these pod yamls can be too large for the HTTP response and the
destination pod's proxy will indicate that with the following error:

```
hyper::proto::h2::server: send response error: user error: header too big
```

From the app pod's proxy, this results in the following error:

```
poll_profile: linkerd_service_profiles::client: Could not fetch profile error=status: Unknown, message: "http2 error: protocol error: unexpected internal error encountered"
```

We now return only the names of the conflicting pods (or services). This
reduces the size of the returned error and prevents these warnings from
occurring.

Example response error:

```
poll_profile: linkerd_service_profiles::client: Could not fetch profile error=status: FailedPrecondition, message: "Pod IP address conflict: kube-system/kindnet-wsflq, kube-system/kube-scheduler-kind-control-plane", details: [], metadata: MetadataMap { headers: {"content-type": "application/grpc", "date": "Fri, 12 Mar 2021 19:54:09 GMT"} }
```
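
A sketch of how such a compact error could be built from the conflicting pod names (hypothetical helper, not the actual destination code):

```go
package destination

import (
	"fmt"
	"strings"

	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
	corev1 "k8s.io/api/core/v1"
)

// podConflictError returns a FailedPrecondition whose message lists only the
// namespace/name of each conflicting pod, keeping the gRPC response small.
func podConflictError(pods []*corev1.Pod) error {
	names := make([]string, 0, len(pods))
	for _, p := range pods {
		names = append(names, fmt.Sprintf("%s/%s", p.Namespace, p.Name))
	}
	return status.Errorf(codes.FailedPrecondition,
		"Pod IP address conflict: %s", strings.Join(names, ", "))
}
```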

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-03-16 13:28:05 -04:00
Tarun Pothulapati 5c1a375a51
destination: pass opaque-ports through cmd flag (#5829)
* destination: pass opaque-ports through cmd flag

Fixes #5817

Currently, the default opaque ports are stored in two places, i.e.
`Values.yaml` and `opaqueports/defaults.go`. As these ports are used only in
the destination component, we can instead pass these values from `Values.yaml`
as a cmd flag to the destination component and remove defaultPorts from
`defaults.go`.

This means that if users override the `opaquePorts` field in `Values.yaml`,
that change is propagated both to injection and to discovery, as expected.
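
A minimal sketch of the idea, assuming a hypothetical `--default-opaque-ports` flag name (the actual flag name may differ):

```go
package main

import (
	"flag"
	"fmt"
	"strings"
)

func main() {
	// The default value would be rendered from Values.yaml by the Helm
	// templates and passed to the destination container at startup.
	opaquePorts := flag.String("default-opaque-ports", "25,443,587,3306,5432,11211",
		"comma-separated ports to treat as opaque by default")
	flag.Parse()

	ports := map[string]struct{}{}
	for _, p := range strings.Split(*opaquePorts, ",") {
		ports[strings.TrimSpace(p)] = struct{}{}
	}
	fmt.Println(ports)
}
```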

Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>
2021-03-01 16:00:20 +05:30
Kevin Leimkuhler 51a965e228
Return default opaque ports in the destination service (#5814)
This changes the destination service to always use a default set of opaque ports
for pods and services. This is so that after Linkerd is installed onto a
cluster, users can benefit from common opaque ports without having to annotate
the workloads that serve the applications.

After #5810 merges, the proxy containers will have the default opaque ports
`25,443,587,3306,5432,11211`. This value on the proxy container does not affect
traffic though; it only configures the proxy.

In order for clients and servers to detect opaque protocols and determine opaque
transports, the pods and services need to have these annotations.

The ports `25,443,587,3306,5432,11211` are now handled opaquely when a pod or
service does not have the opaque ports annotation. If the annotation is present
with a different value, this is used instead of the default. If the annotation
is present but is an empty string, there are no opaque ports for the workload.
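
A sketch of that precedence, assuming the `config.linkerd.io/opaque-ports` annotation key and a hypothetical helper:

```go
package opaqueports

import "strings"

// defaultOpaquePorts mirrors the defaults listed above.
const defaultOpaquePorts = "25,443,587,3306,5432,11211"

// opaquePortsFor applies the precedence described above: an explicit
// annotation wins, an empty annotation means no opaque ports, and a missing
// annotation falls back to the defaults.
func opaquePortsFor(annotations map[string]string) []string {
	v, ok := annotations["config.linkerd.io/opaque-ports"]
	if !ok {
		v = defaultOpaquePorts
	}
	if v == "" {
		return nil
	}
	return strings.Split(v, ",")
}
```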

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-02-24 14:55:31 -05:00
Kevin Leimkuhler 5bd5db6524
Revert "Rename multicluster annotation prefix and move when possible (#5771)" (#5813)
This reverts commit f9ab867cbc which renamed the
multicluster label name from `mirror.linkerd.io` to `multicluster.linkerd.io`.

While this change was made to follow similar namings in other extensions, it
complicates the multicluster upgrade process due to the secret creation.

`mirror.linkerd.io` is not that important a label to change, and keeping it
will allow a smoother upgrade process for `stable-2.10.x`.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-02-24 12:54:52 -05:00
Kevin Leimkuhler ff93d2d317
Mirror opaque port annotations on services (#5770)
This change introduces an opaque ports annotation watcher that will send
destination profile updates when a service has its opaque ports annotation
change.

The user facing change introduced by this is that the opaque ports annotation is
now required on services when using the multicluster extension. This is because
the service mirror will create mirrored services in the source cluster, and
destination lookups in the source cluster need to discover that the workloads in
the target cluster are opaque protocols.

### Why

Closes #5650

### How

The destination server now has a new opaque ports annotation watcher. When a
client subscribes to updates for a service name or cluster IP, the `GetProfile`
method creates a profile translator stack that passes updates through resource
adaptors such as the traffic split adaptor, the service profile adaptor, and
now the opaque ports adaptor.

When the annotation on a service changes, the update is passed through to the
client where the `opaque_protocol` field will either be set to true or false.

A few scenarios to consider are:

  - If the annotation is removed from the service, the client should receive
    an update with no opaque ports set.
  - If the service is deleted, the stream stays open so the client should
    receive an update with no opaque ports set.
  - If the service has the annotation added, the client should receive that
    update.

### Testing

Unit tests have been added to the watcher as well as the destination server.

An integration test has been added that tests the opaque port annotation on a
service.

For manual testing, using the destination server scripts is easiest:

```
# install Linkerd

# start the destination server
$ go run controller/cmd/main.go destination -kubeconfig ~/.kube/config

# Create a service or namespace with the annotation and inject it

# get the destination profile for that service and observe the opaque protocol field
$ go run controller/script/destination-client/main.go -method getProfile -path test-svc.default.svc.cluster.local:8080
INFO[0000] fully_qualified_name:"terminus-svc.default.svc.cluster.local" opaque_protocol:true retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"terminus-svc.default.svc.cluster.local.:8080" weight:10000} 
INFO[0000]                                              
INFO[0000] fully_qualified_name:"terminus-svc.default.svc.cluster.local" opaque_protocol:true retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"terminus-svc.default.svc.cluster.local.:8080" weight:10000} 
INFO[0000]
```

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-02-23 13:36:17 -05:00
Kevin Leimkuhler f9ab867cbc
Rename multicluster annotation prefix and move when possible (#5771)
This renames the multicluster annotation prefix from `mirror.linkerd.io` to
`multicluster.linkerd.io` in order to reflect other extension naming patterns.

Additionally, it moves labels only used in the Multicluster extension into their
own labels file—again to reflect other extensions.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-02-18 17:10:33 -05:00
Kevin Leimkuhler 5dc662ae97
Remove namespace inheritance of opaque ports annotation (#5739)
This change removes the namespace inheritance of the opaque ports annotation.
Now when setting opaque port related fields in destination profile responses, we
only look at the pod annotations.

This prepares for #5736 where the proxy-injector will add the annotation from
the namespace if the pod does not have it already.

Closes #5735

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-02-15 10:21:20 -05:00
Filip Petkovski 73f9fb3518
Use a shared informer when getting node topology (#5722)
Getting information about node topology currently queries the k8s API
directly. In an environment with high traffic and a high number of pods,
the k8s API server can become overwhelmed or start throttling requests.

This MR introduces a node informer to resolve the bottleneck and
fetch node information asynchronously.
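
A minimal sketch of replacing direct API calls with a shared node informer (standard client-go wiring, not the exact controller code):

```go
package main

import (
	"fmt"
	"time"

	"k8s.io/apimachinery/pkg/labels"
	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(config)

	// The shared informer keeps a local node cache that is updated
	// asynchronously, so topology lookups no longer hit the API server.
	factory := informers.NewSharedInformerFactory(client, 10*time.Minute)
	nodeLister := factory.Core().V1().Nodes().Lister()

	stop := make(chan struct{})
	defer close(stop)
	factory.Start(stop)
	factory.WaitForCacheSync(stop)

	nodes, err := nodeLister.List(labels.Everything())
	if err != nil {
		panic(err)
	}
	for _, n := range nodes {
		fmt.Println(n.Name, n.Labels["topology.kubernetes.io/zone"])
	}
}
```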

Fixes #5684

Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>
2021-02-12 11:05:38 -05:00
Oleh Ozimok c416e78261
destination: Fix crash when EndpointSlices are enabled (#5543)
The Destination controller can panic due to a nil-deref when
the EndpointSlices API is enabled.

This change updates the controller to properly initialize values
to avoid this segmentation fault.

Fixes #5521

Signed-off-by: Oleg Ozimok <oleg.ozimok@corp.kismia.com>
2021-01-15 12:52:11 -08:00
Filip Petkovski 40192e258a
Ignore pods with status.phase=Succeeded when watching IP addresses (#5412)
Ignore pods with status.phase=Succeeded when watching IP addresses

When a pod terminates successfully, some CNIs will assign its IP address
to newly created pods. This can lead to duplicate pod IPs in the same
Kubernetes cluster.

Filter out pods which are in a Succeeded phase since they are not 
routable anymore.
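
The check itself is small; a sketch with a hypothetical helper:

```go
package watcher

import corev1 "k8s.io/api/core/v1"

// skipSucceeded reports whether a pod is in the Succeeded phase and should be
// ignored when indexing IPs, since its IP may already have been reassigned by
// the CNI to a newly created pod.
func skipSucceeded(pod *corev1.Pod) bool {
	return pod.Status.Phase == corev1.PodSucceeded
}
```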

Fixes #5394

Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>
2021-01-12 12:25:37 -05:00
Alejandro Pedraza d3d7f4e2e2
Destination should return `OpaqueTransport` hint when annotation matches resolved target port (#5458)
The destination service now returns the `OpaqueTransport` hint when the
annotation matches the resolved target port. This differs from the current
behavior, which always sets the hint when a proxy is present.

Closes #5421

This works by changing the endpoint watcher to set a pod's opaque ports
annotation in certain cases. If the pod already has the annotation, its value
is used. If the pod has no annotation, the watcher checks the namespace that
the endpoint belongs to; if it finds an annotation on the namespace, it
overrides the pod's annotation value with that.
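
A simplified sketch of the pod-then-namespace fallback, assuming the `config.linkerd.io/opaque-ports` key (not the actual watcher code):

```go
package watcher

import corev1 "k8s.io/api/core/v1"

const opaquePortsAnnotation = "config.linkerd.io/opaque-ports"

// opaquePortsAnnotationFor returns the pod's own annotation when it is set
// and otherwise falls back to the annotation on the pod's namespace.
func opaquePortsAnnotationFor(pod *corev1.Pod, ns *corev1.Namespace) string {
	if v, ok := pod.Annotations[opaquePortsAnnotation]; ok {
		return v
	}
	return ns.Annotations[opaquePortsAnnotation]
}
```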

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2021-01-05 14:54:55 -05:00
Kevin Leimkuhler b830efdad7
Add OpaqueTransport field to destination protocol hints (#5421)
## What

When the destination service returns a destination profile for an endpoint,
indicate if the endpoint can receive opaque traffic.

## Why

Closes #5400

## How

When translating a pod address to a destination profile, the destination service
checks if the pod is controlled by any linkerd control plane. If it is, it can
set a protocol hint where we indicate that it supports H2 and opaque traffic.

If the pod supports opaque traffic, we need to get the port on which it expects
inbound traffic. We do this by getting the proxy container and reading its
`LINKERD2_PROXY_INBOUND_LISTEN_ADDR` environment variable. If we successfully
parse that into a port, we can set the opaque transport field in the destination
profile.
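
A sketch of that parsing step (hypothetical helper; container and variable names as described above):

```go
package destination

import (
	"fmt"
	"net"
	"strconv"

	corev1 "k8s.io/api/core/v1"
)

// inboundPort extracts the proxy's inbound port from the
// LINKERD2_PROXY_INBOUND_LISTEN_ADDR environment variable of the
// linkerd-proxy container.
func inboundPort(pod *corev1.Pod) (uint32, error) {
	for _, c := range pod.Spec.Containers {
		if c.Name != "linkerd-proxy" {
			continue
		}
		for _, env := range c.Env {
			if env.Name != "LINKERD2_PROXY_INBOUND_LISTEN_ADDR" {
				continue
			}
			_, portStr, err := net.SplitHostPort(env.Value)
			if err != nil {
				return 0, err
			}
			port, err := strconv.ParseUint(portStr, 10, 32)
			if err != nil {
				return 0, err
			}
			return uint32(port), nil
		}
	}
	return 0, fmt.Errorf("no proxy inbound port found on pod %s", pod.Name)
}
```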

## Testing

A test has been added to the destination server where a pod has a
`linkerd-proxy` container. We can expect the `OpaqueTransport` field to be set
in the returned destination profile's protocol hint.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-12-23 11:06:39 -05:00
Kevin Leimkuhler 7c0843a823
Add opaque ports to destination service updates (#5294)
## Summary

This changes the destination service to start indicating whether a profile is an
opaque protocol or not.

Currently, profiles returned by the destination service are built by chaining
together updates coming from watching Profile and Traffic Split updates.

With this change, we now also watch updates to Opaque Port annotations on pods
and namespaces; if an update occurs this is now included in building a profile
update and is sent to the client.

## Details

Watching updates to Profiles and Traffic Splits is straightforward--we watch
those resources, and if an update occurs on one associated with a service we
care about, the update is passed through.

For Opaque Ports this is a little different because it is an annotation on pods
or namespaces. To account for this, we watch the endpoints that we should care
about.

### When host is a Pod IP

When getting the profile for a Pod IP, we check for the opaque ports annotation
on the pod and the pod's namespace. If one is found, the profile is marked as
an opaque protocol when the requested port is listed in the annotation.

We do not subscribe for updates to this pod IP. The only update we really care
about is if the pod is deleted and this is already handled by the proxy.

### When host is a Service

When getting the profile for a Service, we subscribe for updates to the
endpoints of that service. For any ports set in the opaque ports annotation on
any of the pods, we check if the requested port is present.

Since the endpoints for a service can be added and removed, we do subscribe for
updates to the endpoints of the service.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-12-18 12:38:59 -05:00
Kevin Leimkuhler ca86a31816
Add destination service tests for the IP path (#5266)
This adds additional tests for the destination service that assert `GetProfile`
behavior when the path is an IP address.

1. Assert that when the path is a cluster IP, the configured service profile is
   returned.
2. Assert that when the path is a pod IP, the endpoint field is populated in the
   service profile returned.
3. Assert that when the path is not a cluster or pod IP, the default service
   profile is returned.
4. Assert that when the path is a pod IP with or without the controller
   annotation, the endpoint has or does not have a protocol hint.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-11-23 13:17:05 -05:00
Kevin Leimkuhler 92f9387997
Check correct label value when setting protocol hint (#5267)
This fixes an issue where the protocol hint is always set on endpoint responses.
We now check the right value, which determines whether the pod has the required label.

A test for this has been added to #5266.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-11-20 13:32:50 -08:00
Oliver Gould 375ffd782f
proxy: v2.121.0 (#5253)
This release changes error handling to tear down the server-side
connection when an unexpected error is encountered.

Additionally, the outbound TCP routing stack can now skip redundant
service discovery lookups when profile responses include endpoint
information.

Finally, the cache implementation has been updated to reduce latency by
removing unnecessary buffers.

---

* h2: enable HTTP/2 keepalive PING frames (linkerd/linkerd2-proxy#737)
* actions: Add timeouts to GitHub actions (linkerd/linkerd2-proxy#738)
* outbound: Skip endpoint resolution on profile hint (linkerd/linkerd2-proxy#736)
* Add a FromStr for dns::Name (linkerd/linkerd2-proxy#746)
* outbound: Avoid redundant TCP endpoint resolution (linkerd/linkerd2-proxy#742)
* cache: Make the cache cloneable with RwLock (linkerd/linkerd2-proxy#743)
* http: Teardown serverside connections on error (linkerd/linkerd2-proxy#747)
2020-11-18 16:55:53 -08:00
Kevin Leimkuhler e65f216d52
Add endpoint to GetProfile response (#5227)
Context: #5209

This updates the destination service to set the `Endpoint` field in `GetProfile`
responses.

The `Endpoint` field is only set if the IP maps to a Pod--not a Service.

Additionally in this scenario, the default Service Profile is used as the base
profile so no other significant fields are set.

### Examples

```
# GetProfile for an IP that maps to a Service
❯ go run controller/script/destination-client/main.go -method getProfile -path 10.43.222.0:9090
INFO[0000] fully_qualified_name:"linkerd-prometheus.linkerd.svc.cluster.local"  retry_budget:{retry_ratio:0.2  min_retries_per_second:10  ttl:{seconds:10}}  dst_overrides:{authority:"linkerd-prometheus.linkerd.svc.cluster.local.:9090"  weight:10000}
```

Before:

```
# GetProfile for an IP that maps to a Pod
❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.20
INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}}
```


After:

```
# GetProfile for an IP that maps to a Pod
❯ go run controller/script/destination-client/main.go -method getProfile -path 10.42.0.20
INFO[0000] retry_budget:{retry_ratio:0.2  min_retries_per_second:10  ttl:{seconds:10}}  endpoint:{addr:{ip:{ipv4:170524692}}  weight:10000  metric_labels:{key:"control_plane_ns"  value:"linkerd"}  metric_labels:{key:"deployment"  value:"fast-1"}  metric_labels:{key:"pod"  value:"fast-1-5cc87f64bc-9hx7h"}  metric_labels:{key:"pod_template_hash"  value:"5cc87f64bc"}  metric_labels:{key:"serviceaccount"  value:"default"}  tls_identity:{dns_like_identity:{name:"default.default.serviceaccount.identity.linkerd.cluster.local"}}  protocol_hint:{h2:{}}}
```

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-11-18 15:41:25 -05:00
Alex Leong 4f34ce8e2f
Empty the stored addresses when the endpoint translator gets a NoEndpoints message (#5126)
Signed-off-by: Alex Leong <alex@buoyant.io>
2020-10-22 17:01:03 -07:00
Kevin Leimkuhler 6b7a39c9fa
Set FQN in profile resolutions (#5019)
## Motivation

Closes #5016

Depends on linkerd/linkerd2-proxy-api#44

## Solution

A `profileTranslator` exists for each service and now has a new
`fullyQualifiedName` field.

This field is used to set the `FullyQualifiedName` field of
`DestinationProfile`s each time an update is sent.

In the case that no service profile exists for a service, a default
`DestinationProfile` is created and we can use the field to set the correct
name.

In the case that a service profile does exist for a service, we still use this
field to set the name to keep it consistent.

### Example

Install linkerd on a cluster and run the destination server:

```
go run controller/cmd/main.go destination -kubeconfig ~/.kube/config
```

Get the IP of a service. Here, we'll get the IP for `linkerd-identity`:

```
> kubectl get -n linkerd svc/linkerd-identity
NAME               TYPE        CLUSTER-IP     EXTERNAL-IP   PORT(S)    AGE
linkerd-identity   ClusterIP   10.43.161.68   <none>        8080/TCP   4h25m
```

Get the profile of `linkerd-identity` from service name or IP and note the
`FullyQualifiedName` field:

```
> go run controller/script/destination-client/main.go -method getProfile -path 10.43.161.68:8080
INFO[0000] fully_qualified_name:"linkerd-identity.linkerd.svc.cluster.local" ..
```

```
> go run controller/script/destination-client/main.go -method getProfile -path linkerd-identity.linkerd.svc.cluster.local
INFO[0000] fully_qualified_name:"linkerd-identity.linkerd.svc.cluster.local" ..
```

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-10-01 11:06:00 -04:00
Tarun Pothulapati d0caaa86c4
Bump k8s client-go to v0.19.2 (#5002)
Fixes #4191 #4993

This bumps Kubernetes client-go to the latest v0.19.2 (We had to switch directly to 1.19 because of this issue). Bumping to v0.19.2 required upgrading to smi-sdk-go v0.4.1. This also depends on linkerd/stern#5

This consists of the following changes:

- Fix ./bin/update-codegen.sh by adding the template path to the gen commands, as it is needed after we moved to GOMOD.
- Bump all k8s related dependencies to v0.19.2
- Generate CRD types, client code using the latest k8s.io/code-generator
- Use context.Context as the first argument, in all code paths that touch the k8s client-go interface

Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>
2020-09-28 12:45:18 -05:00
Oliver Gould 6d67b84447
profiles: Eliminate default timeout (#4958)
* profiles: Eliminate default timeout
2020-09-10 14:00:18 -07:00
Kevin Leimkuhler c2301749ef
Always return destination overrides for services (#4890)
## Motivation

#4879

## Solution

When no traffic split exists for services, return a single destination override
with a weight of 100%.

Using the destination client on a new linkerd installation, this results in the
following output for `linkerd-identity` service:

```
❯ go run controller/script/destination-client/main.go -method getProfile -path linkerd-identity.linkerd.svc.cluster.local:8080
INFO[0000] retry_budget:{retry_ratio:0.2 min_retries_per_second:10 ttl:{seconds:10}} dst_overrides:{authority:"linkerd-identity.linkerd.svc.cluster.local.:8080" weight:100000} 
INFO[0000]
```

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-08-19 12:25:58 -07:00
Matei David f797ab1e65
service topologies: topology-aware service routing (#4780)
[Link to RFC](https://github.com/linkerd/rfc/pull/23)

### What
---
* PR that puts together all past pieces of the puzzle to deliver topology-aware service routing, as specified in the [Kubernetes docs](https://kubernetes.io/docs/concepts/services-networking/service-topology/) but with a much better load balancing algorithm and all the coolness of linkerd :) 
* The first piece of this PR is focused on adding topology metadata: topology preference for services and topology `<k,v>` pairs for endpoints.
* The second piece of this PR puts together the new context format and fetching the source node topology metadata in order to allow for endpoints filtering.
* The final part is doing the filtering -- passing all of the metadata to the listener and on every `Add` filtering endpoints based on the topology preference of the service, topology `<k,v>` pairs of endpoints and topology of the source (again `<k,v>` pairs).

### How
---

* **Collecting metadata**:
   -  Services do not have values for topology keys -- the topological keys defined in a service's spec are only there to dictate locality preference for routing. As such, I decided to store them in an array; they will be taken exactly as they are found in the service spec, which ensures we respect the preference order.

   - For EndpointSlices, we are using a map -- an EndpointSlice has locality information in the form of `<k,v>` pair, where the key is a topological key (similar to what's listed in the service) and the value is the locality information -- e.g `hostname: minikube`. For each address we now have a map of topology values which gets populated when we translate the endpoints to an address set. Because normal Endpoints do not have any topology information, we create each address with an empty map which is subsequently populated ONLY for slices in the `endpointSliceToAddressSet` function.

* **Filtering endpoints**:
  - This was a tricky part and filled me with doubts. I think there are a few ways to do this, but this is how I "envisioned" it. First, the `endpoint_translator.go` should be the one to do the filtering; this means that on subscription, we need to feed all of the relevant metadata to the listener. To do this, I created a new function `AddTopologyFilter` as part of the listener interface.

  - To complement the `AddTopologyFilter` function, I created a new `TopologyFilter` struct in `endpoints_watcher.go`. I then embedded this structure in all listeners that implement the interface. The structure holds the source topology (source node), a boolean to tell if slices are activated in case we need to double check (or write tests for the function) and the service preference. We create the filter on Subscription -- we have access to the k8s client here as well as the service, so it's the best point to collect all of this data together. Addresses all have their own topology added to them so they do not have to be collected by the filter.

  - When we add a new set of addresses, we check to see if slices are enabled -- chances are if slices are enabled, service topology might be too. This lets us skip this step if the latest version is not adopted. Prior to sending an `Add` we filter the endpoints -- if the preference is registered by the filter we strictly enforce it, otherwise nothing changes.

And that's pretty much it. 
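
For illustration, a simplified sketch of the filtering step, with hypothetical types and a deliberately naive fallback (return everything when no preference matches), not the real listener code:

```go
package watcher

// topologyAddress pairs an address with its topology labels,
// e.g. {"kubernetes.io/hostname": "minikube"}.
type topologyAddress struct {
	ID       string
	Topology map[string]string
}

// filterByTopology walks the service's topology preferences in order and
// returns the first non-empty set of addresses whose label matches the
// source node; "*" (or no match at all, in this sketch) returns everything.
func filterByTopology(addrs []topologyAddress, preferences []string, sourceNode map[string]string) []topologyAddress {
	for _, key := range preferences {
		if key == "*" {
			return addrs
		}
		want, ok := sourceNode[key]
		if !ok {
			continue
		}
		var matched []topologyAddress
		for _, a := range addrs {
			if a.Topology[key] == want {
				matched = append(matched, a)
			}
		}
		if len(matched) > 0 {
			return matched
		}
	}
	return addrs
}
```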

Signed-off-by: Matei David <matei.david.35@gmail.com>
2020-08-18 11:11:09 -07:00
Josh Soref 72aadb540f
Spelling (#4872)
This PR corrects misspellings identified by the [check-spelling action](https://github.com/marketplace/actions/check-spelling).

The misspellings have been reported at aaf440489e (commitcomment-41423663)

The action reports that the changes in this PR would make it happy: 5b82c6c5ca

Note: this PR does not include the action. If you're interested in running a spell check on every PR and push, that can be offered separately.

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
2020-08-12 21:59:50 -07:00
Alex Leong a1543b33e3
Add support for service-mirror selectors (#4795)
* Add selector support

Signed-off-by: Alex Leong <alex@buoyant.io>

* Removed unused labels

Signed-off-by: Alex Leong <alex@buoyant.io>
2020-07-30 10:07:14 -07:00
Matei David 1c197b14e7
Change destination context token format (#4771)
Add a new structure on the destination controller side to keep track of contextual information.
The token format has been changed from ns:<namespace> to a JSON format so that more variables can be
encoded in the token. As part of this PR, a new field 'nodeName' has been added to help with service
topologies.
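
A sketch of the token shape, assuming a hypothetical struct whose exact field set (beyond nodeName) may differ from the real one:

```go
package destination

import "encoding/json"

// contextToken carries contextual information from the proxy to the
// destination controller; the ns field replaces the old ns:<namespace> form.
type contextToken struct {
	Ns       string `json:"ns"`
	NodeName string `json:"nodeName"`
}

// parseContextToken decodes a token such as {"ns":"default","nodeName":"node-1"}.
func parseContextToken(token string) (contextToken, error) {
	var t contextToken
	err := json.Unmarshal([]byte(token), &t)
	return t, err
}
```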

Fixes #4498

Signed-off-by: Matei David <matei.david.35@gmail.com>
2020-07-27 09:49:48 -07:00
Matei David 146c593cd5
Uncomment EndpointSliceAccess function (#4760)
* Small PR that uncomments the `EndpointSliceAccess` method and cleans up leftover TODOs in the destination service.
* Based on the past three PRs related to `EndpointSlices` (#4663 #4696 #4740), EndpointSlice support should now be functional (albeit prone to bugs) and ready to use.

Signed-off-by: Matei David <matei.david.35@gmail.com>
2020-07-20 14:50:43 -07:00
Matei David 8b85716eb8
Introduce install flag for EndpointSlices (#4740)
EndpointSlices have been made opt-in due to their experimental nature. This PR
introduces a new install flag 'enableEndpointSlices' that allows adopters to
specify, in their CLI install or Helm install step, whether they would like to
use EndpointSlices as a resource in the destination service instead of the
Endpoints k8s resource.

Signed-off-by: Matei David <matei.david.35@gmail.com>
2020-07-15 09:53:04 -07:00
Kevin Leimkuhler f49b40c4a9
Add support for profile lookups by IP address (#4727)
## Motivation

Closes #3916

This adds the ability to get profiles for services by IP address.

### Change in behavior

When the destination server receives a `GetProfile` request with an IP address,
it now tries to map that IP address to a service.

If the IP address maps to an existing service, then the destination server
returns the profile stream and subscribes for updates to the _service_--this is
the existing behavior. If the IP later maps to a new service, the stream will
still send updates for the first service the IP address corresponded to, since
that is what it is subscribed to.

If the IP address does not map to an existing service, then the destination
server returns the profile stream but does not subscribe for updates. The stream
will receive one update, the default profile.

### Solution

This change uses the `IPWatcher` within the destination server to check which
service an IP address corresponds to. By adding a new method `GetSvc` to
`IPWatcher`, the server now calls this method if `GetProfile` receives a request
with an IP address.

## Testing

Install linkerd on a cluster and get the cluster IP of any service:

```bash
❯ kubectl get -n linkerd svc/linkerd-tap -o wide
NAME          TYPE        CLUSTER-IP     EXTERNAL-IP   PORT(S)            AGE   SELECTOR
linkerd-tap   ClusterIP   10.104.57.90   <none>        8088/TCP,443/TCP   16h   linkerd.io/control-plane-component=tap
```

Run the destination server:

```bash
❯ go run controller/cmd/main.go destination -kubeconfig ~/.kube/config
```

Get the profile for the tap service by IP address:

```bash
❯ go run controller/script/destination-client/main.go -method getProfile -path 10.104.57.90:8088
INFO[0000] retry_budget:{retry_ratio:0.2  min_retries_per_second:10  ttl:{seconds:10}} 
INFO[0000]
```

Get the profile for an IP address that does not correspond to a service:

```bash
❯ go run controller/script/destination-client/main.go -method getProfile -path 10.256.0.1:8088
INFO[0000] retry_budget:{retry_ratio:0.2  min_retries_per_second:10  ttl:{seconds:10}} 
INFO[0000]
```

You can add and remove settings for the service profile for tap and get updates.

Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>
2020-07-10 14:41:15 -07:00
Matei David 9d8d89cce8
Add EndpointSlice logic to EndpointsWatcher (#4501) (#4663)
Introduce support for the EndpointSlice k8s resource (k8s v1.16+) in the destination service.
Through this PR, the EndpointsWatcher gains a dedicated informer for EndpointSlice;
this informer cannot run at the same time as the Endpoints resource informer. The main difference
is that EndpointSlices have a one-to-many relationship with a service; they provide better performance,
dual-stack addresses, and more. EndpointSlice support also implies service topology and other k8s related features.

Validated and tested manually, as well as with dedicated unit tests.

Closes #4501

Signed-off-by: Matei David <matei.david.35@gmail.com>
2020-07-07 13:20:40 -07:00
Oliver Gould c4d649e25d
Update proxy-api version to v0.1.13 (#4614)
This update includes no API changes, but updates grpc-go
to the latest release.
2020-06-24 12:52:59 -07:00
Zahari Dichev 3365455e45
Fix mc labels (#4560)
Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-06-05 19:36:09 +03:00
Zahari Dichev 4176580a0f
Threadsafe buffering listener (#4359)
* Add thread safety to watcher tests

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-05-14 20:45:41 +03:00
Zahari Dichev ef1a2c2b10
Multicluster dashboard for traffic metrics (#4178)
This change adds labels to endpoints that target remote services. It also adds a Grafana dashboard that can be used to monitor multicluster traffic.

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-05-14 17:48:27 +03:00
Alex Leong 8fbaa3ef9b
Don't send NoEndpoints during pod updates for ip watches (#4338)
When the proxy has an IP watch on a pod and the destination controller gets a pod update event, the destination controller sends a NoEndpoints message to all listeners followed by an Add with the new pod state. This can result in the proxy's load balancer being briefly empty and could cause failing requests during that period.

Since consecutive Add events with the same address will override each other, we can simply send the Adds without needing to clear the previous state with a NoEndpoints message.
2020-05-07 16:10:17 -07:00
Zahari Dichev 26c14d3c66
Detect changes in addresses when getting updates in endpoints watcher (#4104)
Detect changes in addresses when getting updates in endpoints watcher

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-04-10 11:42:39 +03:00
Alex Leong d8eebee4f7
Upgrade to client-go 0.17.4 and smi-sdk-go 0.3.0 (#4221)
Here we upgrade our dependencies on client-go to 0.17.4 and smi-sdk-go to 0.3.0.  Since smi-sdk-go uses client-go 0.17.4, these upgrades must be performed simultaneously.

This also requires simultaneously upgrading our dependency on linkerd/stern to a SHA which also uses client-go 0.17.4.  This keeps all of our transitive dependencies synchronized on one version of client-go.

This ALSO requires updating our codegen scripts to use the 0.17.4 version of code-generator and running it to generate 0.17.4-compatible generated code.  I took this opportunity to update our code generation script to properly use the version of code-generator from `go.mod` rather than a hardcoded SHA.

Signed-off-by: Alex Leong <alex@buoyant.io>
2020-04-01 10:07:23 -07:00
Zahari Dichev 10ecd8889e
Set auth override (#4160)
Set AuthOverride when present on endpoints annotation

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-03-25 10:56:36 +02:00
Zahari Dichev 2db307ee91
Remove target port requirement in port resolution (#4174)
This change removes the target port requirement when resolving ports in the dst service. Based on the comments, it seems that we need to have a target port defined in the port spec in order to resolve to the port in the Endpoints. In reality, if the target port is not defined when creating the service, k8s will set the port and the target port to the same value. Checking that the targetPort is different from 0 therefore seems to be a no-op.

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-03-16 23:04:08 +02:00
Zahari Dichev caf4e61daf
Enable identity on endpoints not associated with pods (#4134)
Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-03-09 20:55:57 +02:00
Zahari Dichev edd7fd203d
Service Mirroring Component (#4028)
This PR introduces a service mirroring component that is responsible for watching remote clusters and mirroring their services locally.

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-03-02 21:16:08 +02:00
Zahari Dichev 6fa9407318
Ensure we get the correct type out of Informer Deletion events (#4034)
Ensure we get what we expect when receiving DELETE events from the k8s Informer API

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
2020-02-15 10:15:24 +02:00
Alejandro Pedraza 3ba66f6f9d
Fix flakey TestGetProfiles (#3965)
Fixes #3332

Fixes the very rare test failure
```
--- FAIL: TestGetProfiles (0.33s)
    --- FAIL: TestGetProfiles/Returns_server_profile (0.11s)
            server_test.go:228: Expected 1 or 2 updates but got 3:
            [retry_budget:<retry_ratio:0.2 min_retries_per_second:10
            ttl:<seconds:10 > >  routes:<condition:<path:<regex:"/a/b/c"
            > > metrics_labels:<key:"route" value:"route1" >
            timeout:<seconds:10 > > retry_budget:<retry_ratio:0.2
            min_retries_per_second:10 ttl:<seconds:10 > >
            routes:<condition:<path:<regex:"/a/b/c" > >
            metrics_labels:<key:"route" value:"route1" >
            timeout:<seconds:10 > > retry_budget:<retry_ratio:0.2
            min_retries_per_second:10 ttl:<seconds:10 > > ]
            FAIL
            FAIL  github.com/linkerd/linkerd2/controller/api/destination
            0.624s
```
that occurs when a third, unexpected stream update arrives because the fake
API takes more time to notify its listeners about the created resources.

For all the nasty details check #3332
2020-02-07 19:43:29 -05:00
Alejandro Pedraza afb93cddc8
Use `t.Name()` instead of `t.Name` in tests (#3970)
Use `t.Name()` instead of `t.Name` when retrieving the name of tests.
The latter was causing an error to be added to the log:
```
output: logrus_error="can not add field \"test\"
```

Followup to
[comment](https://github.com/linkerd/linkerd2/pull/3965#discussion_r370387990)
2020-01-27 09:17:19 -05:00
Alex Leong 03762cc526
Support pod ip and service cluster ip lookups in the destination service (#3595)
Fixes #3444 
Fixes #3443 

## Background and Behavior

This change adds support for the destination service to resolve Get requests that contain a service clusterIP or pod IP as the `Path` parameter.  It returns the stream of endpoints, just as if `Get` had been called with the service's authority.  This lays the groundwork for allowing the proxy to TLS TCP connections, by letting the proxy do destination lookups for the SO_ORIG_DST of TCP connections.  When that IP address corresponds to a service cluster IP or pod IP, the destination service will return the endpoints stream, including the pod metadata required to establish identity.

Prior to this change, attempting to look up an ip address in the destination service would result in a `InvalidArgument` error.

Updating the `GetProfile` method to support ip address lookups is out of scope and attempts to look up an ip address with the `GetProfile` method will result in `InvalidArgument`.

## Implementation

We do this by creating an `IPWatcher` which wraps the `EndpointsWatcher` and supports lookups by IP.   `IPWatcher` maintains a mapping of clusterIPs to service IDs and translates a subscription to an IP address into a subscription to the service ID using the underlying `EndpointsWatcher`.

Since the service name is no longer always inferable directly from the input parameters, we restructure `EndpointTranslator` and `PodSet` so that we propagate the service name from the endpoints API response.
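
A minimal sketch of that mapping (hypothetical types, not the real `IPWatcher`):

```go
package watcher

import "sync"

// serviceID identifies a service by namespace and name.
type serviceID struct{ Namespace, Name string }

// ipWatcher keeps a clusterIP-to-service mapping; an IP subscription is
// translated into a subscription on the underlying EndpointsWatcher for the
// resolved service ID.
type ipWatcher struct {
	mu          sync.RWMutex
	byClusterIP map[string]serviceID
}

func (w *ipWatcher) serviceFor(clusterIP string) (serviceID, bool) {
	w.mu.RLock()
	defer w.mu.RUnlock()
	id, ok := w.byClusterIP[clusterIP]
	return id, ok
}
```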

## Testing

This can be tested by running the destination service locally, using the current kube context to connect to a Kubernetes cluster:

```
go run controller/cmd/main.go destination -kubeconfig ~/.kube/config
```

Then lookups can be issued using the destination client:

```
go run controller/script/destination-client/main.go -path 192.168.54.78:80 -method get -addr localhost:8086
```

Service cluster ips and pod ips can be used as the `path` argument.

Signed-off-by: Alex Leong <alex@buoyant.io>
2019-12-19 09:25:12 -08:00
Alex Leong 0026103362 Unit and integration test fixups (#3730)
- Added cleanup step at the end of all integration tests.
- Disable external_issuer_integration_tests in cloud_tests due to a
  namespace issue. Running this via `kind` tests is sufficient for now.
- Set a flakey test to `Skip`, relates to #3332.

Signed-off-by: Andrew Seigner <siggy@buoyant.io>
2019-11-15 03:40:42 -08:00
Tarun Pothulapati f3deee01b6 Trace Control plane Components with OC (#3495)
* add trace flags and initialisation
* add ocgrpc handler to newgrpc
* add ochttp handler to linkerd web
* add flags to linkerd web
* add ochttp handler to prometheus handler initialisation
* add ochttp clients for components
* add span for prometheus query
* update godep sha
* fix reviews
* better commenting
* add err checking
* remove sampling
* add check in main
* move to pkg/trace

Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>
2019-10-18 12:19:13 -07:00