linkerd2

Commit Graph

Author	SHA1	Message	Date
Tarun Pothulapati	fac28ff8a7	destination: Remove support for IP Queries in `Get` API (#6018 ) * destination: Remove support for IP Queries in `Get` API Fixes #5246 This PR updates the destination to report an error when `Get` is called for IP Queries. As the issue mentions, The proxies are not using this API anymore and it helps to simplify and remove unnecessary logic. This removes the relevant `IPWatcher` logic, along with unit tests Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2021-04-21 12:40:40 +05:30
Kevin Leimkuhler	3f72c998b3	Handle pod lookups for pods that map to a host IP and host port (#5904 ) This fixes an issue where pod lookups by host IP and host port fail even though the cluster has a matching pod. Usually these manifested as `FailedPrecondition` errors, but the messages were too long and resulted in http/2 errors. This change depends on #5893 which fixes that separate issue. This changes how often those `FailedPrecondition` errors actually occur. The destination service now considers pod host IPs and should reduce the frequency of those errors. Closes #5881 --- Lookups like this happen when a pod is created with a host IP and host port set in its spec. It still has a pod IP when running, but requests to `hostIP:hostPort` will also be redirected to the pod. Combinations of host IP and host Port are unique to the cluster and enforced by Kubernetes. Currently, the destination services fails to find pods in this scenario because we only keep an index with pod and their pod IPs, not pods and their host IPs. To fix this, we now also keep an index of pods and their host IPs—if and only if they have the host IP set. Now when doing a pod lookup, we consider both the IP and the port. We perform the following steps: 1. Do a lookup by IP in the pod podIP index - If only one pod is found then return it 2. 0 or more than 1 pods have the same pod IP 3. Do a lookup by IP in the pod hostIP index - If any number of pods were found, we know that IP maps to a node IP. Therefore, we search for a pod with a matching host Port. If one exists then return it; if not then there is no pod that matches `hostIP:port` 4. The IP does not map to a host IP 5. If multiple pods were found in `1`, then we know there are pods with conflicting podIPs and an error is returned 6. If no pounds were found in `1` then there is no pod that matches `IP:port` --- Aside from the additional IP watcher test being added, this can be tested with the following steps: 1. Create a kind cluster. kind is required because it's pods in `kube-system` have the same pod IPs; this not the case with k3d: `bin/kind create cluster` 2. Install Linkerd with `4445` marked as opaque: `linkerd install --set proxy.opaquePorts="4445" \|kubectl apply -f -` 2. Get the node IP: `kubectl get -o wide nodes` 3. Pull my fork of `tcp-echo`: ``` $ git clone https://github.com/kleimkuhler/tcp-echo ... $ git checkout --track kleimkuhler/host-pod-repro ``` 5. `helm package .` 7. Install `tcp-echo` with the server not injected and correct host IP: `helm install tcp-echo tcp-echo-0.1.0.tgz --set server.linkerdInject="false" --set hostIP="..."` 8. Looking at the client's proxy logs, you should not observe any errors or protocol detection timeouts. 9. Looking at the server logs, you should see all the requests coming through correctly. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-03-18 13:29:43 -04:00
Riccardo Freixo	66b89f55e7	Fix named port resolution mid roll out (#5912 ) (#5911 ) # Problem While rolling out often not all pods will be ready in all the same set of ports, leading the Kubernetes Endpoints API to return multiple subsets, each covering a different set of ports, with the end result that the same address gets repeated across subsets. The old code for endpointsToAddresses would loop through all subsets, and the later occurrences of an address would overwrite previous ones, with the last one prevailing. If the last subset happened to be for an irrelevant port, and the port to be resolved is named, resolveTargetPort would resolve to port 0, which would return port 0 to clients, ultimately leading linkerd-proxy to forward connections to port 0. This only happens if the pods selected by a service expose > 1 port, the service maps to > 1 of these ports, and at least one of these ports is named. # Solution Never write an address to set of addresses if resolved port is 0, which indicates named port resolution failed. # Validation Added a test case. Signed-off-by: Riccardo Freixo <riccardofreixo@gmail.com>	2021-03-17 17:40:11 -04:00
Kevin Leimkuhler	5dc662ae97	Remove namespace inheritance of opaque ports annotation (#5739 ) This change removes the namespace inheritance of the opaque ports annotation. Now when setting opaque port related fields in destination profile responses, we only look at the pod annotations. This prepares for #5736 where the proxy-injector will add the annotation from the namespace if the pod does not have it already. Closes #5735 Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-02-15 10:21:20 -05:00
Oleh Ozimok	c416e78261	destination: Fix crash when EndpointSlices are enabled (#5543 ) The Destination controller can panic due to a nil-deref when the EndpointSlices API is enabled. This change updates the controller to properly initialize values to avoid this segmentation fault. Fixes #5521 Signed-off-by: Oleg Ozimok <oleg.ozimok@corp.kismia.com>	2021-01-15 12:52:11 -08:00
Alejandro Pedraza	d3d7f4e2e2	Destination should return `OpaqueTransport` hint when annotation matches resolved target port (#5458 ) The destination service now returns `OpaqueTransport` hint when the annotation matches the resolve target port. This is different from the current behavior which always sets the hint when a proxy is present. Closes #5421 This happens by changing the endpoint watcher to set a pod's opaque port annotation in certain cases. If the pod already has an annotation, then its value is used. If the pod has no annotation, then it checks the namespace that the endpoint belongs to; if it finds an annotation on the namespace then it overrides the pod's annotation value with that. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2021-01-05 14:54:55 -05:00
Tarun Pothulapati	d0caaa86c4	Bump k8s client-go to v0.19.2 (#5002 ) Fixes #4191 #4993 This bumps Kubernetes client-go to the latest v0.19.2 (We had to switch directly to 1.19 because of this issue). Bumping to v0.19.2 required upgrading to smi-sdk-go v0.4.1. This also depends on linkerd/stern#5 This consists of the following changes: - Fix ./bin/update-codegen.sh by adding the template path to the gen commands, as it is needed after we moved to GOMOD. - Bump all k8s related dependencies to v0.19.2 - Generate CRD types, client code using the latest k8s.io/code-generator - Use context.Context as the first argument, in all code paths that touch the k8s client-go interface Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-09-28 12:45:18 -05:00
Matei David	f797ab1e65	service topologies: topology-aware service routing (#4780 ) [Link to RFC](https://github.com/linkerd/rfc/pull/23) ### What --- * PR that puts together all past pieces of the puzzle to deliver topology-aware service routing, as specified in the [Kubernetes docs](https://kubernetes.io/docs/concepts/services-networking/service-topology/) but with a much better load balancing algorithm and all the coolness of linkerd :) * The first piece of this PR is focused on adding topology metadata: topology preference for services and topology `<k,v>` pairs for endpoints. * The second piece of this PR puts together the new context format and fetching the source node topology metadata in order to allow for endpoints filtering. * The final part is doing the filtering -- passing all of the metadata to the listener and on every `Add` filtering endpoints based on the topology preference of the service, topology `<k,v>` pairs of endpoints and topology of the source (again `<k,v>` pairs). ### How --- * Collecting metadata: - Services do not have values for topology keys -- the topological keys defined in a service's spec are only there to dictate locality preference for routing; as such, I decided to store them in an array, they will be taken exactly as they are found in the service spec, this ensures we respect the preference order. - For EndpointSlices, we are using a map -- an EndpointSlice has locality information in the form of `<k,v>` pair, where the key is a topological key (similar to what's listed in the service) and the value is the locality information -- e.g `hostname: minikube`. For each address we now have a map of topology values which gets populated when we translate the endpoints to an address set. Because normal Endpoints do not have any topology information, we create each address with an empty map which is subsequently populated ONLY for slices in the `endpointSliceToAddressSet` function. * Filtering endpoints: - This was a tricky part and filled me with doubts. I think there are a few ways to do this, but this is how I "envisioned" it. First, the `endpoint_translator.go` should be the one to do the filtering; this means that on subscription, we need to feed all of the relevant metadata to the listener. To do this, I created a new function `AddTopologyFilter` as part of the listener interface. - To complement the `AddTopologyFilter` function, I created a new `TopologyFilter` struct in `endpoints_watcher.go`. I then embedded this structure in all listeners that implement the interface. The structure holds the source topology (source node), a boolean to tell if slices are activated in case we need to double check (or write tests for the function) and the service preference. We create the filter on Subscription -- we have access to the k8s client here as well as the service, so it's the best point to collect all of this data together. Addresses all have their own topology added to them so they do not have to be collected by the filter. - When we add a new set of addresses, we check to see if slices are enabled -- chances are if slices are enabled, service topology might be too. This lets us skip this step if the latest version is not adopted. Prior to sending an `Add` we filter the endpoints -- if the preference is registered by the filter we strictly enforce it, otherwise nothing changes. And that's pretty much it. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-08-18 11:11:09 -07:00
Alex Leong	a1543b33e3	Add support for service-mirror selectors (#4795 ) * Add selector support Signed-off-by: Alex Leong <alex@buoyant.io> * Removed unused labels Signed-off-by: Alex Leong <alex@buoyant.io>	2020-07-30 10:07:14 -07:00
Matei David	146c593cd5	Uncomment EndpointSliceAccess function (#4760 ) * Small PR that uncomments the `EndpointSliceAcess` method and cleans up left over todos in the destination service. * Based on the past three PRs related to `EndpointSlices` (#4663 #4696 #4740); they should now be functional (albeit prone to bugs) and ready to use. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-20 14:50:43 -07:00
Matei David	8b85716eb8	Introduce install flag for EndpointSlices (#4740 ) EndpointSlices have been made opt-in due to their experimental nature. This PR introduces a new install flag 'enableEndpointSlices' that will allow adopters to specify in their cli install or helm install step whether they would like to use endpointslices as a resource in the destination service, instead of the endpoints k8s resource. Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-15 09:53:04 -07:00
Matei David	9d8d89cce8	Add EndpointSlice logic to EndpointsWatcher (#4501 ) (#4663 ) Introduce support for the EndpointSlice k8s resource (k8s v1.16+) in the destination service. Through this PR, in the EndpointsWatcher, there will be a dedicated informer for EndpointSlice; the informer cannot run at the same time as the Endpoints resource informer. The main difference is that EndpointSlices have a one-to-many relationship with a service, they provide better performance benefits, dual-stack addresses and more. EndpointSlice support also implies service topology and other k8s related features. Validated and tested manually, as well as with dedicated unit tests. Closes #4501 Signed-off-by: Matei David <matei.david.35@gmail.com>	2020-07-07 13:20:40 -07:00
Zahari Dichev	3365455e45	Fix mc labels (#4560 ) Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-06-05 19:36:09 +03:00
Zahari Dichev	ef1a2c2b10	Multicluster dashboard for traffic metrics (#4178 ) This change adds labels to endpoints that target remote services. It also adds a Grafana dashboard that can be used to monitor multicluster traffic. Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-05-14 17:48:27 +03:00
Zahari Dichev	26c14d3c66	Detect changes in addresses when getting updates in endpoints watcher (#4104 ) Detect changes in addresses when getting updates in endpoints watcher Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-04-10 11:42:39 +03:00
Zahari Dichev	10ecd8889e	Set auth override (#4160 ) Set AuthOverride when present on endpoints annotation Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-03-25 10:56:36 +02:00
Zahari Dichev	2db307ee91	Remove target port requirement in port resolution (#4174 ) This change removes the target port requirement when resolving ports in the dst service. Based on the comments, it seems that we need to have a target port defined in the port spec in order to resolve to the port in the Endpoints. In reality if target port is note defined when creating the service, k8s will set the port and the target port to the same value. Seems to me that checking for the targetPort to be different than 0, is a no-op. Signed-off-by: Zahari Dichev zaharidichev@gmail.com	2020-03-16 23:04:08 +02:00
Zahari Dichev	caf4e61daf	Enable identitiy on endpoints not associated with pods (#4134 ) Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-03-09 20:55:57 +02:00
Zahari Dichev	6fa9407318	Ensure we get the correct type out of Informer Deletion events (#4034 ) Ensure we get what we expect when receiving DELETE events from the k8s Informer api Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-02-15 10:15:24 +02:00
Alex Leong	03762cc526	Support pod ip and service cluster ip lookups in the destination service (#3595 ) Fixes #3444 Fixes #3443 ## Background and Behavior This change adds support for the destination service to resolve Get requests which contain a service clusterIP or pod ip as the `Path` parameter. It returns the stream of endpoints, just as if `Get` had been called with the service's authority. This lays the groundwork for allowing the proxy to TLS TCP connections by allowing the proxy to do destination lookups for the SO_ORIG_DST of tcp connections. When that ip address corresponds to a service cluster ip or pod ip, the destination service will return the endpoints stream, including the pod metadata required to establish identity. Prior to this change, attempting to look up an ip address in the destination service would result in a `InvalidArgument` error. Updating the `GetProfile` method to support ip address lookups is out of scope and attempts to look up an ip address with the `GetProfile` method will result in `InvalidArgument`. ## Implementation We do this by creating a `IPWatcher` which wraps the `EndpointsWatcher` and supports lookups by ip. `IPWatcher` maintains a mapping up clusterIPs to service ids and translates subscriptions to an IP address into a subscription to the service id using the underlying `EndpointsWatcher`. Since the service name is no longer always infer-able directly from the input parameters, we restructure `EndpointTranslator` and `PodSet` so that we propagate the service name from the endpoints API response. ## Testing This can be tested by running the destination service locally, using the current kube context to connect to a Kubernetes cluster: ``` go run controller/cmd/main.go destination -kubeconfig ~/.kube/config ``` Then lookups can be issued using the destination client: ``` go run controller/script/destination-client/main.go -path 192.168.54.78:80 -method get -addr localhost:8086 ``` Service cluster ips and pod ips can be used as the `path` argument. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-12-19 09:25:12 -08:00
Johannes Hansen	f880e71fcd	The linkerd proxy does not work with headless services (#3470 ) * The linkerd proxy does not work with headless services (i.e. endpoints not referencing a pod). Changed endpoints_watcher to also return endpoints with no targetref. Fixes #3308 Signed-off-by: Johannes Hansen <johannesh1980@gmail.com> * Fix panic in endpoint_translator Signed-off-by: Johannes Hansen <johannesh1980@gmail.com>	2019-10-15 14:56:41 -07:00
陈谭军	e281fb3410	fix-up grammar (#3351 ) Signed-off-by: chentanjun <2799194073@qq.com>	2019-08-30 08:09:36 -07:00
Alejandro Pedraza	fd248d3755	Undo refactoring from #3316 (#3331 ) Thus fixing `linkerd edges` and the dashboard topology graph Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-29 13:37:54 -05:00
Alejandro Pedraza	5d7499dc84	Avoid the dashboard requesting stats when not needed (#3338 ) * Avoid the dashboard requesting stats when not needed Create an alternative to `urlsForResource` called `urlsForResourceNoStats` that makes use of the `skip_stats` parameter in the stats API (created in #1871) that doesn't query Prometheus when not needed. When testing using the dashboard looking at the linkerd namespace, queries per second went down from 2874 to 2756, a 4% decrease. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-29 05:52:44 -05:00
Alejandro Pedraza	02efb46e45	Have the proxy-injector emit events upon injection/skipping injection (#3316 ) * Have the proxy-injector emit events upon injection/skipping injection Fixes #3253 Have the proxy-injector emit an event whenever a injection happens, or when injection is skipped for some reason (also added that reason into the proxy-injector logs). The level is associated to the parent workload (it can't be associated to the pod because at this point the pod hasn't been persisted). The event recorder was setup at the `webhook/server.go` level and passed to the proxy-injector's `Inject` function. The sp-validator thus also has access to the event recorder, but for now it's not using it. Related changes: - Refactored `api.GetOwnerKindAndName()` to have it return a more generic object. - Refactored `report.Injectable()` to also have it return the reason why a workload is not injectable. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-26 13:34:36 -05:00
Alex Leong	ab7226cbcd	Return invalid argument for external name services (#3120 ) Fixes https://github.com/linkerd/linkerd2/issues/2800#issuecomment-513740498 When the Linkerd proxy sends a query for a Kubernetes external name service to the destination service, the destination service returns `NoEndpoints: exists=false` because an external name service has no endpoints resource. Due to a change in the proxy's fallback logic, this no longer causes the proxy to fallback to either DNS or SO_ORIG_DST and instead fails the request. The net effect is that Linkerd fails all requests to external name services. We change the destination service to instead return `InvalidArgument` for external name services. This causes the proxy to fallback to SO_ORIG_DST instead of failing the request. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-29 16:31:22 -07:00
Andrew Seigner	51b33ad53c	Fix nil pointer dereference in endpoints watcher (#3147 ) The destination service's endpoints watcher assumed every `Endpoints` object contained a `TargetRef`. This field is optional, and in cases such as the default `ep/kubernetes` object, `TargetRef` is nil, causing a nil pointer dereference. Fix endpoints watcher to check for `TargetRef` prior to dereferencing. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-25 17:11:56 -07:00
Alex Leong	e538a05ce2	Add support for stateful sets (#3113 ) We add support for looking up individual pods in a stateful set with the destination service. This allows Linkerd to correctly proxy requests which address individual pods. The authority structure for such a request is `<pod-name>.<service>.<namespace>.svc.cluster.local:<port>`. Fixes #2266 Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-24 14:09:46 -07:00
Alex Leong	92ddffa3c2	Add prometheus metrics for watchers (#3022 ) To give better visibility into the inner workings of the kubernetes watchers in the destination service, we add some prometheus metrics. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-08 11:50:26 -07:00
Alejandro Pedraza	8988a5723f	Have `GetOwnerKindAndName` be able to skip the cache (#2972 ) * Have `GetOwnerKindAndName` be able to skip the cache Refactored `GetOwnerKindAndName` so it can optionally skip the shared informer cache and instead hit the k8s API directly. Useful for the proxy injector, when the pod's replicaset got just created and might not be in ready in the cache yet. Fixes #2738 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-20 12:58:15 -05:00
Alex Leong	06a69f69c5	Refactor destination service (#2786 ) This is a major refactor of the destination service. The goals of this refactor are to simplify the code for improved maintainability. In particular: * Remove the "resolver" interfaces. These were a holdover from when our decision tree was more complex about how to handle different kinds of authorities. The current implementation only accepts fully qualified kubernetes service names and thus this was an unnecessary level of indirection. * Moved the endpoints and profile watchers into their own package for a more clear separation of concerns. These watchers deal only in Kubernetes primitives and are agnostic to how they are used. This allows a cleaner layering when we use them from our gRPC service. * Renamed the "listener" types to "translator" to make it more clear that the function of these structs is to translate kubernetes updates from the watcher to gRPC messages. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-06-04 15:01:16 -07:00

31 Commits