linkerd2

Commit Graph

Author	SHA1	Message	Date
Alex Leong	03762cc526	Support pod ip and service cluster ip lookups in the destination service (#3595 ) Fixes #3444 Fixes #3443 ## Background and Behavior This change adds support for the destination service to resolve Get requests which contain a service clusterIP or pod ip as the `Path` parameter. It returns the stream of endpoints, just as if `Get` had been called with the service's authority. This lays the groundwork for allowing the proxy to TLS TCP connections by allowing the proxy to do destination lookups for the SO_ORIG_DST of tcp connections. When that ip address corresponds to a service cluster ip or pod ip, the destination service will return the endpoints stream, including the pod metadata required to establish identity. Prior to this change, attempting to look up an ip address in the destination service would result in a `InvalidArgument` error. Updating the `GetProfile` method to support ip address lookups is out of scope and attempts to look up an ip address with the `GetProfile` method will result in `InvalidArgument`. ## Implementation We do this by creating a `IPWatcher` which wraps the `EndpointsWatcher` and supports lookups by ip. `IPWatcher` maintains a mapping up clusterIPs to service ids and translates subscriptions to an IP address into a subscription to the service id using the underlying `EndpointsWatcher`. Since the service name is no longer always infer-able directly from the input parameters, we restructure `EndpointTranslator` and `PodSet` so that we propagate the service name from the endpoints API response. ## Testing This can be tested by running the destination service locally, using the current kube context to connect to a Kubernetes cluster: ``` go run controller/cmd/main.go destination -kubeconfig ~/.kube/config ``` Then lookups can be issued using the destination client: ``` go run controller/script/destination-client/main.go -path 192.168.54.78:80 -method get -addr localhost:8086 ``` Service cluster ips and pod ips can be used as the `path` argument. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-12-19 09:25:12 -08:00
Sergio C. Arteaga	cee8e3d0ae	Add CronJobs and ReplicaSets to dashboard and CLI (#3687 ) This PR adds support for CronJobs and ReplicaSets to `linkerd inject`, the web dashboard and CLI. It adds a new Grafana dashboard for each kind of resource. Closes #3614 Closes #3630 Closes #3584 Closes #3585 Signed-off-by: Sergio Castaño Arteaga tegioz@icloud.com Signed-off-by: Cintia Sanchez Garcia cynthiasg@icloud.com	2019-12-11 10:02:37 -08:00
Alejandro Pedraza	b4d27f9d82	No need for `processYAML()` in `install` (#3784 ) * No need for `processYAML()` in `install` Since `install` uses helm to do its proxy injection, there's no need to call `processYAML`. This also fixes an issue discovered in #3687 where we started supporting injection of cronjobs, and even though `linkerd`'s namespace is flagged to skip automatic injection it was being injected. This replaces #3773 as it's a much more simpler approach.	2019-12-09 09:32:14 -05:00
Alejandro Pedraza	4b6254b52e	Replaced `uuid` with `uid` from linkerd-config resource (#3694 ) * Replaced `uuid` with `uid` from linkerd-config resource Fixes #3621 Removed the old `uuid` for identifying linkerd installations, and replaced it with the `uid` property from the `linkerd-config` ConfigMap. I tested that this `uid` remains the same by updating the config and also upgrading linkerd, using both the CLI and Helm. Note that this required granting `linkerd-web` RBAC access to the `linkerd-config` Config. I also added an integration test to verify the stability of the uid.	2019-11-13 13:56:01 -05:00
Sergio C. Arteaga	eff1714a08	Add `linkerd check` to dashboard (#3656 ) `linkerd check` can now be run from the dashboard in the `/controlplane` view. Once the check results are received, they are displayed in a modal in a similar style to the CLI output. Closes #3613	2019-11-12 12:37:36 -08:00
Tarun Pothulapati	f18e27b115	use appsv1 api in identity (#3682 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-11-06 15:06:09 -08:00
Mayank Shah	ec848d4ef3	Add inject support for namespace configs (Fix #3255 ) (#3607 ) * Add inject support for namespaces(Fix #3255) * Add relevant unit tests (including overridden annotations) Signed-off-by: Mayank Shah <mayankshah1614@gmail.com>	2019-10-30 10:18:01 -05:00
Alejandro Pedraza	d3d8266c63	If tap source IP matches many running pods then only show the IP (#3513 ) * If tap source IP matches many running pods then only show the IP When an unmeshed source ip matched more than one running pod, tap was showing the names for all those pods, even though the didn't necessary originate the connection. This could be reproduced when using pod network add-on such as Calico. With this change, if a node matches, return it, otherwise we proceed to look for a matching pod. If exactly one running pod matches we return it. Otherwise we return just the IP. Fixes #3103	2019-10-25 12:38:11 -05:00
Zahari Dichev	0017f9a60a	Cert manager support (#3600 ) * Add support for --identity-issuer-mode flag to install cmd * Change flag to be a bool * Read correct data form identity when external issuer is used * Add ability for identity service to dynamically reload certs * Fix failing tests * Minor refactor * Load trust anchors from identity issuer secret * Make identity service actually watch for issuer certs updates * Add some testing around cmd line identity options validation * Add tests ensuring that identity service loads issuer * Take into account external-issuer flag during upgrade + tests * Fix failing upgrade test * Address initial review feedback * Address further review feedback on cli and helm * Do not persist --identity-external-issuer * Some improvements to identitiy service * Bring back persistane of external issuer flag * Address more feedback * Update dockerfiles shas * Publishing k8s events on issuer certs rotation * Ensure --ignore-cluster+external issuer is not supported * Update go-deps shas * Transition to identity issuer scheme based configuration * Use k8s consts for secret file names Signed-off-by: zaharidichev <zaharidichev@gmail.com>	2019-10-24 13:15:14 -07:00
Daniel Mangum	fa01b49998	proxy injector: mwc match expressions admission-webhooks disabled (#3460 ) When running linkerd in HA mode, a cluster can be broken by bringing down the proxy-injector. Add a label to MWC namespace selctor that skips any namespace. Fixes #3346 Signed-off-by: hasheddan <georgedanielmangum@gmail.com>	2019-09-24 19:28:16 -07:00
Alejandro Pedraza	1653f88651	Put the destination controller into its own deployment (#3407 ) * Put the destination controller into its own deployment Fixes #3268 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-09-18 13:41:06 -05:00
Alejandro Pedraza	17dd9bf6bc	Couple of injection events fixes (#3363 ) * Couple of injection events fixes When generating events in quick succession against the same target, client-go issues a PATCH request instead of a POST, so we need the extra RBAC permission. Also we have an informer on pods, so we also need the "watch" permission for them, whose omission was causing an error entry in the logs. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-09-04 11:57:20 -05:00
Alejandro Pedraza	02efb46e45	Have the proxy-injector emit events upon injection/skipping injection (#3316 ) * Have the proxy-injector emit events upon injection/skipping injection Fixes #3253 Have the proxy-injector emit an event whenever a injection happens, or when injection is skipped for some reason (also added that reason into the proxy-injector logs). The level is associated to the parent workload (it can't be associated to the pod because at this point the pod hasn't been persisted). The event recorder was setup at the `webhook/server.go` level and passed to the proxy-injector's `Inject` function. The sp-validator thus also has access to the event recorder, but for now it's not using it. Related changes: - Refactored `api.GetOwnerKindAndName()` to have it return a more generic object. - Refactored `report.Injectable()` to also have it return the reason why a workload is not injectable. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-26 13:34:36 -05:00
Ivan Sim	4d01e3720e	Update install and upgrade code to use the new helm charts (#3229 ) * Delete symlink to old Helm chart * Update 'install' code to use common Helm template structs * Remove obsolete TLS assets functions. These are now handle by Helm functions inside the templates * Read defaults from values.yaml and values-ha.yaml * Ensure that webhooks TLS assets are retained during upgrade * Fix a few bugs in the Helm templates (see bullet points): * Merge the way the 'install' ha and non-ha options are handled into one function * Honor the 'NoInitContainer' option in the components templates * Control plane mTLS will not be disabled if identity context in the config map is empty. The data plane mTLS will still be automatically disabled if the context is nil. * Resolve test failures from rebase with master * Fix linter issues * Set service account mount path read-only field * Add TLS variables of the webhooks and tap to values.yaml During upgrade, these secrets are preserved to ensure they remain synced wih the CA bundle in the webhook configurations. These Helm variables are used to override the defaults in the templates. * Remove obsolete 'chart' folder * Fix bugs in templates * Handle missing webhooks and tap TLS assets during upgrade When upgrading from an older version that don't have these secrets, fallback to let Helm create them by creating an empty charts.TLS struct. * Revert the selector labels of webhooks to be compatible with that in 2.4 In 2.4, the proxy injector and profile validator webhooks already have their selector labels defined. Since these attributes are immutable, the recent change to these selectors introduced by the Helm chart work will cause upgrade to fail. * Alejandro's feedback * Siggy's feedback * Removed redundant unexported custom types Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-13 14:16:24 -07:00
Thomas Rampelberg	ca5b4fab2e	Add container metrics and grafana dashboard (#3217 ) * Add container metrics and grafana dashboard * Review cleanup * Update templates	2019-08-12 08:03:57 -07:00
Andrew Seigner	43bc175ea9	Enable tap-admin ClusterRole privileges for `` (#3214 ) The `linkerd-linkerd-tap-admin` ClusterRole had `watch` privileges on `/tap` resources. This disallowed non-namespaced tap requests of the form: `/apis/tap.linkerd.io/v1alpha1/watch/namespaces/linkerd/tap`, because that URL structure is interpreted by the Kubernetes API as watching a resource of type `tap` within the linkerd namespace, rather than tapping the linkerd namespace. Modify `linkerd-linkerd-tap-admin` to have `watch` privileges on ``, enabling any request of the form `/apis/tap.linkerd.io/v1alpha1/watch/namespaces/linkerd/` to succeed. Fixes #3212 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-08 12:04:03 -07:00
Andrew Seigner	0ff39ddf8d	Introduce tap-admin ClusterRole, web privs flag (#3203 ) The web dashboard will be migrating to the new Tap APIService, which requires RBAC privileges to access. Introduce a new ClusterRole, `linkerd-linkerd-tap-admin`, which gives cluster-wide tap privileges. Also introduce a new ClusterRoleBinding, `linkerd-linkerd-web-admin` which binds the `linkerd-web` service account to the new tap ClusterRole. This ClusterRoleBinding is enabled by default, but may be disabled via a new `linkerd install` flag `--restrict-dashboard-privileges`. Fixes #3177 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-08 10:28:35 -07:00
Andrew Seigner	a59c1dd32d	Introduce tap APIService, update `linkerd tap` (#3167 ) The Tap Service enabled tapping of any meshed pod, regardless of user privilege. This change introduces a new Tap APIService. Kubernetes provides authentication and authorization of Tap requests, and then forwards requests to a new Tap APIServer, which implements a Kubernetes aggregated APIServer. The Tap APIServer authenticates the client TLS from Kubernetes, and authorizes the user via a SubjectAccessReview. This change also modifies the `linkerd tap` command to make requests against the new APIService. The Tap APIService implements these Kubernetes-style endpoints: POST /apis/tap.linkerd.io/v1alpha1/watch/namespaces/:ns/tap POST /apis/tap.linkerd.io/v1alpha1/watch/namespaces/:ns/:res/:name/tap GET /apis GET /apis/tap.linkerd.io GET /apis/tap.linkerd.io/v1alpha1 GET /healthz GET /healthz/log GET /healthz/ping GET /metrics GET /openapi/v2 GET /version Users authorize to the new `tap.linkerd.io/v1alpha1` via RBAC. Only the `watch` verb is supported. Access is also available via subresources such as `deployments/tap` and `pods/tap`. This change introduces the following resources into the default Linkerd install: - Global - APIService/v1alpha1.tap.linkerd.io - ClusterRoleBinding/linkerd-linkerd-tap-auth-delegator - `linkerd` namespace: - Secret/linkerd-tap-tls - `kube-system` namespace: - RoleBinding/linkerd-linkerd-tap-auth-reader Tasks not covered by this PR: - `linkerd top` - `linkerd dashboard` - `linkerd profile --tap` - removal of the unauthenticated tap controller Fixes #2725, #3162, #3172 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-01 14:02:45 -07:00
Andrew Seigner	64ed8e4a74	Introduce Cluster Heartbeat cronjob (#3056 ) `linkerd check`, the web dashboard, and Grafana all perform version checks to validate Linkerd is up to date. It's common for users to seldom execute these codepaths. This makes it difficult to identify what versions of Linkerd are currently in use and what environments it is being run in, which helps prioritize testing and backports. Introduce a `heartbeat` CronJob to the default Linkerd install. The cronjob executes every 24 hours, starting from 5 minutes after `linkerd install` is run. Example check URL: https://versioncheck.linkerd.io/version.json? install-time=1562761177& k8s-version=v1.15.0& meshed-pods=8& rps=3& source=heartbeat& uuid=cc4bb700-3314-426a-9f0f-ec588b9df020& version=git-b97ee9f7 Fixes #2961 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 17:12:30 -07:00
Alex Leong	d6ef9ea460	Update ServiceProfile CRD to version v1alpha2 and remove validation (#3078 ) The openAPIV3Schema validation in the ServiceProfiles CRD is very limited in what it can validate and is obviated by more sophisticated validation done by the validating admission controller. Therefore, we would like to remove the openAPIV3Schema validation to reduce the size and complexity of the CRD object. To do so, we must also bump the version of the ServiceProfile custom resource from v1alpha1 to v1alpha2. This ensures that when the controller is upgraded, it will attempt to watch the v1alpha2 resource. If it cannot (because, for example, the controller pod started before the ServiceProfile CRD was updated and therefore the v1alpha2 version does not exist) then it will go into a crash loop backoff until it can. This essentially means that the controller will wait for the CRD to be upgraded to include v1alpha2 before it will start. Bumping the version is necessary because if we did not, it would be possible for the controller to start before the CRD is updated (removing the validation). In this case, when the CRD is edited, the controller will lose its list watch on ServiceProfiles and will stop getting updates. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-23 11:46:31 -07:00
Ivan Sim	7e1c14e783	Add the 'linkerd.io/control-plane-ns' label to the Traffic Split CRD (#3026 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-07-02 15:46:25 -07:00
Alex Leong	27373a8b78	Add traffic splitting to destination profiles (#2931 ) This change implements the DstOverrides feature of the destination profile API (aka traffic splitting). We add a TrafficSplitWatcher to the destination service which watches for TrafficSplit resources and notifies subscribers about TrafficSplits for services that they are subscribed to. A new TrafficSplitAdaptor then merges the TrafficSplit logic into the DstOverrides field of the destination profile. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-06-28 13:19:47 -07:00
Tarun Pothulapati	a3ce06bd80	Add sideEffects field to Webhooks (#2963 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-06-21 11:06:10 -07:00
Ivan Sim	435fe861d0	Label all Linkerd resources (#2971 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-20 09:44:30 -07:00
Ivan Sim	e2e976cce9	Add `NET_RAW` capability to the proxy-init container (#2969 ) Also, update control plane PSP to match linkerd/website#94 Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-19 19:34:37 -07:00
Dennis Adjei-Baah	694ba9c2cb	Revert add namespace name to MWC (#2946 ) * revert add namespace name to MWC	2019-06-14 15:26:34 -07:00
Alejandro Pedraza	7fc6c195ad	Set MWC and VWC failure policy to 'fail' in HA mode only (#2943 ) Fixes #2927 Also moved `TestInstallSP` after `TestCheckPostInstall` so we're sure the validating webhook is ready before installing a service profile. Signed-off-by: Alejandro Pedraza Borrero <alejandro@buoyant.io>	2019-06-14 11:50:59 -05:00
Alejandro Pedraza	28025eeb56	Remove UPDATE event from the mutating webhook config (#2919 ) Fixes #2889 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-13 15:42:47 -05:00
Alejandro Pedraza	e9bf014d34	Remove MWVC RBAC from webhook configs (#2925 ) Fixes #2890 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-13 15:42:00 -05:00
Dennis Adjei-Baah	8aef9280dd	add namespace name to MWC (#2905 ) When installing multiple control planes, the mutatingwebhookconfiguration of the first control plane gets overwritten by any subsequent control plane install. This is caused by the fixed name given to the mutatingwebhookconfiguration manifest at install time. This commit adds in the namespace to the manifest so that there is a unique configuration for each control plane. Fixes #2887	2019-06-13 12:15:43 -07:00
Ivan Sim	ecc4465cd1	Introduce Control Plane's PSP and RBAC resources into Helm templates (#2920 ) * Add control plane and CNI PSP and RBAC resources * Add the '--linkerd-cni-enabled' flag to the multi-stage install subcommands This flag ensures that the NET_ADMIN capability is omitted from the control plane's PSP during 'install config' and the proxy-init containers aren't injected during 'install control-plane'. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-12 20:18:46 -07:00
Alejandro Pedraza	8416d326c2	If HA, set the webhooks failure policy to 'Fail' (#2906 ) * If HA, set the webhooks failure policy to 'Fail' I'm adding to the linkerd namespace a new label `linkerd.io/is-control-plane: true` that is used in the webhook configs' selector to skip the proxy injector for this namespace. This avoids running into the timing issues described in #2852. Fixes #2852 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-11 13:11:54 -05:00
Tarun Pothulapati	1a574def1f	Added labels to webhook configurations in charts/ (#2853 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-05-28 16:44:09 -05:00
Dennis Adjei-Baah	b9cc66c6c8	fixes roleRef name in linkerd-tap rbac (#2845 ) * fixes roleRef name in linkerd-tap rbac	2019-05-28 10:05:01 -07:00
Ivan Sim	5a5f8bbfe8	Install MWC and VWC During Installation (#2806 ) * Update helm charts to include webhooks config and TLS secret * Update the webhooks to read the secret cert and key * Update webhooks to not recreate config on restart * Ensure upgrade preserve existing secrets * Revert the change to rename the webhook configs The renaming change breaks upgrade, where the new webhook configs conflict with the existing ones. The older resources aren't deleted during upgrade because they are dynamically created. * Make the secret volume read-only * Remove unnecessary exported getter functions * Remove obsolete mwc and vwc templates Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-05-20 12:43:50 -07:00
Dennis Adjei-Baah	a0fa1dff59	Move tap service into its own pod. (#2773 ) * Split tap into its own pod in the control plane Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-05-15 16:28:44 -05:00
Andrew Seigner	be60b37e93	Group Web and Grafana ServiceAccounts with RBAC (#2756 ) All ServiceAccounts are intended to be grouped together with other RBAC resources, particularly for `linkerd install config` output. Grafana and Web ServiceAccounts were still included with their respective Deployments. Group Grafana and Web ServiceAccounts with other RBAC resources. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 17:33:05 -07:00
Andrew Seigner	15ffd86cf1	Introduce multi-stage upgrade (#2723 ) `linkerd install` supports a 2-stage install process, `linkerd upgrade` did not. Add 2-stage support for `linkerd upgrade`. Also exercise multi-stage functionality during upgrade integration tests. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 14:29:52 -07:00
Alex Leong	4ea7c62b0d	Revert " Remove validation from service profile CRD definition (#2740 )" (#2752 ) This reverts commit `3de16d47be`. #2740 modified the ServiceProfiles CRD which will cause issues for users upgrading from the old CRD version to the new version. #2748 was an attempt to fix this by bumping the service profile CRD version, however, our testing infrastructure is not well set up to accommodate changes to CRDs because they are resources which are global to the cluster. We revert this change for now and will revisit it in the future when we can give more thought to CRD versioning, upgrade, and testing. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-04-25 13:40:20 -07:00
Alejandro Pedraza	53bb7c47f6	Make the auto-injector required and removed proxy-auto-inject flag (#2733 ) Make the auto-injector required and removed proxy-auto-inject flag Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 13:06:51 -05:00
Alex Leong	3de16d47be	Remove validation from service profile CRD definition (#2740 ) Fixes #2736 Signed-off-by: Alex Leong <alex@buoyant.io>	2019-04-23 16:10:50 -07:00
Andrew Seigner	b2b4780430	Introduce install stages (#2719 ) This change introduces two named parameters for `linkerd install`, split by privilege: - `linkerd install config` - Namespace - ClusterRoles - ClusterRoleBindings - CustomResourceDefinition - ServiceAccounts - `linkerd install control-plane` - ConfigMaps - Secrets - Deployments - Services Comprehensive `linkerd install` is still supported. TODO: - `linkerd check` support - `linkerd upgrade` support - integration tests Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-23 14:52:34 -07:00

42 Commits