linkerd2

Commit Graph

Author	SHA1	Message	Date
Andrew Seigner	a8830b2323	Set heartbeat cronjobs to not restart on failure (#3174 ) The heartbeat cronjob specified `restartPolicy: OnFailure`. In cases where failure was non-transient, such as if a cluster did not have internet access, this would continuously restart and fail. Change the heartbeat cronjob to `restartPolicy: Never`, as a failed job has no user-facing impact. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-31 13:51:13 -07:00
Cody Vandermyn	808fa381f9	A Slightly More Restrictive PSP (#3085 ) * Adds more PSP restrictions * Update test fixtures * Updates PSP to be conditional on initContainer - The proxy-init container runs as root and needs the PSP to allow this user when there is an init container. Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2019-07-24 10:12:33 -07:00
Andrew Seigner	c832d354f2	Fix resources yaml indentation (#3134 ) The `_resources.yaml` partial hard-coded indentation, making it cumbersome to use it in contexts that did not have a specific indentation level. Remove indentation from `_resources.yaml`, and instead specify required indentation at the call site via `nindent`. Fixes #3119 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-24 10:01:48 -07:00
Andrew Seigner	64ed8e4a74	Introduce Cluster Heartbeat cronjob (#3056 ) `linkerd check`, the web dashboard, and Grafana all perform version checks to validate Linkerd is up to date. It's common for users to seldom execute these codepaths. This makes it difficult to identify what versions of Linkerd are currently in use and what environments it is being run in, which helps prioritize testing and backports. Introduce a `heartbeat` CronJob to the default Linkerd install. The cronjob executes every 24 hours, starting from 5 minutes after `linkerd install` is run. Example check URL: https://versioncheck.linkerd.io/version.json? install-time=1562761177& k8s-version=v1.15.0& meshed-pods=8& rps=3& source=heartbeat& uuid=cc4bb700-3314-426a-9f0f-ec588b9df020& version=git-b97ee9f7 Fixes #2961 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 17:12:30 -07:00
Andrew Seigner	48a69cb88a	Bump Prometheus to 2.11.1, Grafana to 6.2.5 (#3123 ) - set `disable_sanitize_html` in `grafana.ini`. - make all text box dimensions whole integers to fix dropdown issue, reported in: https://github.com/linkerd/linkerd2/issues/2955#issuecomment-503085444 - rev all dashboards to `schemaVersion` 18 for Grafana 6.2.5 - `prometheus-benchmark.json` based on: https://grafana.com/grafana/dashboards/9761 - `prometheus.json` based on: `69c93e6401/public/app/plugins/datasource/prometheus/dashboards/prometheus_2_stats.json` - `grafana.json` based on: `85aed0276e/public/app/plugins/datasource/prometheus/dashboards/grafana_stats.json` Fixes #2955 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 13:37:56 -07:00
Alex Leong	d6ef9ea460	Update ServiceProfile CRD to version v1alpha2 and remove validation (#3078 ) The openAPIV3Schema validation in the ServiceProfiles CRD is very limited in what it can validate and is obviated by more sophisticated validation done by the validating admission controller. Therefore, we would like to remove the openAPIV3Schema validation to reduce the size and complexity of the CRD object. To do so, we must also bump the version of the ServiceProfile custom resource from v1alpha1 to v1alpha2. This ensures that when the controller is upgraded, it will attempt to watch the v1alpha2 resource. If it cannot (because, for example, the controller pod started before the ServiceProfile CRD was updated and therefore the v1alpha2 version does not exist) then it will go into a crash loop backoff until it can. This essentially means that the controller will wait for the CRD to be upgraded to include v1alpha2 before it will start. Bumping the version is necessary because if we did not, it would be possible for the controller to start before the CRD is updated (removing the validation). In this case, when the CRD is edited, the controller will lose its list watch on ServiceProfiles and will stop getting updates. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-23 11:46:31 -07:00
Tarun Pothulapati	fcec1cfb8a	Added Anti Affinity when HA is configured (#2893 ) * Added Anti Affinity when HA is configured * Move check to validate() * Test output with anti-affinity when ha upgrade * Add anti-affinity to identity deployment * made host anti-affinity default when ha * Define affinity template in a separate file Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-07-18 10:03:25 -07:00
Ivan Sim	7e1c14e783	Add the 'linkerd.io/control-plane-ns' label to the Traffic Split CRD (#3026 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-07-02 15:46:25 -07:00
Alex Leong	27373a8b78	Add traffic splitting to destination profiles (#2931 ) This change implements the DstOverrides feature of the destination profile API (aka traffic splitting). We add a TrafficSplitWatcher to the destination service which watches for TrafficSplit resources and notifies subscribers about TrafficSplits for services that they are subscribed to. A new TrafficSplitAdaptor then merges the TrafficSplit logic into the DstOverrides field of the destination profile. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-06-28 13:19:47 -07:00
Tarun Pothulapati	5c5ec6d816	add admin port label to proxy-injector and sp-validator (#2984 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-06-27 17:25:49 -05:00
Tarun Pothulapati	a3ce06bd80	Add sideEffects field to Webhooks (#2963 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-06-21 11:06:10 -07:00
Ivan Sim	435fe861d0	Label all Linkerd resources (#2971 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-20 09:44:30 -07:00
Ivan Sim	e2e976cce9	Add `NET_RAW` capability to the proxy-init container (#2969 ) Also, update control plane PSP to match linkerd/website#94 Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-19 19:34:37 -07:00
Dennis Adjei-Baah	694ba9c2cb	Revert add namespace name to MWC (#2946 ) * revert add namespace name to MWC	2019-06-14 15:26:34 -07:00
Alejandro Pedraza	7fc6c195ad	Set MWC and VWC failure policy to 'fail' in HA mode only (#2943 ) Fixes #2927 Also moved `TestInstallSP` after `TestCheckPostInstall` so we're sure the validating webhook is ready before installing a service profile. Signed-off-by: Alejandro Pedraza Borrero <alejandro@buoyant.io>	2019-06-14 11:50:59 -05:00
Alejandro Pedraza	28025eeb56	Remove UPDATE event from the mutating webhook config (#2919 ) Fixes #2889 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-13 15:42:47 -05:00
Alejandro Pedraza	e9bf014d34	Remove MWVC RBAC from webhook configs (#2925 ) Fixes #2890 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-13 15:42:00 -05:00
Dennis Adjei-Baah	8aef9280dd	add namespace name to MWC (#2905 ) When installing multiple control planes, the mutatingwebhookconfiguration of the first control plane gets overwritten by any subsequent control plane install. This is caused by the fixed name given to the mutatingwebhookconfiguration manifest at install time. This commit adds in the namespace to the manifest so that there is a unique configuration for each control plane. Fixes #2887	2019-06-13 12:15:43 -07:00
Ivan Sim	ecc4465cd1	Introduce Control Plane's PSP and RBAC resources into Helm templates (#2920 ) * Add control plane and CNI PSP and RBAC resources * Add the '--linkerd-cni-enabled' flag to the multi-stage install subcommands This flag ensures that the NET_ADMIN capability is omitted from the control plane's PSP during 'install config' and the proxy-init containers aren't injected during 'install control-plane'. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-12 20:18:46 -07:00
Alejandro Pedraza	8416d326c2	If HA, set the webhooks failure policy to 'Fail' (#2906 ) * If HA, set the webhooks failure policy to 'Fail' I'm adding to the linkerd namespace a new label `linkerd.io/is-control-plane: true` that is used in the webhook configs' selector to skip the proxy injector for this namespace. This avoids running into the timing issues described in #2852. Fixes #2852 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-11 13:11:54 -05:00
Tarun Pothulapati	590249c66b	HA for proxy-injector and sp-validator (#2874 ) * Added labels to webhook configurations in charts/ * Multiple replicas of proxy-injector and sp-validator in HA * Use ControllerComponent template variable for webhookconfigurations Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-05-31 14:48:30 -07:00
Tarun Pothulapati	1a574def1f	Added labels to webhook configurations in charts/ (#2853 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-05-28 16:44:09 -05:00
Dennis Adjei-Baah	b9cc66c6c8	fixes roleRef name in linkerd-tap rbac (#2845 ) * fixes roleRef name in linkerd-tap rbac	2019-05-28 10:05:01 -07:00
Ivan Sim	5a5f8bbfe8	Install MWC and VWC During Installation (#2806 ) * Update helm charts to include webhooks config and TLS secret * Update the webhooks to read the secret cert and key * Update webhooks to not recreate config on restart * Ensure upgrade preserve existing secrets * Revert the change to rename the webhook configs The renaming change breaks upgrade, where the new webhook configs conflict with the existing ones. The older resources aren't deleted during upgrade because they are dynamically created. * Make the secret volume read-only * Remove unnecessary exported getter functions * Remove obsolete mwc and vwc templates Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-05-20 12:43:50 -07:00
Dennis Adjei-Baah	a0fa1dff59	Move tap service into its own pod. (#2773 ) * Split tap into its own pod in the control plane Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-05-15 16:28:44 -05:00
Andrew Seigner	be60b37e93	Group Web and Grafana ServiceAccounts with RBAC (#2756 ) All ServiceAccounts are intended to be grouped together with other RBAC resources, particularly for `linkerd install config` output. Grafana and Web ServiceAccounts were still included with their respective Deployments. Group Grafana and Web ServiceAccounts with other RBAC resources. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 17:33:05 -07:00
Andrew Seigner	15ffd86cf1	Introduce multi-stage upgrade (#2723 ) `linkerd install` supports a 2-stage install process, `linkerd upgrade` did not. Add 2-stage support for `linkerd upgrade`. Also exercise multi-stage functionality during upgrade integration tests. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 14:29:52 -07:00
Alex Leong	4ea7c62b0d	Revert " Remove validation from service profile CRD definition (#2740 )" (#2752 ) This reverts commit `3de16d47be`. #2740 modified the ServiceProfiles CRD which will cause issues for users upgrading from the old CRD version to the new version. #2748 was an attempt to fix this by bumping the service profile CRD version, however, our testing infrastructure is not well set up to accommodate changes to CRDs because they are resources which are global to the cluster. We revert this change for now and will revisit it in the future when we can give more thought to CRD versioning, upgrade, and testing. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-04-25 13:40:20 -07:00
Alejandro Pedraza	53bb7c47f6	Make the auto-injector required and removed proxy-auto-inject flag (#2733 ) Make the auto-injector required and removed proxy-auto-inject flag Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 13:06:51 -05:00
Alex Leong	3de16d47be	Remove validation from service profile CRD definition (#2740 ) Fixes #2736 Signed-off-by: Alex Leong <alex@buoyant.io>	2019-04-23 16:10:50 -07:00
Andrew Seigner	b2b4780430	Introduce install stages (#2719 ) This change introduces two named parameters for `linkerd install`, split by privilege: - `linkerd install config` - Namespace - ClusterRoles - ClusterRoleBindings - CustomResourceDefinition - ServiceAccounts - `linkerd install control-plane` - ConfigMaps - Secrets - Deployments - Services Comprehensive `linkerd install` is still supported. TODO: - `linkerd check` support - `linkerd upgrade` support - integration tests Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-23 14:52:34 -07:00
Andrew Seigner	2d9e3686e2	Split out config objects from install templates (#2714 ) This is an initial change to separate out config-specific k8s objects from the control-plane components. The eventual goal will be rendering these configs as the first stage of a multi-stage install. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-18 09:31:35 -07:00
Douglas Jordan	80634d6c8b	Create proxy-injector RBAC resources before deployment (#2707 ) Fixes #2694 Signed-off-by: Douglas Jordan <dwj300@gmail.com>	2019-04-17 10:51:00 -07:00
Katerina	938d64a16f	Web server updated to read the UUID from the linkerd-config ConfigMap. (#2603 ) Signed-off-by: Kateryna Melnyk <kattymelnyk@gmail.com>	2019-04-08 12:56:00 -07:00
Alejandro Pedraza	edb225069c	Add validation webhook for service profiles (#2623 ) Add validation webhook for service profiles Fixes #2075 Todo in a follow-up PRs: remove the SP check from the CLI check. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-05 16:10:47 -05:00
Kevin Lingerfelt	74e48ba301	Remove project injector's -no-init-container flag (#2635 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-04-04 11:09:47 -07:00
Andrew Seigner	e38ad7e9d1	Update Prometheus retention param (#2584 ) `storage.tsdb.retention` is deprecated in favor of `storage.tsdb.retention.time`. Replace all occurrences. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-29 10:45:02 -07:00
Oliver Gould	655632191b	config: Store install parameters with global config (#2577 ) When installing Linkerd, a user may override default settings, or may explicitly configure defaults. Consider install options like `--ha --controller-replicas=4` -- the `--ha` flag sets a new default value for the controller-replicas, and then we override it. When we later upgrade this cluster, how can we know how to configure the cluster? We could store EnableHA and ControllerReplicas configurations in the config, but what if, in a later upgrade, the default value changes? How can we know whether the user specified an override or just used the default? To solve this, we add an `Install` message into a new config. This message includes (at least) the CLI flags used to invoke install. upgrade does not specify defaults for install/proxy-options fields and, instead, uses the persisted install flags to populate default values, before applying overrides from the upgrade invocation. This change breaks the protobuf compatibility by altering the `installation_uuid` field introduced in `9c442f6885`. Because this change was not yet released (even in an edge release), we feel that it is safe to break. Fixes https://github.com/linkerd/linkerd2/issues/2574	2019-03-29 10:04:20 -07:00
Oliver Gould	93e7654eba	install: Replace EnableHA with resource values (#2572 ) This change moves resource-templating logic into a dedicated template, creates new values types to model kubernetes resource constraints, and changes the `--ha` flag's behavior to create these resource templates instead of hardcoding the resource constraints in the various templates.	2019-03-27 15:56:30 -07:00
Oliver Gould	fda2035d5c	Use "With .Values" scoping in all templates (#2570 ) Some of our templates have started to use 'with .Values' scoping to limit boilerplate within the tempates. This change makes this uniform in all templates.	2019-03-26 19:09:21 -07:00
Alejandro Pedraza	7efe385feb	Have the Webhook react to pod creation/update only (#2472 ) Have the Webhook react to pod creation/update only This was already working almost out-of-the-box, just had to: - Change the webhook config so it watches pods instead of deployments - Grant some extra ClusterRole permissions - Add the piece that figures what's the OwnerReference and add the label for it - Manually inject service account mount paths - Readd volumes tests Fixes #2342 and #1751 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-03-26 11:53:56 -05:00
Oliver Gould	da0330743f	Provide peer Identities via the Destination API (#2537 ) This change reintroduces identity hinting to the destination service. The Get endpoint includes identities for pods that are injected with an identity-mode of "default" and have the same linkerd control plane. A `serviceaccount` label is now also added to destination response metadata so that it's accessible in prometheus and tap.	2019-03-22 09:19:14 -07:00
Oliver Gould	21796be354	install: Create linkerd-config before pods (#2538 ) Because the linkerd-config resource is created after pods that require it, they can be started before the files are mounted, causing the pods to restart integration tests to fail. If we extract the config into its own template file, it can be inserted before pods are created.	2019-03-21 14:01:07 -07:00
Oliver Gould	f02730a90d	Check the cluster's config for install & inject (#2535 ) The introduction of identity in `0626fa37` created new state in the control plane's configuration that must be considered when re-installing the control plane or when injecting pods. This change alters `install` to fail if it would seem to conflict with an existing installation. This behavior may be disabled with the `--ignore-cluster` flag. Furthermore, `inject` now _requires_ that it can fetch a configuration from the control plane in order to operate. Otherwise the `--ignore-cluster` and `--disable-identity` flags must be specified. This change does not actually instrument pods to use identity yet---it lays the framework for proxy identity without changing the test fixture output (besides a change to how identity HA is configured). Fixes #2531	2019-03-21 12:49:46 -07:00
Oliver Gould	0626fa374a	install: Introduce the Identity controller (#2526 ) https://github.com/linkerd/linkerd2/pull/2521 introduces an "Identity" controller, but there is no way to include it in linkerd installation. This change alters the `install` flow as follows: - An Identity service is _always_ installed; - Issuer credentials may be specified via the CLI; - If no Issuer credentials are provided, they are generated each time `install` is called. - Proxies are NOT configured to use the identity service. - It's possible to override the credential generation logic---especially for tests---via install options that can be configured via the CLI.	2019-03-19 17:04:11 -07:00
Oliver Gould	91c5f07650	proxy: Upgrade to identity-capable proxy (#2524 ) The new proxy has changed its configuration as follows: - `LISTENER` urls are now `LISTEN_ADDR` addresses; - `CONTROL_URL` is now `DESTINATION_SVC_ADDR`; - `_NAMESPACE` vars are no longer needed; - The `PROXY_ID` is now the `DESTINATION_CONTEXT`; - The "metrics" port is now the "admin" port, since it serves more than just metrics; - A readiness probe now checks a dedicated /ready endpoint eagerly. Identity injection is NOT* configured by this branch.	2019-03-19 14:20:39 -07:00
Oliver Gould	81f645da66	Remove `--tls=optional` and `linkerd-ca` (#2515 ) The proxy's TLS implementation has changed to use a new _Identity_ controller. In preparation for this, the `--tls=optional` CLI flag has been removed from install and inject; and the `ca` controller has been deleted. Metrics and UI treatments for TLS have not been removed, as they will continue to be valuable for the new Identity system. With the removal of the old identity scheme, the Destination service's proxy ID field is now set with an opaque string (e.g. `ns:emojivoto`) to enable locality awareness.	2019-03-18 17:40:31 -07:00
Gaurav Kumar	d0bdd4ffb4	Allow configuration of Prometheus log level (#2484 ) (#2487 ) Signed-off-by: Gaurav Kumar <gaurav.kumar9825@gmail.com>	2019-03-18 10:34:58 -07:00
Andrew Seigner	e5d2460792	Remove single namespace functionality (#2474 ) linkerd/linkerd2#1721 introduced a `--single-namespace` install flag, enabling the control-plane to function within a single namespace. With the introduction of ServiceProfiles, and upcoming identity changes, this single namespace mode of operation is becoming less viable. This change removes the `--single-namespace` install flag, and all underlying support. The control-plane must have cluster-wide access to operate. A few related changes: - Remove `--single-namespace` from `linkerd check`, this motivates combining some check categories, as we can always assume cluster-wide requirements. - Simplify the `k8s.ResourceAuthz` API, as callers no longer need to make a decision based on cluster-wide vs. namespace-wide access. Components either have access, or they error out. - Modify the web dashboard to always assume ServiceProfiles are enabled. Reverts #1721 Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-12 00:17:22 -07:00
Alejandro Pedraza	0da851842b	Public API endpoint `Config()` (#2455 ) Public API endpoint `Config()` Retrieves Global and Proxy configurations. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-03-07 17:37:46 -05:00

1 2

66 Commits