linkerd2

Commit Graph

Author	SHA1	Message	Date
Andrew Seigner	64ed8e4a74	Introduce Cluster Heartbeat cronjob (#3056 ) `linkerd check`, the web dashboard, and Grafana all perform version checks to validate Linkerd is up to date. It's common for users to seldom execute these codepaths. This makes it difficult to identify what versions of Linkerd are currently in use and what environments it is being run in, which helps prioritize testing and backports. Introduce a `heartbeat` CronJob to the default Linkerd install. The cronjob executes every 24 hours, starting from 5 minutes after `linkerd install` is run. Example check URL: https://versioncheck.linkerd.io/version.json? install-time=1562761177& k8s-version=v1.15.0& meshed-pods=8& rps=3& source=heartbeat& uuid=cc4bb700-3314-426a-9f0f-ec588b9df020& version=git-b97ee9f7 Fixes #2961 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 17:12:30 -07:00
Andrew Seigner	48a69cb88a	Bump Prometheus to 2.11.1, Grafana to 6.2.5 (#3123 ) - set `disable_sanitize_html` in `grafana.ini`. - make all text box dimensions whole integers to fix dropdown issue, reported in: https://github.com/linkerd/linkerd2/issues/2955#issuecomment-503085444 - rev all dashboards to `schemaVersion` 18 for Grafana 6.2.5 - `prometheus-benchmark.json` based on: https://grafana.com/grafana/dashboards/9761 - `prometheus.json` based on: `69c93e6401/public/app/plugins/datasource/prometheus/dashboards/prometheus_2_stats.json` - `grafana.json` based on: `85aed0276e/public/app/plugins/datasource/prometheus/dashboards/grafana_stats.json` Fixes #2955 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 13:37:56 -07:00
arminbuerkle	010efac24b	Allow custom cluster domain in controller components (#2950 ) * Allow custom cluster domain in destination watcher The change relaxes the constrains of an authority requiring a `svc.cluster.local` suffix to only require `svc` as third part. A unit test could be added though the destination/server and endpoint watcher already test this behaviour. * Update proto to allow setting custom cluster domain Update golden templates * Allow setting custom domain in grpc, web server * Remove cluster domain flags from web srv and public api * Set defaultClusterDomain in validateAndBuild if none is set Signed-off-by: Armin Buerkle <armin.buerkle@alfatraining.de>	2019-07-23 08:59:41 -07:00
Tarun Pothulapati	fcec1cfb8a	Added Anti Affinity when HA is configured (#2893 ) * Added Anti Affinity when HA is configured * Move check to validate() * Test output with anti-affinity when ha upgrade * Add anti-affinity to identity deployment * made host anti-affinity default when ha * Define affinity template in a separate file Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-07-18 10:03:25 -07:00
Andrew Seigner	7756828ae6	Update install failure message to list resources (#3050 ) The existing `linkerd install` error message for existing resources was shared with `linkerd check`. Given the different contexts, the messaging made more sense for `linkerd check` than for `linkerd install`. Modify the error messaging for `linkerd install` to print a bare list of existing resources, and provide instructions for proceeding. For example: ```bash $ linkerd install Unable to install the Linkerd control plane. It appears that there is an existing installation: clusterrole.rbac.authorization.k8s.io/linkerd-linkerd-controller clusterrole.rbac.authorization.k8s.io/linkerd-linkerd-identity If you are sure you'd like to have a fresh install, remove these resources with: linkerd install --ignore-cluster \| kubectl delete -f - Otherwise, you can use the --ignore-cluster flag to overwrite the existing global resources. ``` Fixes #3045 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-09 20:21:19 +02:00
Andrew Seigner	9e09bd5e98	Mark High Availability as non-experimental (#3049 ) Fixes #2419 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-09 20:20:28 +02:00
Andrew Seigner	94fa653cf3	Fix `linkerd check` missing uuid on version check (#3040 ) PR #2603 modified the web process to read the UUID from the `linkerd-config` ConfigMap rather than from a command line flag. The `linkerd check` command relied on that command line flag to retrieve the UUID as part of its version check. Modify `linkerd check` to correctly retrieve the UUID from `linkerd-config`. Also refactor `linkerd-config` retrieval and parsing code to be shared between healthcheck, install, and upgrade. Relates to #2961 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-05 19:39:13 +02:00
Ivan Sim	7e1c14e783	Add the 'linkerd.io/control-plane-ns' label to the Traffic Split CRD (#3026 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-07-02 15:46:25 -07:00
Alex Leong	27373a8b78	Add traffic splitting to destination profiles (#2931 ) This change implements the DstOverrides feature of the destination profile API (aka traffic splitting). We add a TrafficSplitWatcher to the destination service which watches for TrafficSplit resources and notifies subscribers about TrafficSplits for services that they are subscribed to. A new TrafficSplitAdaptor then merges the TrafficSplit logic into the DstOverrides field of the destination profile. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-06-28 13:19:47 -07:00
Ivan Sim	866fe6fa5e	Introduce global resources checks to install and multi-stage install (#2987 ) * Introduce new checks to determine existence of global resources and the 'linkerd-config' config map. * Update pre-check to check for existence of global resources This ensures that multiple control planes can't be installed into different namespaces. * Update integration test clean-up script to delete psp and crd Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-27 09:59:12 -07:00
Andrew Seigner	81790b6735	Bump Prometheus to v2.10.0 (#2979 ) Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-06-21 12:51:31 -07:00
Tarun Pothulapati	a3ce06bd80	Add sideEffects field to Webhooks (#2963 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-06-21 11:06:10 -07:00
Ivan Sim	435fe861d0	Label all Linkerd resources (#2971 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-20 09:44:30 -07:00
Alejandro Pedraza	7fc6c195ad	Set MWC and VWC failure policy to 'fail' in HA mode only (#2943 ) Fixes #2927 Also moved `TestInstallSP` after `TestCheckPostInstall` so we're sure the validating webhook is ready before installing a service profile. Signed-off-by: Alejandro Pedraza Borrero <alejandro@buoyant.io>	2019-06-14 11:50:59 -05:00
Ivan Sim	ecc4465cd1	Introduce Control Plane's PSP and RBAC resources into Helm templates (#2920 ) * Add control plane and CNI PSP and RBAC resources * Add the '--linkerd-cni-enabled' flag to the multi-stage install subcommands This flag ensures that the NET_ADMIN capability is omitted from the control plane's PSP during 'install config' and the proxy-init containers aren't injected during 'install control-plane'. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-12 20:18:46 -07:00
Alejandro Pedraza	8416d326c2	If HA, set the webhooks failure policy to 'Fail' (#2906 ) * If HA, set the webhooks failure policy to 'Fail' I'm adding to the linkerd namespace a new label `linkerd.io/is-control-plane: true` that is used in the webhook configs' selector to skip the proxy injector for this namespace. This avoids running into the timing issues described in #2852. Fixes #2852 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-11 13:11:54 -05:00
Dan	24bbd7c64b	Ensure Prometheus log level is lowercase (#2823 ) (#2870 ) Signed-off-by: Daniel Baranowski <daniel.baranowski@infinityworks.com>	2019-06-07 09:57:08 -07:00
Alejandro Pedraza	66eb829e5a	Fix HA during upgrade (#2900 ) * Fix HA during upgrade If we have a Linkerd installation with HA, and then we do `linkerd upgrade` without specifying `--ha`, the replicas will get set back to 1, yet the resource requests will keep their HA values. Desired behavior: `linkerd install --ha` adds the `ha` value into the linkerd-config, so it should be used during upgrade even if `--ha` is not passed to `linkerd upgrade`. Note we still can do `linkerd upgrade --ha=false` to disable HA. This is a prerequesite to address #2852 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-06 17:27:27 -05:00
Alejandro Pedraza	74ca92ea25	Split proxy-init into separate repo (#2824 ) Split proxy-init into separate repo Fixes #2563 The new repo is https://github.com/linkerd/linkerd2-proxy-init, and I tagged the latest there `v1.0.0`. Here, I've removed the `/proxy-init` dir and pinned the injected proxy-init version to `v1.0.0` in the injector code and tests. `/cni-plugin` depends on proxy-init, so I updated the import paths there, and could verify CNI is still working (there is some flakiness but unrelated to this PR). For consistency, I added a `--init-image-version` flag to `linkerd inject` along with its corresponding override config annotation. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-03 16:24:05 -05:00
Ivan Sim	5a5f8bbfe8	Install MWC and VWC During Installation (#2806 ) * Update helm charts to include webhooks config and TLS secret * Update the webhooks to read the secret cert and key * Update webhooks to not recreate config on restart * Ensure upgrade preserve existing secrets * Revert the change to rename the webhook configs The renaming change breaks upgrade, where the new webhook configs conflict with the existing ones. The older resources aren't deleted during upgrade because they are dynamically created. * Make the secret volume read-only * Remove unnecessary exported getter functions * Remove obsolete mwc and vwc templates Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-05-20 12:43:50 -07:00
Dennis Adjei-Baah	a0fa1dff59	Move tap service into its own pod. (#2773 ) * Split tap into its own pod in the control plane Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-05-15 16:28:44 -05:00
Andrew Seigner	266e882d79	Define multi-stage commands as subcommands (#2772 ) The multi-stage args used by install, upgrade, and check were implemented as positional arguments to their respective parent commands. This made the help documentation unclear, and the code ambiguous as to which flags corresponded to which stage. Define `config` and `control-plane` stages as subcommands. The help menus now explicitly state flags supported. Fixes #2729 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-05-02 12:32:01 +02:00
Andrew Seigner	be60b37e93	Group Web and Grafana ServiceAccounts with RBAC (#2756 ) All ServiceAccounts are intended to be grouped together with other RBAC resources, particularly for `linkerd install config` output. Grafana and Web ServiceAccounts were still included with their respective Deployments. Group Grafana and Web ServiceAccounts with other RBAC resources. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 17:33:05 -07:00
Andrew Seigner	15ffd86cf1	Introduce multi-stage upgrade (#2723 ) `linkerd install` supports a 2-stage install process, `linkerd upgrade` did not. Add 2-stage support for `linkerd upgrade`. Also exercise multi-stage functionality during upgrade integration tests. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 14:29:52 -07:00
Andrew Seigner	ec540a882e	Consolidate k8s APIs (#2747 ) Numerous codepaths have emerged that create k8s configs, k8s clients, and make k8s api requests. This branch consolidates k8s client creation and APIs. The primary change migrates most codepaths to call `k8s.NewAPI` to instantiate a `KubernetesAPI` struct from `pkg`. `KubernetesAPI` implements the `kubernetes.Interface` (clientset) interface, and also persists a `client-go` `rest.Config`. Specific list of changes: - removes manual GET requests from `k8s.KubernetesAPI`, in favor of clientsets - replaces most calls to `k8s.GetConfig`+`kubernetes.NewForConfig` with a single `k8s.NewAPI` - introduces a `timeout` param to `k8s.NewAPI`, currently only used by healthchecks - removes `NewClientSet` in `controller/k8s/clientset.go` in favor of `k8s.NewAPI` - removes `httpClient` and `clientset` from `HealthChecker`, use `KubernetesAPI` instead Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 11:31:38 -07:00
Alejandro Pedraza	53bb7c47f6	Make the auto-injector required and removed proxy-auto-inject flag (#2733 ) Make the auto-injector required and removed proxy-auto-inject flag Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 13:06:51 -05:00
Alejandro Pedraza	62d9a80894	New `linkerd inject` default and manual modes (#2721 ) Fixes #2720 and 2711 This changes the default behavior of `linkerd inject` to not inject the proxy but just the `linkerd.io/inject: enabled` annotation for the auto-injector to pick it up (regardless of any namespace annotation). A new `--manual` mode was added, which behaves as before, injecting the proxy in the command output. The unit tests are running with `--manual` to avoid any changes in the fixtures. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 09:05:27 -05:00
Andrew Seigner	b2b4780430	Introduce install stages (#2719 ) This change introduces two named parameters for `linkerd install`, split by privilege: - `linkerd install config` - Namespace - ClusterRoles - ClusterRoleBindings - CustomResourceDefinition - ServiceAccounts - `linkerd install control-plane` - ConfigMaps - Secrets - Deployments - Services Comprehensive `linkerd install` is still supported. TODO: - `linkerd check` support - `linkerd upgrade` support - integration tests Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-23 14:52:34 -07:00
Ivan Sim	8d13084f94	Split the `linkerd-version` CLI flag into `control-plane-version` and `proxy-version` (#2702 ) * The 'linkerd-version' CLI flag is renamed to 'control-plane-version' * Add version field to proxy config * Add the control plane version to the global config * Unit test for init image version * Use more specific control plane and proxy versions in unit tests Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-19 11:35:20 -07:00
Andrew Seigner	2d9e3686e2	Split out config objects from install templates (#2714 ) This is an initial change to separate out config-specific k8s objects from the control-plane components. The eventual goal will be rendering these configs as the first stage of a multi-stage install. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-18 09:31:35 -07:00
Andrew Seigner	43cb3f841b	upgrade: unit tests (#2672 ) This change introduces some unit tests on individual methods in the upgrade code path, along with some minor cleanup. Part of #2637 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-10 14:54:13 -07:00
Oliver Gould	bbe1a60358	upgrade: Generate an Identity config if missing (#2656 ) When upgrading from an older cluster that has a Linkerd config but no identity, we need to generate an identity context so that the cluster is configured properly. Fixes #2650	2019-04-08 16:49:12 -07:00
Oliver Gould	ba65bd8039	Switch UUID implementation (#2667 ) The UUID implementation we use to generate install IDs is technically not random enough for secure uses, which ours is not. To prevent security scanners like SNYK from flagging this false-positive, let's just switch to the other UUID implementation (Already in our dependencies).	2019-04-08 10:58:02 -07:00
Oliver Gould	4fd1de4340	install: Don't reuse flag set (#2649 ) The instalOnlyFlagSet incorrectly extends the recordableFlagSet. I'm not sure if this has any potential for unexpected user interactions, but it's at least confusing when reading the code. This change makes the flag sets distinct.	2019-04-05 14:29:52 -07:00
Alejandro Pedraza	edb225069c	Add validation webhook for service profiles (#2623 ) Add validation webhook for service profiles Fixes #2075 Todo in a follow-up PRs: remove the SP check from the CLI check. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-05 16:10:47 -05:00
Oliver Gould	4c5378f586	install: Change --ha to set a 100m CPU request (#2644 ) When the --ha flag is set, we currently set a 10m CPU request, which corresponds to 1% of a core, which isn't actually enough to keep the proxy responding to health checks if you have 100 processes on the box. Let's give ourselves a little more breathing room. Fixes #2643	2019-04-05 13:41:00 -07:00
harsh jain	31706e5417	Fixes #2568 : Remove cni option from inject subcommand (#2573 ) Signed-off-by: harsh jain <harshjniitr@gmail.com>	2019-04-03 18:46:20 -07:00
Ivan Sim	a80335ed51	Disable external profiles by default (#2594 ) * Disable external profiles by default * Rename the --disable-external-profiles flag to --enable-external-profiles Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-01 15:13:50 -07:00
Oliver Gould	d74ca1bab0	cli: Introduce an upgrade command (#2564 ) The `install` command errors when the deploy target contains an existing Linkerd deployment. The `upgrade` command is introduced to reinstall or reconfigure the Linkerd control plane. Upgrade works as follows: 1. The controller config is fetched from the Kubernetes API. The Public API is not used, because we need to be able to reinstall the control plane when the Public API is not available; and we are not concerned about RBAC restrictions preventing the installer from reading the config (as we are for inject). 2. The install configuration is read, particularly the flags used during the last install/upgrade. If these flags were not set again during the upgrade, the previous values are used as if they were passed this time. The configuration is updated from the combination of these values, including the install configuration itself. Note that some flags, including the linkerd-version, are omitted since they are stored elsewhere in the configurations and don't make sense to track as overrides.. 3. The issuer secrets are read from the Kubernetes API so that they can be re-used. There is currently no way to reconfigure issuer certificates. We will need to create _another_ workflow for updating these credentials. 4. The install rendering is invoked with values and config fetched from the cluster, synthesized with the new configuration.	2019-04-01 13:27:41 -07:00
Oliver Gould	655632191b	config: Store install parameters with global config (#2577 ) When installing Linkerd, a user may override default settings, or may explicitly configure defaults. Consider install options like `--ha --controller-replicas=4` -- the `--ha` flag sets a new default value for the controller-replicas, and then we override it. When we later upgrade this cluster, how can we know how to configure the cluster? We could store EnableHA and ControllerReplicas configurations in the config, but what if, in a later upgrade, the default value changes? How can we know whether the user specified an override or just used the default? To solve this, we add an `Install` message into a new config. This message includes (at least) the CLI flags used to invoke install. upgrade does not specify defaults for install/proxy-options fields and, instead, uses the persisted install flags to populate default values, before applying overrides from the upgrade invocation. This change breaks the protobuf compatibility by altering the `installation_uuid` field introduced in `9c442f6885`. Because this change was not yet released (even in an edge release), we feel that it is safe to break. Fixes https://github.com/linkerd/linkerd2/issues/2574	2019-03-29 10:04:20 -07:00
Oliver Gould	93e7654eba	install: Replace EnableHA with resource values (#2572 ) This change moves resource-templating logic into a dedicated template, creates new values types to model kubernetes resource constraints, and changes the `--ha` flag's behavior to create these resource templates instead of hardcoding the resource constraints in the various templates.	2019-03-27 15:56:30 -07:00
Oliver Gould	24222da13b	install: Create auto-inject configuration (#2562 ) When reading a Linkerd configuration, we cannot determine whether auto-inject should be configured. This change adds auto-inject configuration to the global config structure. Currently, this configuration is effectively boolean, determined by the presence of an empty value (versus a null).	2019-03-26 15:28:54 -07:00
Oliver Gould	9c442f6885	Store install UUID in global config (#2561 ) Currently, the install UUID is regenerated each time `install` is run. When implementing cluster upgrades, it seems most appropriate to reuse the prior UUID, rather than generate a new one. To this end, this change stores an "Installation UUID" in the global linkerd config.	2019-03-26 08:45:40 -07:00
Oliver Gould	21796be354	install: Create linkerd-config before pods (#2538 ) Because the linkerd-config resource is created after pods that require it, they can be started before the files are mounted, causing the pods to restart integration tests to fail. If we extract the config into its own template file, it can be inserted before pods are created.	2019-03-21 14:01:07 -07:00
Oliver Gould	f02730a90d	Check the cluster's config for install & inject (#2535 ) The introduction of identity in `0626fa37` created new state in the control plane's configuration that must be considered when re-installing the control plane or when injecting pods. This change alters `install` to fail if it would seem to conflict with an existing installation. This behavior may be disabled with the `--ignore-cluster` flag. Furthermore, `inject` now _requires_ that it can fetch a configuration from the control plane in order to operate. Otherwise the `--ignore-cluster` and `--disable-identity` flags must be specified. This change does not actually instrument pods to use identity yet---it lays the framework for proxy identity without changing the test fixture output (besides a change to how identity HA is configured). Fixes #2531	2019-03-21 12:49:46 -07:00
Oliver Gould	0626fa374a	install: Introduce the Identity controller (#2526 ) https://github.com/linkerd/linkerd2/pull/2521 introduces an "Identity" controller, but there is no way to include it in linkerd installation. This change alters the `install` flow as follows: - An Identity service is _always_ installed; - Issuer credentials may be specified via the CLI; - If no Issuer credentials are provided, they are generated each time `install` is called. - Proxies are NOT configured to use the identity service. - It's possible to override the credential generation logic---especially for tests---via install options that can be configured via the CLI.	2019-03-19 17:04:11 -07:00
Oliver Gould	91c5f07650	proxy: Upgrade to identity-capable proxy (#2524 ) The new proxy has changed its configuration as follows: - `LISTENER` urls are now `LISTEN_ADDR` addresses; - `CONTROL_URL` is now `DESTINATION_SVC_ADDR`; - `_NAMESPACE` vars are no longer needed; - The `PROXY_ID` is now the `DESTINATION_CONTEXT`; - The "metrics" port is now the "admin" port, since it serves more than just metrics; - A readiness probe now checks a dedicated /ready endpoint eagerly. Identity injection is NOT* configured by this branch.	2019-03-19 14:20:39 -07:00
Oliver Gould	81f645da66	Remove `--tls=optional` and `linkerd-ca` (#2515 ) The proxy's TLS implementation has changed to use a new _Identity_ controller. In preparation for this, the `--tls=optional` CLI flag has been removed from install and inject; and the `ca` controller has been deleted. Metrics and UI treatments for TLS have not been removed, as they will continue to be valuable for the new Identity system. With the removal of the old identity scheme, the Destination service's proxy ID field is now set with an opaque string (e.g. `ns:emojivoto`) to enable locality awareness.	2019-03-18 17:40:31 -07:00
Gaurav Kumar	d0bdd4ffb4	Allow configuration of Prometheus log level (#2484 ) (#2487 ) Signed-off-by: Gaurav Kumar <gaurav.kumar9825@gmail.com>	2019-03-18 10:34:58 -07:00
Andrew Seigner	e5d2460792	Remove single namespace functionality (#2474 ) linkerd/linkerd2#1721 introduced a `--single-namespace` install flag, enabling the control-plane to function within a single namespace. With the introduction of ServiceProfiles, and upcoming identity changes, this single namespace mode of operation is becoming less viable. This change removes the `--single-namespace` install flag, and all underlying support. The control-plane must have cluster-wide access to operate. A few related changes: - Remove `--single-namespace` from `linkerd check`, this motivates combining some check categories, as we can always assume cluster-wide requirements. - Simplify the `k8s.ResourceAuthz` API, as callers no longer need to make a decision based on cluster-wide vs. namespace-wide access. Components either have access, or they error out. - Modify the web dashboard to always assume ServiceProfiles are enabled. Reverts #1721 Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-12 00:17:22 -07:00

1 2 3

122 Commits