linkerd2

Commit Graph

Author	SHA1	Message	Date
Andrew Seigner	7f59caa7fc	Bump proxy-init to 1.2.0 (#3397 ) Pulls in latest proxy-init: https://github.com/linkerd/linkerd2-proxy-init/releases/tag/v1.2.0 This also bumps a dependency on cobra, which provides more complete zsh completion. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-09-09 09:06:14 -07:00
Alejandro Pedraza	17dd9bf6bc	Couple of injection events fixes (#3363 ) * Couple of injection events fixes When generating events in quick succession against the same target, client-go issues a PATCH request instead of a POST, so we need the extra RBAC permission. Also we have an informer on pods, so we also need the "watch" permission for them, whose omission was causing an error entry in the logs. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-09-04 11:57:20 -05:00
Alejandro Pedraza	acbab93ca8	Add support for k8s 1.16 (#3364 ) Fixes #3356 1.16 removes some api groups that were already deprecated. From k8s blog post (https://kubernetes.io/blog/2019/07/18/api-deprecations-in-1-16/): ``` - PodSecurityPolicy: will no longer be served from extensions/v1beta1 in v1.16. Migrate to the policy/v1beta1 API, available since v1.10. Existing persisted data can be retrieved/updated via the policy/v1beta1 API. - DaemonSet, Deployment, StatefulSet, and ReplicaSet: will no longer be served from extensions/v1beta1, apps/v1beta1, or apps/v1beta2 in v1.16. Migrate to the apps/v1 API, available since v1.9. Existing persisted data can be retrieved/updated via the apps/v1 API. ``` Previous PRs had already made this change at the Helm templates level, but we still needed to do it at the API calls and tests. The integration tests ran fine for k8s 1.12 and 1.15. They fail on 1.16 because the upgrade integration test tries to install linkerd 2.5 which is not compatible with 1.16. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-09-04 09:59:55 -05:00
arminbuerkle	5c38f38a02	Allow custom cluster domains in remaining backends (#3278 ) * Set custom cluster domain in GetServiceProfileFor * Set custom cluster domain in tap server Move fetching cluster domain for tap server to cmd main * Handle fetchting cluster domain errors separately * Use custom cluster domain for traffic split adaptor Signed-off-by: Armin Buerkle <armin.buerkle@alfatraining.de>	2019-08-27 10:01:36 -07:00
Alejandro Pedraza	02efb46e45	Have the proxy-injector emit events upon injection/skipping injection (#3316 ) * Have the proxy-injector emit events upon injection/skipping injection Fixes #3253 Have the proxy-injector emit an event whenever a injection happens, or when injection is skipped for some reason (also added that reason into the proxy-injector logs). The level is associated to the parent workload (it can't be associated to the pod because at this point the pod hasn't been persisted). The event recorder was setup at the `webhook/server.go` level and passed to the proxy-injector's `Inject` function. The sp-validator thus also has access to the event recorder, but for now it's not using it. Related changes: - Refactored `api.GetOwnerKindAndName()` to have it return a more generic object. - Refactored `report.Injectable()` to also have it return the reason why a workload is not injectable. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-26 13:34:36 -05:00
Ivan Sim	954a45f751	Fix broken unit and integration tests (#3303 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-21 18:52:19 -07:00
arminbuerkle	e7d303e03f	Add LINKERD2_PROXY_DESTINATION_GET_SUFFIXES (#3277 ) * Fix missing `clusterDomain` in render RenderTapOutputProfile * Add LINKERD2_PROXY_DESTINATION_GET_SUFFIXES env variable Signed-off-by: Armin Buerkle <armin.buerkle@alfatraining.de>	2019-08-21 14:28:30 -07:00
Ivan Sim	183e42e4cd	Merge the CLI 'installValues' type with Helm 'Values' type (#3291 ) * Rename template-values.go * Define new constructor of charts.Values type * Move all Helm values related code to the pkg/charts package * Bump dependency * Use '/' in filepath to remain compatible with VFS requirement * Add unit test to verify Helm YAML output * Alejandro's feedback * Add unit test for Helm YAML validation (HA) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-20 19:26:38 -07:00
cpretzer	4e92064f3b	Add a flag to install-cni command to configure iptables wait flag (#3066 ) Signed-off-by: Charles Pretzer <charles@buoyant.io>	2019-08-15 12:58:18 -07:00
Kevin Leimkuhler	cc3c53fa73	Remove tap from public API and associated test infrastructure (#3240 ) ### Summary After the addition of the tap APIServer, all the logic related to tap in the public API no longer needs to be there. The servers and clients that are created but not used, as well as all the old testing infrastrucure related to tap can be removed. This deprecates TapByResource and therefore required an update to the protobuf files with `bin/protoc-go.sh`. While the change to deprecate this method was extremely small, a lot of protobuf fils were updated in the process. These changes to the code and protobuf files should probably remain coupled since `TapByResource` is officially deprecated in the public API, but a majority of the additions/deletions are related to those files. This draft passes `go test` as well as a local run of the integration tests. Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2019-08-14 17:27:37 -04:00
Carol A. Scott	00437709eb	Add trafficsplit metrics to CLI (#3176 ) This PR adds `trafficsplit` as a supported resource for the `linkerd stat` command. Users can type `linkerd stat ts` to see the apex and leaf services of their trafficsplits, as well as metrics for those leaf services.	2019-08-14 10:30:57 -07:00
Ivan Sim	4d01e3720e	Update install and upgrade code to use the new helm charts (#3229 ) * Delete symlink to old Helm chart * Update 'install' code to use common Helm template structs * Remove obsolete TLS assets functions. These are now handle by Helm functions inside the templates * Read defaults from values.yaml and values-ha.yaml * Ensure that webhooks TLS assets are retained during upgrade * Fix a few bugs in the Helm templates (see bullet points): * Merge the way the 'install' ha and non-ha options are handled into one function * Honor the 'NoInitContainer' option in the components templates * Control plane mTLS will not be disabled if identity context in the config map is empty. The data plane mTLS will still be automatically disabled if the context is nil. * Resolve test failures from rebase with master * Fix linter issues * Set service account mount path read-only field * Add TLS variables of the webhooks and tap to values.yaml During upgrade, these secrets are preserved to ensure they remain synced wih the CA bundle in the webhook configurations. These Helm variables are used to override the defaults in the templates. * Remove obsolete 'chart' folder * Fix bugs in templates * Handle missing webhooks and tap TLS assets during upgrade When upgrading from an older version that don't have these secrets, fallback to let Helm create them by creating an empty charts.TLS struct. * Revert the selector labels of webhooks to be compatible with that in 2.4 In 2.4, the proxy injector and profile validator webhooks already have their selector labels defined. Since these attributes are immutable, the recent change to these selectors introduced by the Helm chart work will cause upgrade to fail. * Alejandro's feedback * Siggy's feedback * Removed redundant unexported custom types Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-13 14:16:24 -07:00
Alejandro Pedraza	1e82f62d6e	Fix uninject (#3236 ) Now that we inject at the pod level by default, `linkerd uninject` should remove the `linkerd.io/inject: enabled` annotation. Also added a test for that. Fix #3156 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-13 15:06:21 -05:00
Thomas Rampelberg	ca5b4fab2e	Add container metrics and grafana dashboard (#3217 ) * Add container metrics and grafana dashboard * Review cleanup * Update templates	2019-08-12 08:03:57 -07:00
Andrew Seigner	43bc175ea9	Enable tap-admin ClusterRole privileges for `` (#3214 ) The `linkerd-linkerd-tap-admin` ClusterRole had `watch` privileges on `/tap` resources. This disallowed non-namespaced tap requests of the form: `/apis/tap.linkerd.io/v1alpha1/watch/namespaces/linkerd/tap`, because that URL structure is interpreted by the Kubernetes API as watching a resource of type `tap` within the linkerd namespace, rather than tapping the linkerd namespace. Modify `linkerd-linkerd-tap-admin` to have `watch` privileges on ``, enabling any request of the form `/apis/tap.linkerd.io/v1alpha1/watch/namespaces/linkerd/` to succeed. Fixes #3212 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-08 12:04:03 -07:00
Andrew Seigner	0ff39ddf8d	Introduce tap-admin ClusterRole, web privs flag (#3203 ) The web dashboard will be migrating to the new Tap APIService, which requires RBAC privileges to access. Introduce a new ClusterRole, `linkerd-linkerd-tap-admin`, which gives cluster-wide tap privileges. Also introduce a new ClusterRoleBinding, `linkerd-linkerd-web-admin` which binds the `linkerd-web` service account to the new tap ClusterRole. This ClusterRoleBinding is enabled by default, but may be disabled via a new `linkerd install` flag `--restrict-dashboard-privileges`. Fixes #3177 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-08 10:28:35 -07:00
Alejandro Pedraza	3ae653ae92	Refactor proxy injection to use Helm charts (#3200 ) * Refactor proxy injection to use Helm charts Fixes #3128 A new chart `/charts/patch` was created, that generates the JSON patch payload that is to be returned to the k8s API when doing the injection through the proxy injector, and it's also leveraged by the `linkerd inject --manual` CLI. The VFS was used by `linkerd install` to access the old chart under `/chart`. Now the proxy injection also uses the Helm charts to generate the JSON patch (see above) so we've moved the VFS from `cli/static` to a new common place under `/pkg/charts/static`, and the new root for the VFS is now `/charts`. `linkerd install` hasn't yet migrated to use the new charts (that'll happen in #3127), so the only change in that regard was the creation of `/charts/chart` which is a symlink pointing to `/chart` that `install.go` now uses, so that the VFS contains both the old and new charts, as a temporary measure. You can see that `/bin/Dockerfile-bin`, `/controller/Dockerfile` and `/bin/build-cli-bin` do now `go generate` pointing to the new location (and the `go generate` annotation was moved from `/cli/main.go` to `pkg/charts/static/templates.go`). The symlink trick doesn't work when building the binaries through Docker, so `/bin/Dockerfile-bin` replaces the symlink with an actual copy of `/chart`. Also note that in `/controller/Dockerfile` we now need to include the `prod` tag in `go install` like we do in `/bin/Dockerfile-bin` so that the proxy injector does use the VFS instead of the local file system. - The common logic to parse a chart has been moved from `install.go` to `/pkg/charts/util.go`. - The special ENV var in the proxy for "outbound router capacity" that only applies to the Prometheus pod is now handled directly in the proxy partial and all the associated go code could be removed. - The `patch.go` lib for generating the JSON patch in go along with its tests `patch_test.go` are no longer needed. - Lots of functions in `/pkg/inject/inject.go` got removed/simplified with their logic being moved into the charts themselves. As a consequence lots of things in `inject_test.go` became irrelevant. - Moved `template-values.go` from `/pkg/inject` to `pkg/charts` as that contains the go structs representation of the chart variables that will be leveraged in #3127. Don't forget to run `/bin/helm.sh` whenever you make changes to charts ;-) Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-07 17:32:37 -05:00
Tarun Pothulapati	0cbba0b03e	Setting SuccessfulJobHistoryLimit to 0 for CronJobs (#3193 ) * setting successful job history limit to 0 Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-08-07 16:59:14 -05:00
Andrew Seigner	a59c1dd32d	Introduce tap APIService, update `linkerd tap` (#3167 ) The Tap Service enabled tapping of any meshed pod, regardless of user privilege. This change introduces a new Tap APIService. Kubernetes provides authentication and authorization of Tap requests, and then forwards requests to a new Tap APIServer, which implements a Kubernetes aggregated APIServer. The Tap APIServer authenticates the client TLS from Kubernetes, and authorizes the user via a SubjectAccessReview. This change also modifies the `linkerd tap` command to make requests against the new APIService. The Tap APIService implements these Kubernetes-style endpoints: POST /apis/tap.linkerd.io/v1alpha1/watch/namespaces/:ns/tap POST /apis/tap.linkerd.io/v1alpha1/watch/namespaces/:ns/:res/:name/tap GET /apis GET /apis/tap.linkerd.io GET /apis/tap.linkerd.io/v1alpha1 GET /healthz GET /healthz/log GET /healthz/ping GET /metrics GET /openapi/v2 GET /version Users authorize to the new `tap.linkerd.io/v1alpha1` via RBAC. Only the `watch` verb is supported. Access is also available via subresources such as `deployments/tap` and `pods/tap`. This change introduces the following resources into the default Linkerd install: - Global - APIService/v1alpha1.tap.linkerd.io - ClusterRoleBinding/linkerd-linkerd-tap-auth-delegator - `linkerd` namespace: - Secret/linkerd-tap-tls - `kube-system` namespace: - RoleBinding/linkerd-linkerd-tap-auth-reader Tasks not covered by this PR: - `linkerd top` - `linkerd dashboard` - `linkerd profile --tap` - removal of the unauthenticated tap controller Fixes #2725, #3162, #3172 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-01 14:02:45 -07:00
Andrew Seigner	a8830b2323	Set heartbeat cronjobs to not restart on failure (#3174 ) The heartbeat cronjob specified `restartPolicy: OnFailure`. In cases where failure was non-transient, such as if a cluster did not have internet access, this would continuously restart and fail. Change the heartbeat cronjob to `restartPolicy: Never`, as a failed job has no user-facing impact. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-31 13:51:13 -07:00
Kevin Leimkuhler	8d9cfbf670	Inject Tap service name into proxy PodSpec (#3155 ) ### Summary In order for Pods' tap servers to start authorizing tap clients, the tap server must be able to check client names against the expected tap service name. This change injects the `LINKERD2_PROXY_TAP_SVC_NAME` into proxy PodSpecs. ### Details The tap servers on the individual resources being tapped should be able to verify that the client is the tap service. The `LINKERD2_PROXY_TAP_SVC_NAME` is now injected as an environment variable in the proxies so that it can check this value against the client name of the TLS connection. Currently, this environment will go unused. There is an open PR (linkerd2-proxy#290) to use this variable in the proxy, but this is not dependent on that merging first. Note: The variable is not injected if tap is disabled. ### Testing Test output has been updated with the newly injected environment variable. Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2019-07-29 15:05:45 -07:00
Tarun Pothulapati	2ba2dea6a6	Added Resource Limits when ha is Configured (#3092 ) * increased ha resource limits * added resource limits to proxy when HA * update golden files in cmd/main Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-07-26 09:46:36 -07:00
Cody Vandermyn	808fa381f9	A Slightly More Restrictive PSP (#3085 ) * Adds more PSP restrictions * Update test fixtures * Updates PSP to be conditional on initContainer - The proxy-init container runs as root and needs the PSP to allow this user when there is an init container. Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2019-07-24 10:12:33 -07:00
Andrew Seigner	64ed8e4a74	Introduce Cluster Heartbeat cronjob (#3056 ) `linkerd check`, the web dashboard, and Grafana all perform version checks to validate Linkerd is up to date. It's common for users to seldom execute these codepaths. This makes it difficult to identify what versions of Linkerd are currently in use and what environments it is being run in, which helps prioritize testing and backports. Introduce a `heartbeat` CronJob to the default Linkerd install. The cronjob executes every 24 hours, starting from 5 minutes after `linkerd install` is run. Example check URL: https://versioncheck.linkerd.io/version.json? install-time=1562761177& k8s-version=v1.15.0& meshed-pods=8& rps=3& source=heartbeat& uuid=cc4bb700-3314-426a-9f0f-ec588b9df020& version=git-b97ee9f7 Fixes #2961 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 17:12:30 -07:00
Andrew Seigner	48a69cb88a	Bump Prometheus to 2.11.1, Grafana to 6.2.5 (#3123 ) - set `disable_sanitize_html` in `grafana.ini`. - make all text box dimensions whole integers to fix dropdown issue, reported in: https://github.com/linkerd/linkerd2/issues/2955#issuecomment-503085444 - rev all dashboards to `schemaVersion` 18 for Grafana 6.2.5 - `prometheus-benchmark.json` based on: https://grafana.com/grafana/dashboards/9761 - `prometheus.json` based on: `69c93e6401/public/app/plugins/datasource/prometheus/dashboards/prometheus_2_stats.json` - `grafana.json` based on: `85aed0276e/public/app/plugins/datasource/prometheus/dashboards/grafana_stats.json` Fixes #2955 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 13:37:56 -07:00
Alex Leong	d6ef9ea460	Update ServiceProfile CRD to version v1alpha2 and remove validation (#3078 ) The openAPIV3Schema validation in the ServiceProfiles CRD is very limited in what it can validate and is obviated by more sophisticated validation done by the validating admission controller. Therefore, we would like to remove the openAPIV3Schema validation to reduce the size and complexity of the CRD object. To do so, we must also bump the version of the ServiceProfile custom resource from v1alpha1 to v1alpha2. This ensures that when the controller is upgraded, it will attempt to watch the v1alpha2 resource. If it cannot (because, for example, the controller pod started before the ServiceProfile CRD was updated and therefore the v1alpha2 version does not exist) then it will go into a crash loop backoff until it can. This essentially means that the controller will wait for the CRD to be upgraded to include v1alpha2 before it will start. Bumping the version is necessary because if we did not, it would be possible for the controller to start before the CRD is updated (removing the validation). In this case, when the CRD is edited, the controller will lose its list watch on ServiceProfiles and will stop getting updates. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-23 11:46:31 -07:00
arminbuerkle	010efac24b	Allow custom cluster domain in controller components (#2950 ) * Allow custom cluster domain in destination watcher The change relaxes the constrains of an authority requiring a `svc.cluster.local` suffix to only require `svc` as third part. A unit test could be added though the destination/server and endpoint watcher already test this behaviour. * Update proto to allow setting custom cluster domain Update golden templates * Allow setting custom domain in grpc, web server * Remove cluster domain flags from web srv and public api * Set defaultClusterDomain in validateAndBuild if none is set Signed-off-by: Armin Buerkle <armin.buerkle@alfatraining.de>	2019-07-23 08:59:41 -07:00
Tarun Pothulapati	fcec1cfb8a	Added Anti Affinity when HA is configured (#2893 ) * Added Anti Affinity when HA is configured * Move check to validate() * Test output with anti-affinity when ha upgrade * Add anti-affinity to identity deployment * made host anti-affinity default when ha * Define affinity template in a separate file Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-07-18 10:03:25 -07:00
Alejandro Pedraza	ba9fd70892	`linkerd upgrade config` bombs when installation had a flag (#3097 ) When installing using some of the flags that persist in install, e.g `linkerd install --ha`, and then doing `linkerd upgrade config` a nil pointer error is thrown. Fixes #3094 `newCmdUpgradeConfig()` was using passing `flags` as nil because `linkerd upgrade config` doesn't expose any flags for the subcommand, but turns out they're still needed down the call stack in `setFlagsFromInstall` to reuse the flags persisted during install. I also added a new unit test catching this. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-07-18 09:09:01 -05:00
Carol A. Scott	ee1a111993	Updating CLI output for `linkerd edges` (#3048 ) This PR improves the CLI output for `linkerd edges` to reflect the latest API changes. Source and destination namespaces for each edge are now shown by default. The `MSG` column has been replaced with `Secured` and contains a green checkmark or the reason for no identity. A new `-o wide` flag shows the identity of client and server if known.	2019-07-17 12:23:34 -07:00
Jonathan Juares Beber	2dcbde08b3	Show pod status more clearly (#1967 ) (#2989 ) During operations with `linkerd stat` sometimes it's not clear the actual pod status. This commit introduces a method, to the `k8s`package, getting the pod status, based on [`kubectl` logic](`33a3e325f7/pkg/printers/internalversion/printers.go (L558-L640)`) to expose the `STATUS` column for pods . Also, it changes the stat command on the` cli` package adding a column when the resource type is a Pod. Fixes #1967 Signed-off-by: Jonathan Juares Beber <jonathanbeber@gmail.com>	2019-07-10 12:44:44 -07:00
Alejandro Pedraza	53e589890d	Have `linkerd endpoints` use `Destination.Get` (#2990 ) * Have `linkerd endpoints` use `Destination.Get` Fixes #2885 We're refactoring `linkerd endpoints` so it hits directly the `Destination.Get` endpoint, instead of relying on the Discovery service. For that, I've created a new `client.go` for Destination and added it to the `APIClient` interface. I've also added a `destinationClient` struct that mimics `tapClient`, and whose common logic has been moved into `stream_client.go`. Analogously, I added a `destinationServer` struct that mimics `tapServer`. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-07-03 09:11:03 -05:00
Ivan Sim	7e1c14e783	Add the 'linkerd.io/control-plane-ns' label to the Traffic Split CRD (#3026 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-07-02 15:46:25 -07:00
Andrew Seigner	902978fe48	Rename debug annotation to enable-debug-sidecar (#3016 ) Linkerd's CLI flags all match 1:1 with their `config.linkerd.io/*` annotation counterparts, except `--enable-debug-sidecar`, which corresponded to `config.linkerd.io/debug`. Additionally, the Linkerd docs assume this 1:1 mapping. Rename the `config.linkerd.io/debug` annotation to `config.linkerd.io/enable-debug-sidecar`. Relates to https://github.com/linkerd/website/issues/381 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-02 20:01:52 +02:00
Alex Leong	27373a8b78	Add traffic splitting to destination profiles (#2931 ) This change implements the DstOverrides feature of the destination profile API (aka traffic splitting). We add a TrafficSplitWatcher to the destination service which watches for TrafficSplit resources and notifies subscribers about TrafficSplits for services that they are subscribed to. A new TrafficSplitAdaptor then merges the TrafficSplit logic into the DstOverrides field of the destination profile. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-06-28 13:19:47 -07:00
Tarun Pothulapati	5c5ec6d816	add admin port label to proxy-injector and sp-validator (#2984 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-06-27 17:25:49 -05:00
Andrew Seigner	81790b6735	Bump Prometheus to v2.10.0 (#2979 ) Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-06-21 12:51:31 -07:00
Tarun Pothulapati	a3ce06bd80	Add sideEffects field to Webhooks (#2963 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-06-21 11:06:10 -07:00
Ivan Sim	435fe861d0	Label all Linkerd resources (#2971 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-20 09:44:30 -07:00
Ivan Sim	e2e976cce9	Add `NET_RAW` capability to the proxy-init container (#2969 ) Also, update control plane PSP to match linkerd/website#94 Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-19 19:34:37 -07:00
Dennis Adjei-Baah	694ba9c2cb	Revert add namespace name to MWC (#2946 ) * revert add namespace name to MWC	2019-06-14 15:26:34 -07:00
Alejandro Pedraza	7fc6c195ad	Set MWC and VWC failure policy to 'fail' in HA mode only (#2943 ) Fixes #2927 Also moved `TestInstallSP` after `TestCheckPostInstall` so we're sure the validating webhook is ready before installing a service profile. Signed-off-by: Alejandro Pedraza Borrero <alejandro@buoyant.io>	2019-06-14 11:50:59 -05:00
Alejandro Pedraza	28025eeb56	Remove UPDATE event from the mutating webhook config (#2919 ) Fixes #2889 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-13 15:42:47 -05:00
Alejandro Pedraza	e9bf014d34	Remove MWVC RBAC from webhook configs (#2925 ) Fixes #2890 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-13 15:42:00 -05:00
Dennis Adjei-Baah	035ba6ae87	update sp-validator MWC golden test file (#2938 )	2019-06-13 13:39:24 -07:00
Dennis Adjei-Baah	8aef9280dd	add namespace name to MWC (#2905 ) When installing multiple control planes, the mutatingwebhookconfiguration of the first control plane gets overwritten by any subsequent control plane install. This is caused by the fixed name given to the mutatingwebhookconfiguration manifest at install time. This commit adds in the namespace to the manifest so that there is a unique configuration for each control plane. Fixes #2887	2019-06-13 12:15:43 -07:00
Ivan Sim	ecc4465cd1	Introduce Control Plane's PSP and RBAC resources into Helm templates (#2920 ) * Add control plane and CNI PSP and RBAC resources * Add the '--linkerd-cni-enabled' flag to the multi-stage install subcommands This flag ensures that the NET_ADMIN capability is omitted from the control plane's PSP during 'install config' and the proxy-init containers aren't injected during 'install control-plane'. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-12 20:18:46 -07:00
Alejandro Pedraza	8416d326c2	If HA, set the webhooks failure policy to 'Fail' (#2906 ) * If HA, set the webhooks failure policy to 'Fail' I'm adding to the linkerd namespace a new label `linkerd.io/is-control-plane: true` that is used in the webhook configs' selector to skip the proxy injector for this namespace. This avoids running into the timing issues described in #2852. Fixes #2852 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-11 13:11:54 -05:00
Cody Vandermyn	33de3574ee	Correctly set securityContext values on injection (#2911 ) The patch provided by @ihcsim applies correct values for the securityContext during injection, namely: `allowPrivilegeEscalation = false`, `readOnlyRootFilesystem = true`, and the capabilities are copied from the primary container. Additionally, the proxy-init container securityContext has been updated with appropriate values. Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2019-06-11 10:34:30 -07:00
Alejandro Pedraza	66eb829e5a	Fix HA during upgrade (#2900 ) * Fix HA during upgrade If we have a Linkerd installation with HA, and then we do `linkerd upgrade` without specifying `--ha`, the replicas will get set back to 1, yet the resource requests will keep their HA values. Desired behavior: `linkerd install --ha` adds the `ha` value into the linkerd-config, so it should be used during upgrade even if `--ha` is not passed to `linkerd upgrade`. Note we still can do `linkerd upgrade --ha=false` to disable HA. This is a prerequesite to address #2852 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-06 17:27:27 -05:00
Alejandro Pedraza	74ca92ea25	Split proxy-init into separate repo (#2824 ) Split proxy-init into separate repo Fixes #2563 The new repo is https://github.com/linkerd/linkerd2-proxy-init, and I tagged the latest there `v1.0.0`. Here, I've removed the `/proxy-init` dir and pinned the injected proxy-init version to `v1.0.0` in the injector code and tests. `/cni-plugin` depends on proxy-init, so I updated the import paths there, and could verify CNI is still working (there is some flakiness but unrelated to this PR). For consistency, I added a `--init-image-version` flag to `linkerd inject` along with its corresponding override config annotation. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-03 16:24:05 -05:00
Tarun Pothulapati	590249c66b	HA for proxy-injector and sp-validator (#2874 ) * Added labels to webhook configurations in charts/ * Multiple replicas of proxy-injector and sp-validator in HA * Use ControllerComponent template variable for webhookconfigurations Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-05-31 14:48:30 -07:00
Carol A. Scott	031cd9d4ba	Simplify logic for linkerd edges and output JSON in consistent order (#2858 ) When `linkerd edges` returns JSON, the data will now be sorted alphabetically by SRC name, meaning edges will be returned in a consistent order. Logic in the CLI `edges.go` has also been simplified. These changes should result in the Travis CI builds passing consistently.	2019-05-29 11:31:58 -07:00
Tarun Pothulapati	1a574def1f	Added labels to webhook configurations in charts/ (#2853 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-05-28 16:44:09 -05:00
Carol A. Scott	8c496e3d0d	Adding unit test for CLI edges command (#2837 ) Adds a unit test for the `linkerd edges` command.	2019-05-28 13:51:45 -07:00
Dennis Adjei-Baah	b9cc66c6c8	fixes roleRef name in linkerd-tap rbac (#2845 ) * fixes roleRef name in linkerd-tap rbac	2019-05-28 10:05:01 -07:00
Ivan Sim	0d489d3099	Add debug container annotation (#2842 ) This new annotation is used by the proxy injector to determine if the debug container needs to be injected. When using 'linkerd install', the 'pkg/inject' library will only inject annotations into the workload YAML. Even though 'conf.debugSidecar' is set in the CLI, the 'injectPodSpec()' function is never invoked on the proxy injector side. Once the workload YAML got picked up by the proxy injector, 'conf.debugSidecar' is already nil, since it's a different, new 'conf' object. The new annotation ensures that the proxy injector injects the debug container. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-05-24 10:11:06 -07:00
Ivan Sim	5a5f8bbfe8	Install MWC and VWC During Installation (#2806 ) * Update helm charts to include webhooks config and TLS secret * Update the webhooks to read the secret cert and key * Update webhooks to not recreate config on restart * Ensure upgrade preserve existing secrets * Revert the change to rename the webhook configs The renaming change breaks upgrade, where the new webhook configs conflict with the existing ones. The older resources aren't deleted during upgrade because they are dynamically created. * Make the secret volume read-only * Remove unnecessary exported getter functions * Remove obsolete mwc and vwc templates Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-05-20 12:43:50 -07:00
Alena Varkockova	7973715ee4	Output check result as json (#2666 ) * Add missing file, make linter happy * Changes from code review * Ivan's code review * PR review feedback * style fixes * Last PR comments Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-05-20 09:04:28 -07:00
Dennis Adjei-Baah	a0fa1dff59	Move tap service into its own pod. (#2773 ) * Split tap into its own pod in the control plane Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-05-15 16:28:44 -05:00
Alejandro Pedraza	065c221858	Support for resources opting-out of tap (#2807 ) Support for resources opting out of tap Implements the `linkerd inject --disable-tap` flag (although hidden pending #2811) and the config override annotation `config.linkerd.io/disable-tap`. Fixes #2778 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-05-10 14:17:23 -05:00
Ivan Sim	714035fee9	Define default resource spec for proxy-init init container (#2763 ) Fixes #2750 Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-29 11:41:05 -07:00
Andrew Seigner	be60b37e93	Group Web and Grafana ServiceAccounts with RBAC (#2756 ) All ServiceAccounts are intended to be grouped together with other RBAC resources, particularly for `linkerd install config` output. Grafana and Web ServiceAccounts were still included with their respective Deployments. Group Grafana and Web ServiceAccounts with other RBAC resources. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 17:33:05 -07:00
Andrew Seigner	15ffd86cf1	Introduce multi-stage upgrade (#2723 ) `linkerd install` supports a 2-stage install process, `linkerd upgrade` did not. Add 2-stage support for `linkerd upgrade`. Also exercise multi-stage functionality during upgrade integration tests. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-25 14:29:52 -07:00
Alex Leong	4ea7c62b0d	Revert " Remove validation from service profile CRD definition (#2740 )" (#2752 ) This reverts commit `3de16d47be`. #2740 modified the ServiceProfiles CRD which will cause issues for users upgrading from the old CRD version to the new version. #2748 was an attempt to fix this by bumping the service profile CRD version, however, our testing infrastructure is not well set up to accommodate changes to CRDs because they are resources which are global to the cluster. We revert this change for now and will revisit it in the future when we can give more thought to CRD versioning, upgrade, and testing. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-04-25 13:40:20 -07:00
Ivan Sim	cd37d3f0f5	Fall back to default built-in version if versions config are missing (#2745 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-24 19:49:18 -07:00
Alejandro Pedraza	53bb7c47f6	Make the auto-injector required and removed proxy-auto-inject flag (#2733 ) Make the auto-injector required and removed proxy-auto-inject flag Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 13:06:51 -05:00
Alejandro Pedraza	62d9a80894	New `linkerd inject` default and manual modes (#2721 ) Fixes #2720 and 2711 This changes the default behavior of `linkerd inject` to not inject the proxy but just the `linkerd.io/inject: enabled` annotation for the auto-injector to pick it up (regardless of any namespace annotation). A new `--manual` mode was added, which behaves as before, injecting the proxy in the command output. The unit tests are running with `--manual` to avoid any changes in the fixtures. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 09:05:27 -05:00
Alex Leong	3de16d47be	Remove validation from service profile CRD definition (#2740 ) Fixes #2736 Signed-off-by: Alex Leong <alex@buoyant.io>	2019-04-23 16:10:50 -07:00
Andrew Seigner	b2b4780430	Introduce install stages (#2719 ) This change introduces two named parameters for `linkerd install`, split by privilege: - `linkerd install config` - Namespace - ClusterRoles - ClusterRoleBindings - CustomResourceDefinition - ServiceAccounts - `linkerd install control-plane` - ConfigMaps - Secrets - Deployments - Services Comprehensive `linkerd install` is still supported. TODO: - `linkerd check` support - `linkerd upgrade` support - integration tests Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-23 14:52:34 -07:00
Dennis Adjei-Baah	3e5917f7e0	Add the ability to inject a debug sidecar (#2726 ) Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-04-22 16:53:12 -07:00
Ivan Sim	8d13084f94	Split the `linkerd-version` CLI flag into `control-plane-version` and `proxy-version` (#2702 ) * The 'linkerd-version' CLI flag is renamed to 'control-plane-version' * Add version field to proxy config * Add the control plane version to the global config * Unit test for init image version * Use more specific control plane and proxy versions in unit tests Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-19 11:35:20 -07:00
Andrew Seigner	2d9e3686e2	Split out config objects from install templates (#2714 ) This is an initial change to separate out config-specific k8s objects from the control-plane components. The eventual goal will be rendering these configs as the first stage of a multi-stage install. Part of #2337 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-18 09:31:35 -07:00
Douglas Jordan	80634d6c8b	Create proxy-injector RBAC resources before deployment (#2707 ) Fixes #2694 Signed-off-by: Douglas Jordan <dwj300@gmail.com>	2019-04-17 10:51:00 -07:00
Ivan Sim	4e19827457	Allow identity to be disabled during inject on existing cluster (#2686 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-11 13:37:06 -07:00
Katerina	938d64a16f	Web server updated to read the UUID from the linkerd-config ConfigMap. (#2603 ) Signed-off-by: Kateryna Melnyk <kattymelnyk@gmail.com>	2019-04-08 12:56:00 -07:00
Oliver Gould	d3b0d39f3b	upgrade: Fix the linkerd version in linkerd-config (#2662 ) `92f15e78a9` incorrectly removed the config version override when patching a config from options, which caused upgrade to stop updating the config version. Fixes #2660	2019-04-08 10:57:02 -07:00
Alejandro Pedraza	edb225069c	Add validation webhook for service profiles (#2623 ) Add validation webhook for service profiles Fixes #2075 Todo in a follow-up PRs: remove the SP check from the CLI check. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-05 16:10:47 -05:00
Oliver Gould	4c5378f586	install: Change --ha to set a 100m CPU request (#2644 ) When the --ha flag is set, we currently set a 10m CPU request, which corresponds to 1% of a core, which isn't actually enough to keep the proxy responding to health checks if you have 100 processes on the box. Let's give ourselves a little more breathing room. Fixes #2643	2019-04-05 13:41:00 -07:00
Andrew Seigner	1c938b3f52	Introduce upgrade command unit tests (#2639 ) This change introduces a basic unit test for the `linkerd upgrade` command. Given a mock k8s client with linkerd-config and linkerd-identity-issuer objects, it validates the rendered yaml output against an expected file. To enable this testing, most of the logic in the top-level upgrade command has been moved down into a `validateAndBuild` method. TODO: - test individual functions around mutating options, flags, configs, and values - enable reading the install information from a manifest rather than k8s Part of #2637 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-05 11:55:20 -07:00
Andrew Seigner	2f80add17a	Introduce inject integration tests (#2616 ) This change introduces integration tests for `linkerd inject`. The tests perform CLI injection, with and without params, and validates the output, including annotations. Also add some known errors in logs to `install_test.go`. TODO: - deploy uninjected and injected resources to a default and auto-injected cluster - test creation and update Part of #2459 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-05 11:42:49 -07:00
Kevin Lingerfelt	74e48ba301	Remove project injector's -no-init-container flag (#2635 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-04-04 11:09:47 -07:00
harsh jain	976bc40345	Fixes #2607 : Remove TLS from stat (#2613 ) Removes the TLS percentages from the stat command in the CLI.	2019-04-04 10:37:42 -07:00
Ivan Sim	92f15e78a9	Define proxy version override annotation (#2593 ) * Define proxy version override annotation * Don't override global linkerd version during inject This ensures consistent usages of the config.linkerd.io/linkerd-version and linkerd.io/proxy-version annotations. The former will only be used to track overridden version, while the latter shows the cluster's current default version. * Rename proxy version config override annotation Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-02 14:27:12 -07:00
Ivan Sim	a80335ed51	Disable external profiles by default (#2594 ) * Disable external profiles by default * Rename the --disable-external-profiles flag to --enable-external-profiles Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-01 15:13:50 -07:00
Andrew Seigner	e38ad7e9d1	Update Prometheus retention param (#2584 ) `storage.tsdb.retention` is deprecated in favor of `storage.tsdb.retention.time`. Replace all occurrences. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-29 10:45:02 -07:00
Oliver Gould	655632191b	config: Store install parameters with global config (#2577 ) When installing Linkerd, a user may override default settings, or may explicitly configure defaults. Consider install options like `--ha --controller-replicas=4` -- the `--ha` flag sets a new default value for the controller-replicas, and then we override it. When we later upgrade this cluster, how can we know how to configure the cluster? We could store EnableHA and ControllerReplicas configurations in the config, but what if, in a later upgrade, the default value changes? How can we know whether the user specified an override or just used the default? To solve this, we add an `Install` message into a new config. This message includes (at least) the CLI flags used to invoke install. upgrade does not specify defaults for install/proxy-options fields and, instead, uses the persisted install flags to populate default values, before applying overrides from the upgrade invocation. This change breaks the protobuf compatibility by altering the `installation_uuid` field introduced in `9c442f6885`. Because this change was not yet released (even in an edge release), we feel that it is safe to break. Fixes https://github.com/linkerd/linkerd2/issues/2574	2019-03-29 10:04:20 -07:00
Oliver Gould	93e7654eba	install: Replace EnableHA with resource values (#2572 ) This change moves resource-templating logic into a dedicated template, creates new values types to model kubernetes resource constraints, and changes the `--ha` flag's behavior to create these resource templates instead of hardcoding the resource constraints in the various templates.	2019-03-27 15:56:30 -07:00
Risha Mars	eda36e3258	Always show TCP open connections in the CLI (#2533 ) Allow the TCP CONNECTIONS column to be shown on all stat queries in the CLI. This column will now be called TCP_CONN for brevity. Read/Write bytes will still only be shown on -o wide or -o json	2019-03-27 13:34:28 -07:00
Oliver Gould	fda2035d5c	Use "With .Values" scoping in all templates (#2570 ) Some of our templates have started to use 'with .Values' scoping to limit boilerplate within the tempates. This change makes this uniform in all templates.	2019-03-26 19:09:21 -07:00
Oliver Gould	24222da13b	install: Create auto-inject configuration (#2562 ) When reading a Linkerd configuration, we cannot determine whether auto-inject should be configured. This change adds auto-inject configuration to the global config structure. Currently, this configuration is effectively boolean, determined by the presence of an empty value (versus a null).	2019-03-26 15:28:54 -07:00
Alejandro Pedraza	7efe385feb	Have the Webhook react to pod creation/update only (#2472 ) Have the Webhook react to pod creation/update only This was already working almost out-of-the-box, just had to: - Change the webhook config so it watches pods instead of deployments - Grant some extra ClusterRole permissions - Add the piece that figures what's the OwnerReference and add the label for it - Manually inject service account mount paths - Readd volumes tests Fixes #2342 and #1751 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-03-26 11:53:56 -05:00
Oliver Gould	9c442f6885	Store install UUID in global config (#2561 ) Currently, the install UUID is regenerated each time `install` is run. When implementing cluster upgrades, it seems most appropriate to reuse the prior UUID, rather than generate a new one. To this end, this change stores an "Installation UUID" in the global linkerd config.	2019-03-26 08:45:40 -07:00
Oliver Gould	da0330743f	Provide peer Identities via the Destination API (#2537 ) This change reintroduces identity hinting to the destination service. The Get endpoint includes identities for pods that are injected with an identity-mode of "default" and have the same linkerd control plane. A `serviceaccount` label is now also added to destination response metadata so that it's accessible in prometheus and tap.	2019-03-22 09:19:14 -07:00
Oliver Gould	34ea302a32	inject: Configure proxies to enable Identity (#2536 ) This change adds a new `linkerd2-proxy-identity` binary to the `proxy` container image as well as a `linkerd2-proxy-run` entrypoint script. The inject process now sets environment variables on pods to support identity, including identity names for the destination and identity services. As the proxy starts, the identity helper creates a key and CSR in a tmpfs. As the proxy starts, it reads these files, as well as a serviceaccount token, and provisions a certificate from controller. The proxy's /ready endpoint will not succeed until a certificate has been provisioned. The proxy will not participate in identity with services other than the controllers until the Destination controller is modified to provide identities via discovery.	2019-03-21 18:39:05 -07:00
Oliver Gould	21796be354	install: Create linkerd-config before pods (#2538 ) Because the linkerd-config resource is created after pods that require it, they can be started before the files are mounted, causing the pods to restart integration tests to fail. If we extract the config into its own template file, it can be inserted before pods are created.	2019-03-21 14:01:07 -07:00
Oliver Gould	f02730a90d	Check the cluster's config for install & inject (#2535 ) The introduction of identity in `0626fa37` created new state in the control plane's configuration that must be considered when re-installing the control plane or when injecting pods. This change alters `install` to fail if it would seem to conflict with an existing installation. This behavior may be disabled with the `--ignore-cluster` flag. Furthermore, `inject` now _requires_ that it can fetch a configuration from the control plane in order to operate. Otherwise the `--ignore-cluster` and `--disable-identity` flags must be specified. This change does not actually instrument pods to use identity yet---it lays the framework for proxy identity without changing the test fixture output (besides a change to how identity HA is configured). Fixes #2531	2019-03-21 12:49:46 -07:00
Oliver Gould	0626fa374a	install: Introduce the Identity controller (#2526 ) https://github.com/linkerd/linkerd2/pull/2521 introduces an "Identity" controller, but there is no way to include it in linkerd installation. This change alters the `install` flow as follows: - An Identity service is _always_ installed; - Issuer credentials may be specified via the CLI; - If no Issuer credentials are provided, they are generated each time `install` is called. - Proxies are NOT configured to use the identity service. - It's possible to override the credential generation logic---especially for tests---via install options that can be configured via the CLI.	2019-03-19 17:04:11 -07:00
Oliver Gould	91c5f07650	proxy: Upgrade to identity-capable proxy (#2524 ) The new proxy has changed its configuration as follows: - `LISTENER` urls are now `LISTEN_ADDR` addresses; - `CONTROL_URL` is now `DESTINATION_SVC_ADDR`; - `_NAMESPACE` vars are no longer needed; - The `PROXY_ID` is now the `DESTINATION_CONTEXT`; - The "metrics" port is now the "admin" port, since it serves more than just metrics; - A readiness probe now checks a dedicated /ready endpoint eagerly. Identity injection is NOT* configured by this branch.	2019-03-19 14:20:39 -07:00
Oliver Gould	81f645da66	Remove `--tls=optional` and `linkerd-ca` (#2515 ) The proxy's TLS implementation has changed to use a new _Identity_ controller. In preparation for this, the `--tls=optional` CLI flag has been removed from install and inject; and the `ca` controller has been deleted. Metrics and UI treatments for TLS have not been removed, as they will continue to be valuable for the new Identity system. With the removal of the old identity scheme, the Destination service's proxy ID field is now set with an opaque string (e.g. `ns:emojivoto`) to enable locality awareness.	2019-03-18 17:40:31 -07:00

1 2 3 4 5 ...

324 Commits