linkerd2

Commit Graph

Author	SHA1	Message	Date
Joakim Roubert	55326a61ac	bin/web: Fix shellcheck issues (#4425 ) Signed-off-by: Joakim Roubert <joakimr@axis.com>	2020-05-18 10:46:28 -07:00
Joakim Roubert	9c639dc3b7	bin/test-scale: Fix shellcheck issues (#4424 ) Remove superfluous echo commands in assignments. Add quotes. Simplify the for loops that shellcheck didn't like. Signed-off-by: Joakim Roubert <joakimr@axis.com>	2020-05-18 10:41:49 -07:00
Joakim Roubert	5eba710f54	bin/mkube: Update according to shellcheck suggestions (#4419 ) Also clean up sed Windows path filtering. Signed-off-by: Joakim Roubert <joakimr@axis.com>	2020-05-18 10:03:42 -07:00
Joakim Roubert	1e8bfed83f	bin/fmt: Use sort -u instead of sort \| uniq (#4418 ) No need to pipe output to another program when the functionality exists in sort. Signed-off-by: Joakim Roubert <joakimr@axis.com>	2020-05-18 09:52:53 -07:00
Kevin Leimkuhler	659756e93f	Bump golangci-lint version (#4356 ) Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-05-15 16:22:17 -07:00
Joakim Roubert	0b58a56637	Use -n instead of ! -z in shell scripts (#4404 ) Signed-off-by: Joakim Roubert <joakim.roubert@axis.com>	2020-05-15 14:03:06 -05:00
Alejandro Pedraza	d0d97e9426	Upgrade to Helm v3 (#4373 ) Upgraded to Helm v3.2.1 from v2.16.1, getting rid of Tiller and making other simplifications. Note that the version placeholder in the `values.yaml` files had to be changed from `{version}` to `linkerdVersionValue` because the former confuses Helm v3.	2020-05-14 12:11:47 -05:00
Alejandro Pedraza	fdd7809f13	Increase timeout for Helm cleanup in integration tests (#4317 ) * Increase timeout for Helm cleanup in integration tests Tests were failing sporadically, waiting for the Helm namespace to get cleaned up. I verified that it is getting cleaned up, but taking more time sometimes.	2020-05-01 09:48:37 -05:00
Zahari Dichev	5149152ef3	Multicluster gateway and remote setup command (#4265 ) Add multicluster gateway and setup command Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-04-29 20:33:23 +03:00
drholmie	7a560a723d	Linkerd CLI Chocolatey Package (#4205 ) * Add Linkerd CLI Chocolatey Package This PR partially fixes #3063 by building a chocolatey package for Linkerd2's Windows CLI It adds the build scripts for the Linkerd chocolatey package and based on discussions in https://github.com/linkerd/linkerd2/pull/3921 Signed-off-by: Animesh Narayan Dangwal <animesh.leo@gmail.com>	2020-04-29 09:41:54 -07:00
Alejandro Pedraza	66ec92aa09	Additional Jest reporter for GH Annotations (#4294 ) Second part of #4176 Added extra Jest reporter when running js tests from CI, which will send to stdout a GH annotation for each test failure, something like: ``` ::error file=/home/alpeb/src/forks/linkerd2/web/app/js/components/Navigation.test.jsx::Navigation › checks state when versions do not match ``` See the [health metrics RFC](https://github.com/linkerd/rfc/blob/master/design/0002-ci-health-metrics.md) for more context.	2020-04-28 13:10:27 -05:00
Alejandro Pedraza	437f53cdcf	Fix bin/root-tag when applied to annotated tags (#4299 ) Fixes #4298 Since we started using using annotated tags for releases (because they need to be signed), `bin/root-tag` will append `^0` to them when used after checking out a release tag. E.g.: ``` $ bin/root-tag edge-20.4.4^0 ``` which breaks version checking by the CLI. This PR removes that trailing `^0` whenever it's present	2020-04-27 11:08:51 -05:00
Kevin Leimkuhler	00b8ea22a0	Update kind version (#4280 ) #4195 relaxed the clock skew check to match the Kubernetes 1.17 default heartbeat interval. This is the same issue that was preventing an update to the `kind` version used. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-04-22 11:38:44 -07:00
Alejandro Pedraza	322ba5fd2f	`linkerd uninstall` errors when attempting to delete PSP (#4234 ) * Bug in `linkerd uninstall` when attempting to delete PSP We were using a wrong apiVersion for PSP in `linkerd uninstall`'s output, which avoids removing that resource: ``` $ linkerd uninstall \| kubectl delete -f - clusterrole.rbac.authorization.k8s.io "linkerd-linkerd-controller" deleted clusterrole.rbac.authorization.k8s.io "linkerd-linkerd-destination" deleted ... mutatingwebhookconfiguration.admissionregistration.k8s.io "linkerd-proxy-injector-webhook-config" deleted validatingwebhookconfiguration.admissionregistration.k8s.io "linkerd-sp-validator-webhook-config" deleted namespace "linkerd" deleted error: unable to recognize "uninstall.yml": no matches for kind "PodSecurityPolicy" in version "extensions/v1beta1" $ kubectl get psp -oname podsecuritypolicy.policy/linkerd-linkerd-control-plane ``` I've also replaced the uninstall integration test with a new separate suite that performs the installation, waits for it to be ready, uninstalls, and then confirms `linkerd check --pre` returns as expected.	2020-04-07 11:01:11 -05:00
Alex Leong	d8eebee4f7	Upgrade to client-go 0.17.4 and smi-sdk-go 0.3.0 (#4221 ) Here we upgrade our dependencies on client-go to 0.17.4 and smi-sdk-go to 0.3.0. Since smi-sdk-go uses client-go 0.17.4, these upgrades must be performed simultaneously. This also requires simultaneously upgrading our dependency on linkerd/stern to a SHA which also uses client-go 0.17.4. This keeps all of our transitive dependencies synchronized on one version of client-go. This ALSO requires updating our codegen scripts to use the 0.17.4 version of code-generator and running it to generate 0.17.4 compatible generated code. I took this opportunity to update our code generation script to properly use the version of code-generater from `go.mod` rather than a hardcoded SHA. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-04-01 10:07:23 -07:00
Alejandro Pedraza	0c8171d466	Fix bin/kind-load for pull requests (#4222 ) * Fix bin/kind-load for pull requests Followup to #4212 External PRs were failing because: 1) The image tarballs weren't being loaded from the `images-archives` directory 2) Concurrent calls to `bin/kind` were attempting to download the KinD binary simultaneously, resulting in a "text file busy" error. To avoid that, now we just call `bin/kind` synchronously one time beforehand.	2020-04-01 12:04:24 -05:00
Alejandro Pedraza	22f1606b73	Extract common logic in scripts and CI to load images into KinD (#4212 ) Fixes #4206 Followup to #4167 Extract common logic to load images into KinD, from `bin/kind-load`, `bin/install-pr`, `.github/workflows/kind_integration.yml` and `.github/workflows/release.yml`. Besides removing the duplication, `bin/kind-load` will benefit in performance by having each image be loaded in parallel. ``` Load into KinD the images for Linkerd's proxy, controller, web, grafana, debug and cni-plugin. Usage: bin/kind-load [--images] [--images-host ssh://linkerd-docker] Examples: # Load images from the local docker instance bin/kind-load # Load images from tar files located in the current directory bin/kind-load --images # Retrieve images from a remote docker instance and then load them into KinD bin/kind-load --images --images-host ssh://linkerd-docker Available Commands: --images: use 'kind load image-archive' to load the images from local .tar files in the current directory. --images-host: the argument to this option is used as the remote docker instance from which images are first retrieved (using 'docker save') to be then loaded into KinD. This command requires --images. ```	2020-03-30 16:28:28 -05:00
Kevin Leimkuhler	29db6c12a1	Fix script argument regex (#4188 ) Currently the release tag regex matches against arguments that have `edge` or `stable` as a substring. It should only match against arguments that are either `edge` or `stable`. For example, the graceful error handling is not triggered for the following: ``` ❯ bin/create-release-tag edge-20.3.3 bin/create-release-tag: line 92: release_tag: unbound variable ``` This PR fixes the regex so that the above results in graceful error handling. ``` ❯ bin/create-release-tag edge-20.3.3 Error: valid release channels: edge, stable Usage: bin/create-release-tag edge bin/create-release-tag stable 2.4.8 ```	2020-03-19 15:13:17 -07:00
Alejandro Pedraza	1cbc26a2c1	Upgrade golangci-lint to v1.23.8 (#4181 ) * Upgrade golangci-lint to v1.23.8 This should help with some timeouts we're seeing in CI. I fixed some new warnings found in `inject.go` and `uninject.go`. Also we now have to explicitly disable linting `/controller/gen`. The linter was also complaining that in `/pkg/k8s/fake.go` the `spClient.Interface` and `tsclient.Interface` returned in the function `newFakeClientSetsFromManifests()` aren't used, but I opted to ignore that to leave them available for future tests.	2020-03-18 09:13:19 -05:00
Kevin Leimkuhler	6369cffacc	Add KinD option to `install-pr` script (#4167 ) ## Motivation After #4147 added the `install-pr` script, installing PRs into existing clusters does not work if that cluster is a KinD cluster Changing the script to be able to use KinD, and specifically automate `kind load` would be helpful! ## Solution The script can now be used in the following ways. ``` ❯ bin/install-pr --help Install Linkerd with the changes made in a GitHub Pull Request. Usage: --context: The name of the kubeconfig context to use # Install Linkerd into the current cluster bin/install-pr 1234 # Install Linkerd into the current KinD cluster bin/install-pr [-k\|--kind] 1234 # Install Linkerd into the 'kind-pr-1234' KinD cluster bin/install-pr [-k\|--kind] --context kind-pr-1234 1234 ``` The script assumes that the cluster (KinD or not) has already been created. If the cluster is a KinD cluster, the `-k\|--kind` flag should be passed. If the `--context` flag is not passsed, the install defaults to the current context (`kubectl config current-context`). I also added a [`-h\|--help]` option that describes how to use the script.	2020-03-17 10:54:33 -07:00
Alex Leong	df59448046	Use curl (#4162 ) We use curl for fetching remote files in our `bin` scripts. Replace the use of `wget` with `curl` in `bin/shellcheck` for consistency. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-03-10 12:39:12 -07:00
Alex Leong	586911e340	Add bin/install-pr script (#4147 ) # Install PR This script takes a Github pull request number as an argument, downloads the docker images from the pull request's artifacts, pushes them, and installs them on your Kubernetes cluster. Requires a Github personal access token in the $GITHUB_TOKEN environment variable. Signed-off-by: Alex Leong <alex@buoyant.io>	2020-03-10 10:58:03 -07:00
Kevin Leimkuhler	d69445db55	Improve release tag script (#4144 ) ## Motivation Closes #4140 Automatically create new edge release tag: ``` ❯ bin/create-release-tag edge edge-20.3.2 tag created and signed. tag: edge-20.3.2 To push tag, run: git push origin edge-20.3.2 ``` Validate new stable release tag: ``` ❯ bin/create-release-tag stable 2.7.1 stable-2.7.1 tag created and signed. tag: stable-2.7.1 To push tag, run: git push origin stable-2.7.1 ``` ## Solution The release tag script now takes a release channel argument. If the release channel argument is `stable`, a second argument is required for the version. If the release channel is `edge`, the script gets the current edge version and creates a new edge version with the current year: `YY`, month: `MM`, and increments the current month minor if it is not a new month. If the release channel is `stable`, the script will only validate the version. Example error cases: ``` ❯ bin/create-release-tag Error: create-release-tag accepts 1 or 2 arguments Usage: create-release-tag edge create-release-tag stable x.x.x ``` ``` ❯ bin/create-release-tag foo Error: valid release channels: edge, stable Usage: bin/create-release-tag edge bin/create-release-tag stable 2.4.8 ``` ``` ❯ bin/create-release-tag edge 2.7.1 Error: accepts 1 argument Usage: bin/create-release-tag edge ``` ``` ❯ bin/create-release-tag stable Error: accepts 2 arguments Usage: bin/create-release-tag stable 2.4.8 ``` ``` ❯ bin/create-release-tag stable 2.7 Error: version reference incorrect Usage: bin/create-release-tag stable 2.4.8 ``` ``` ❯ bin/create-release-tag stable 2.7.1.1 Error: version reference incorrect Usage: bin/create-release-tag stable 2.4.8 ```	2020-03-10 10:03:46 -07:00
cpretzer	54deffaadb	Fix shellcheck warning (#4137 ) This is a followup to #4129, fixing this warning: ``` In ./bin/create-release-tag line 32: tmp=$(. "$bindir"/_release.sh; extract_release_notes) ^-------------------^ SC2119: Use extract_release_notes "$@" if function's $1 should mean script's $1. ``` In order to use functions in bash that use optional arguments that don't generate this warning, we have to disable the SC2120 check, as explained here: https://github.com/koalaman/shellcheck/wiki/SC2120#exceptions	2020-03-05 09:49:18 -08:00
Alejandro Pedraza	578a2d1960	CI: Adjustments to the release job (#4129 ) Extracted the logic to pull the latest release notes, out of `bin/create-release-tag` into `bin/_release.sh` so that it can be reused in the `release.yml` workflow, which needs to use that inside `gh_release` when creating the github release in order to have prettier markup release notes instead of a plaintext message pulled out of the tag message. The new extracted function also receives an optional argument with the name of the file to put the release notes into, because the `body_path` parameter in `softprops/action-gh-release` doesn't work with dynamic vars. Finally, now the `website_publish` job will only launch until the `gh_release` has succeeded.	2020-03-05 09:03:30 -05:00
Andrew Seigner	a37316a336	Introduce `bin/shellcheck`, add to ci (#4118 ) PR #4117 was root-caused with the help of `shellcheck`. This change introduces a `bin/shellcheck` script, and adds it to CI. In CI, many checks are disabled to allow it to pass. This will at least prevent introduction of new classes of shell issue, and should motivate re-enabling more checks over time. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2020-03-02 13:18:08 -08:00
Andrew Seigner	b52dc35587	Fix `bin/fetch-proxy` on Linux (#4117 ) `bin/fetch-proxy` was failing on Linux: ```bash $ bin/fetch-proxy linkerd2-proxy-v2.87.0/ linkerd2-proxy-v2.87.0/LICENSE linkerd2-proxy-v2.87.0/bin/ linkerd2-proxy-v2.87.0/bin/linkerd2-proxy bin/fetch-proxy: 31: [: Linux: unexpected operator /home/siggy/code/linkerd2/target/proxy/linkerd2-proxy-v2.87.0 ``` Also in CI: https://github.com/linkerd/linkerd2/runs/473746447?check_suite_focus=true#step:5:32 Unfortunately `bin/fetch-proxy` still returned a zero exit status, because `set -e` does not apply to commands that are part of `if` statements. From https://ss64.com/bash/set.html: ``` -e Exit immediately if a simple command exits with a non-zero status, unless the command that fails is part of an until or while loop, part of an if statement, part of a && or \|\| list, or if the command's return status is being inverted using !. -o errexit ``` Fortunately when the `if` command failed, it fell through to the `else` clause for Linux, and copied `linkerd-proxy` successfully. Root cause was a `==` instead of `=`. `shellcheck` confirms, and also recommends quoting: ```bash $ shellcheck bin/fetch-proxy In bin/fetch-proxy line 31: if [ $(uname) == "Darwin" ]; then ^-- SC2046: Quote this to prevent word splitting. ^-- SC2039: In POSIX sh, == in place of = is undefined. ``` Apply `shellcheck` recommendations. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2020-03-02 12:33:20 -08:00
Zahari Dichev	edd7fd203d	Service Mirroring Component (#4028 ) This PR introduces a service mirroring component that is responsible for watching remote clusters and mirroring their services locally. Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-03-02 21:16:08 +02:00
Kevin Leimkuhler	44f1078498	Fix `fetch-proxy` script on macos (#4112 ) `sha256sum` is not installed by default. Use `openssl dgst -sha256` instead.	2020-02-27 17:03:02 -08:00
Kevin Leimkuhler	e37cb3b932	Add success message for tag script (#4111 ) This adds a message after running the `create-release-script` that I intended to add as part of the initial PR. Example output: ``` ❯ bin/create-release-tag $TAG tag created and signed. tag: edge-93.1.1 To push tag, run: git push origin edge-93.1.1 ```	2020-02-27 10:03:41 -08:00
Kevin Leimkuhler	4aac6445c4	Add script to create release tag (#4091 ) ## Motivation Creating a release tag is a manual process that is prone to error by the release responsible member. Additionally, the automated release project will require that a message is included that is a copy of the recent `CHANGES.md` changes. These steps can be scripted so that the member will just need to run a release script. ## Solution A `bin/create-release-tag` script will: - Take a `$TAG` argument (maybe can remove this in the future) to use as the tag name - Pull out the top section of `CHANGES.md` to use as the commit message - Create the a tag with `$TAG` name and release changes as the message ## Example ``` $ TAG="edge-20.2.3" $ bin/create-release-tag $TAG $ git push $TAG ``` Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2020-02-22 16:30:17 -08:00
Alejandro Pedraza	ea523a46b0	Fixed shellcheck warnings on bin/helm-build (#4080 ) Followup to #4058 ``` $ shellcheck -x bin/helm-build; echo $? 0 ```	2020-02-21 09:51:21 -05:00
Alejandro Pedraza	9b64f0dc94	Reuse bin/helm-build in Helm integration tests (#4088 ) Have the preliminary setup for the Helm integration tests use `bin/helm-build` instead of directly calling `helm dependency update`. This allows testing `bin/helm-build` itself, and also lints the linkerd2 and linkerd2-cni charts (the latter lint call is being added as well in this PR).	2020-02-21 09:26:10 -05:00
Alejandro Pedraza	77af716ab2	bin/helm-build automatically updates version in values.yaml (#4058 ) * bin/helm-build automatically updates version in values.yaml Have the Helm charts building script (`bin/helm-build`) update the linkerd version in the `values.yaml` files according to the tagged version, thus removing the need of doing this manually on every release. This is akin to the update we do in `version.go` at CLI build time. Note that `shellcheck` is issuing some warnings about this script, but that's on code that was already there, so that will be handled in an followup PR.	2020-02-18 11:19:58 -05:00
Zahari Dichev	9b29a915d3	Improve cni resources labels (#4032 ) Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-02-11 12:10:08 +02:00
Alejandro Pedraza	1e8223e143	Allow CI to run concurrent builds in master (#4001 ) * Allow CI to run concurrent builds in master Fixes #3911 Refactors the `cloud_integration` test to run in separate GKE clusters that are created and torn down on the fly. It leverages a new "gcloud" github action that is also used to set up gcloud in other build steps (`docker_deploy` and `chart_deploy`). The action also generates unique names for those clusters, based on the git commit SHA and `run_id`, a recently introduced variable that is unique per CI run and available to all the jobs. This fixes part of #3635 in that CI runs on the same SHA don't interfere with one another (in the `cloud_integration` test; still to do for `kind_integration`). The "gcloud" GH action is hosted under its own repo in https://github.com/linkerd/linkerd2-action-gcloud	2020-02-07 16:23:36 -05:00
Zahari Dichev	c609564dc8	Add helm upgrade integration test (#3976 ) In light of the breaking changes we are introducing to the Helm chart and the convoluted upgrade process (see linkerd/website#647) an integration test can be quite helpful. This simply installs latest stable through helm install and then upgrades to the current head of the branch. Signed-off-by: Zahari Dichev zaharidichev@gmail.com	2020-02-04 08:27:46 +02:00
Zahari Dichev	0dac920362	Init helm before cni dependency update (#3969 ) Moves helm init before cni dependency update and fixes the following problem: https://github.com/linkerd/linkerd2/runs/406581136#step:4:16 Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-01-24 09:34:33 -08:00
Zahari Dichev	a9d38189fb	Fix CNI config parsing (#3953 ) This PR addreses the problem introduced after #3766. Fixes #3941 Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2020-01-23 09:55:04 -08:00
Tarun Pothulapati	eac06b973c	Move common values to global (#3839 ) * move values to global in template Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * update inject and cli Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * update unit tests Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * fix linting issues Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * remote controllerImageVersion from global Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * move identity out of global Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * update var name and comments Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * update bin and helm tests Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * update helm readme Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * fix proxy config Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * fix proxy config indentation Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * more linting issues Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com> * remove unnecessary lines Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2020-01-06 14:31:41 -08:00
Alejandro Pedraza	bb790b22b4	Upgrade `kind` to v0.6.1 (#3864 ) * Upgrade `kind` to v0.6.1 Fixes #3852 Upgraded `/bin/kind` to pull v0.6.1. Also have `workflow.yml` use `KUBECONFIG` explicitly for setting the location of the config file, now that `kind get kubeconfig-path` has been deprecated (check https://github.com/kubernetes-sigs/kind/releases/tag/v0.6.0 for detailed info). Note that in the build server the kind binary for this version is `kind-0.6.1`, leaving the `kind` binary still pointing to v0.5.1 while this gets merged and all the PR branches get this.	2019-12-30 14:32:37 -05:00
Alejandro Pedraza	8c18b0b972	Upgraded `Helm` cli to v2.16.1 (#3865 ) Needed for k8s 1.16	2019-12-23 16:39:26 -05:00
Alejandro Pedraza	1ed70c8aff	Build linkerd2-cni Helm chart in `bin/helm-build` (#3846 ) Fixes #3801 This will package and build the `linkerd2-cni` chart from the `charts/linkerd2-cni` directory and update our Helm Hub's `index.yaml` file to index it. This will only be run in the `chart_deploy` job of our Github Actions when an edge/stable tag is pushed. Once that happens, users will be able to install the chart with a command like: ``` helm install linkerd-edge/linkerd2-cni ``` Docs update will follow.	2019-12-20 10:25:11 -05:00
Eugene Glotov	748da80409	Inject preStop hook into the proxy sidecar container to stop it last (#3798 ) * Inject preStop hook into the proxy sidecar container to stop it last This commit adds support for a Graceful Shutdown technique that is used by some Kubernetes administrators while the more perspective configuration is being discussed in https://github.com/kubernetes/kubernetes/issues/65502 The problem is that RollingUpdate strategy does not guarantee that all traffic will be sent to a new pod _before_ the previous pod is removed. Kubernetes inside is an event-driven system and when a pod is being terminating, several processes can receive the event simultaneously. And if an Ingress Controller gets the event too late or processes it slower than Kubernetes removes the pod from its Service, users requests will continue flowing into the black whole. According [to the documentation](https://kubernetes.io/docs/concepts/workloads/pods/pod/#termination-of-pods) > 1. If one of the Pod’s containers has defined a `preStop` hook, > it is invoked inside of the container. If the `preStop` hook is still > running after the grace period expires, step 2 is then invoked with > a small (2 second) extended grace period. > > 2. The container is sent the `TERM` signal. Note that not all > containers in the Pod will receive the `TERM` signal at the same time > and may each require a preStop hook if the order in which > they shut down matters. This commit adds support for the `preStop` hook that can be configured in three forms: 1. As command line argument `--wait-before-exit-seconds` for `linkerd inject` command. 2. As `linkerd2` Helm chart value `Proxy.WaitBeforeExitSeconds`. 2. As `config.alpha.linkerd.io/wait-before-exit-seconds` annotation. If configured, it will add the following preHook to the proxy container definition: ```yaml lifecycle: preStop: exec: command: - /bin/bash - -c - sleep {{.Values.Proxy.WaitBeforeExitSeconds}} ``` To achieve max benefit from the option, the main container should have its own `preStop` hook with the `sleep` command inside which has a smaller period than is set for the proxy sidecar. And none of them must be bigger than `terminationGracePeriodSeconds` configured for the entire pod. An example of a rendered Kubernetes resource where `.Values.Proxy.WaitBeforeExitSeconds` is equal to `40`: ```yaml # application container lifecycle: preStop: exec: command: - /bin/bash - -c - sleep 20 # linkerd-proxy container lifecycle: preStop: exec: command: - /bin/bash - -c - sleep 40 terminationGracePeriodSeconds: 160 # for entire pod ``` Fixes #3747 Signed-off-by: Eugene Glotov <kivagant@gmail.com>	2019-12-18 16:58:14 -05:00
Tarun Pothulapati	efb1101bdb	Switch to smaller-case values in linkerd2-cni (#3827 ) * update linkerd2-cni templates and cli * update readme and docs * update helm unit tests * update helm build script * use smaller case linkerd version Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-12-16 15:09:57 -08:00
Alejandro Pedraza	2a4c71760d	Enable cert rotation test to work with dynamic namespaces, take two (#3795 ) * Enable cert rotation test to work with dynamic namespaces This PR adds support for dynamic cert generation when running the cert rotation intergration tests. This allows to avoid baking in the namespace in the certificate CN, thereby allowing us to run these tests on the clouds. The tests in #3775 were failing because the second secret holding the issuer cert replacement was a leaf cert and not a root/intermediary cert capable of signing the CSRs. This is how the replacement cert looked like: ```bash $ k -n l5d-integration-external-issuer get secrets linkerd-identity-issuer-new -ojson \| jq '.data\|.["tls.crt"]' \| tr -d '"' \| base64 -d \| step certificate inspect - Certificate: Data: Version: 3 (0x2) Serial Number: 2 (0x2) Signature Algorithm: ECDSA-SHA256 Issuer: CN=identity.l5d-integration-external-issuer.cluster.local Validity Not Before: Dec 6 19:16:08 2019 UTC Not After : Dec 5 19:16:28 2020 UTC Subject: CN=identity.l5d-integration-external-issuer.cluster.local Subject Public Key Info: Public Key Algorithm: ECDSA Public-Key: (256 bit) X: 93:d5:fa:f8:d1:44:4f:9a:8c:aa:0c:9e:4f:98:a3: 8d:28:d9:cc:f2:74:4c:5f:76:14:52:47:b9:fb:c9: a3:33 Y: d2:04:74:95:2e:b4:78:28:94:8a:90:b2:fb:66:1b: e7:60:e5:02:48:d2:02:0e:4d:9e:4f:6f:e9:0a:d9: 22:78 Curve: P-256 X509v3 extensions: X509v3 Key Usage: critical Digital Signature, Key Encipherment X509v3 Extended Key Usage: TLS Web Server Authentication, TLS Web Client Authentication X509v3 Subject Alternative Name: DNS:identity.l5d-integration-external-issuer.cluster.local Signature Algorithm: ECDSA-SHA256 30:46:02:21:00:f6:93:2f:10:ba:eb:be:bf:77:1a:2d:68:e6: 04:17:a4:b4:2a:05:80:f7:c5:f7:37:82:7b:b7:9c:a1:66:6a: e1:02:21:00:b3:65:06:37:49:06:1e:13:98:7c:cf:f9:71:ce: 5a:55:de:f6:1b:83:85:b0:a8:88:b7:cf:21:d1:16:f2:10:f9 ``` For it to be a root/intermediate cert it should have had `CA:TRUE` under the `X509v3 extensions` section. Why did the test pass sometimes? When it did pass for me, I could see in the linkerd-identity proxy logs something like: ``` ERR! [ 320.964592s] linkerd2_proxy_identity::certify Received invalid ceritficate: invalid certificate: UnknownIssuer ``` so the cert retrieved from identity still was invalid but for some reason the proxy, sometimes, keeps on going despite that. And when one would delete the linkerd-identity pod, its proxy wouldn't come up at all, also showing that error. With the changes from this branch, we no longer see that error in the logs and after deleting the linkerd-identity pod it comes back gracefully.	2019-12-11 15:50:06 -05:00
Zahari Dichev	6faf64e49f	Revert "Enable cert rotation test to work with dynamic namespaces (#3775 )" (#3787 ) This reverts commit `0e45b9c03d`. Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2019-12-05 15:33:22 -05:00
Zahari Dichev	0e45b9c03d	Enable cert rotation test to work with dynamic namespaces (#3775 ) This PR adds support for dynamic cert generation when running the cert rotation intergration tests. This allows to avoid baking in the namespace in the certificate CN, thereby allowing us to run these tests on the clouds. * Enable cert rotation test to work with dynamic namespaces Signed-off-by: Zahari Dichev <zaharidichev@gmail.com> * Address comments Signed-off-by: Zahari Dichev <zaharidichev@gmail.com> * Address further comments Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>	2019-12-05 10:08:01 +02:00
Joakim Roubert	e1b3fdb029	Fix whitespace path handling in non-docker (build) scripts (#3650 ) * Fix whitespace path handling in non-docker (build) scripts Handling of whitespace paths was not fully implemented; this patch adds the missing pieces. Also, only use bash where bash-specific functionality is used/needed. Signed-off-by: Joakim Roubert <joakimr@axis.com>	2019-11-26 09:48:41 -05:00
Alex Leong	0026103362	Unit and integration test fixups (#3730 ) - Added cleanup step at the end of all integration tests. - Disable external_issuer_integration_tests in cloud_tests due to namespace issue. Running this via `kind` tests is sufficient for now. - Set a flakey test to `Skip`, relates to #3332. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-11-15 03:40:42 -08:00
Zahari Dichev	2d224302de	Add integration test for external issuer and cert rotation flows (#3709 ) Signed-off-by: zaharidichev <zaharidichev@gmail.com>	2019-11-14 06:58:32 +02:00
Alejandro Pedraza	3324966702	Upgrade go to 1.13.4 (#3702 ) Fixes #3566 As explained in #3566, as of go 1.13 there's a strict check that ensures a dependency's timestamp matches it's sha (as declared in go.mod). Our smi-sdk dependency has a problem with that that got resolved later on, but more work would be required to upgrade that dependency. In the meantime a quick pair of replace statements at the bottom of go.mod fix the issue.	2019-11-13 12:54:36 -05:00
Zahari Dichev	7dd5dfc2ba	Check health of meshed apps before and after linkerd upgrade (#3641 ) * Check stats of deployed app before and after linkerd upgrade to ensure nothing broke Signed-off-by: zaharidichev <zaharidichev@gmail.com> * Address naming remarks Signed-off-by: zaharidichev <zaharidichev@gmail.com> * Improve application health checking Signed-off-by: zaharidichev <zaharidichev@gmail.com>	2019-11-07 20:48:12 +02:00
Zahari Dichev	1bb9d66757	Integration test for custom cluster domain (#3660 ) Signed-off-by: zaharidichev <zaharidichev@gmail.com>	2019-11-04 14:49:52 -08:00
Joakim Roubert	80d644eb1d	docker-build-proxy: make apt work behind proxy (#3643 ) This patch sends the proxy settings to docker build if present. Without this, the docker build will fail on apt-get update on a system that is behind a proxy. Change-Id: I3fcbad4d9a9c30e5f0a00f03c6d8629ed8cc97b0 Signed-off-by: Joakim Roubert <joakimr@axis.com>	2019-11-04 13:17:44 -08:00
Joakim Roubert	478145ce45	Fix whitespace path handling in docker (build) scripts (#3634 ) Handling of whitespace paths was not fully implemented; this patch adds the missing pieces. Also, only use bash where bash-specific functionality is used/needed. Signed-off-by: Joakim Roubert <joakimr@axis.com>	2019-10-30 15:55:38 -07:00
Joakim Roubert	b5309fad04	build-cli-bin: Use case for host_platform selection (#3626 ) Increase readability and extensibility. Change-Id: I0670950e14b59da0971397d08016176650602247 Signed-off-by: Joakim Roubert <joakimr@axis.com>	2019-10-28 16:49:56 -05:00
Joakim Roubert	3411e22bdc	fetch-proxy: Make POSIX compatible (#3625 ) * fetch-proxy: Make POSIX compatible * fetch-proxy: Update old comment to match current behavior Getting the directory where the script resides can easily be done without bash-specific functionality, and hence the script can be POSIX compatible. Change-Id: I30bd69dccbc950bdce3dc5da4bea279305a7b1f9 Signed-off-by: Joakim Roubert <joakimr@axis.com>	2019-10-28 16:45:23 -05:00
Joakim Roubert	0341af86e8	build-cli-bin: POSIX compatible & handle whitespace paths (#3623 ) Getting the directory where the script resides can easily be done without bash-specific functionality, and hence the script can be POSIX compatible. Also adding the missing pieces for handling paths with whitespaces. Change-Id: Ie2e867929be0322e476342438d9cf4a3d36f58f1 Signed-off-by: Joakim Roubert <joakimr@axis.com>	2019-10-28 16:36:53 -05:00
Oliver Gould	87e03ae940	Update proxy update commit messages with tag info (#3594 ) Each proxy release tag now includes a message. This change updates the git-commit-proxy-version script to include this message in the commit message in this repo.	2019-10-18 10:20:38 -07:00
Alejandro Pedraza	e76c5c3d9d	Keep old releases in Helm repo index (#3589 ) * Keep old releases in Helm repo index When building the Helm repo index file, keep the references to the old releases. Also rename and keep the old index file in case something goes wrong when generating the new one. Fixes #3561	2019-10-16 17:21:53 -05:00
Alex Leong	3dcff52b9f	Switch from using golangci fmt to using goimports (#3555 ) CI currently enforcing formatting rules by using the fmt linter of golang-ci-lint which is invoked from the bin/lint script. However it doesn't seem possible to use golang-ci-lint as a formatter, only as a linter which checks formatting. This means any formatter used by your IDE or invoked manually may or may not use the same formatting rules as golang-ci-lint depending on which formatter you use and which specific revision of that formatter you use. In this change we stop using golang-ci-lint for format checking. We introduce `tools.go` and add goimports to the `go.mod` and `go.sum` files. This allows everyone to easily get the same revision of goimports by running `go install -mod=readonly golang.org/x/tools/cmd/goimports` from inside of the project. We add a step in the CI workflow that uses goimports via the `bin/fmt` script to check formatting. Some shell gymnastics were required in the `bin/fmt` script to work around some limitations of `goimports`: * goimports does not have a built-in mechanism for excluding directories, and we need to exclude the vendor director as well as the generated Go sources * goimports returns a 0 exit code, even when formatting errors are detected Signed-off-by: Alex Leong <alex@buoyant.io>	2019-10-16 13:56:11 -07:00
Saurav Tiwary	1e44513f30	Clean username before using as docker image tag (#3572 ) * Clean username before using as docker image tag * Allow Alphanumerics instead of just alphabets in docker image tag Incorporate Alex's suggestions Fixes #3570 Signed-off-by: Saurav Tiwary <srv.twry@gmail.com>	2019-10-15 16:36:48 -07:00
Alejandro Pedraza	3de35ccc58	Remove Discovery service leftovers (#3500 ) Followup to #2990, which refactored `linkerd endpoints` to use the `Destination.Get` API instead of the `Discovery.Endpoints` API, leaving the Discovery with no implented methods. This PR removes all the Discovery code leftovers. Fixes #3499	2019-10-15 11:20:21 -05:00
cpretzer	8f83a56431	Revert upgrade to buster based on CNI test failure after merge (#3486 )	2019-09-26 13:40:43 -07:00
cpretzer	5455a344d8	Update base docker image to debian latest stable: buster (#3438 ) * Update base docker image to debian latest stable: buster Signed-off-by: Charles Pretzer <charles@buoyant.io> * Update all files to use buster image	2019-09-26 09:02:12 -07:00
Kevin Leimkuhler	151104ec5a	Add script to load images into kind cluster (#3458 ) ## Summary [kind](https://github.com/kubernetes-sigs/kind) has been a helpful tool for running local Kubernetes clusters and testing linkerd builds. Once images are built with `bin/docker-build`, the images must be loaded into the kind cluster. This script should be run after `bin/docker-build` and will load the images into the specified kind cluster. Example: ``` $ bin/docker-build $ kind get clusters # show available clusters to load images on to kleimkuhler $ bin/kind-load kleimkuhler $ ./target/cli/linux/linkerd install \| kubectl apply -f - ``` Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2019-09-23 14:43:31 -07:00
Oliver Gould	d51f7f77a7	proxy: Update to v2.71.0 (#3433 ) Update the proxy release process to fetch artifacts from tagged GitHub releases. * Use GitHub Actions for Pull Requests (linkerd/linkerd2-proxy#343) * ci: Run tests inside rust container (linkerd/linkerd2-proxy#344) * update tracing crates (linkerd/linkerd2-proxy#346) * core: Introduce the Recover trait (linkerd/linkerd2-proxy#347) * ci: Automate releases via GitHub Actions (linkerd/linkerd2-proxy#349) * Add opencensus exporter (linkerd/linkerd2-proxy#338) * Add trace context crate (linkerd/linkerd2-proxy#339) * ci: Use a readymade release action (linkerd/linkerd2-proxy#351) * Add 587 to the list of ports to disable protocol detection (linkerd/linkerd2-proxy#350) * Record SHA of package artifact (linkerd/linkerd2-proxy#353)	2019-09-17 15:18:24 -07:00
Alejandro Pedraza	8270ba363c	Add chart_deploy into workflow.yml (#3415 ) * Have CI push the Helm artifacts into GCS - Added missing OWNERS and README files - Added maintainers section to Chart.yaml - Changed NOTES.txt so it points to the installation of the CLI - Set the proxy-init version to v1.1.0 in values.yaml Ref #3256 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-09-11 12:09:50 -05:00
Alejandro Pedraza	bd702b99ae	Last changes before submitting to the Helm incubator (#3292 ) * Last changes before submitting to the Helm incubator - Added missing OWNERS and README files - Added maintainers section to Chart.yaml - Changed NOTES.txt so it points to the installation of the CLI - Set the proxy-init version to v1.1.0 in values.yaml - Added missing ProfileValidator vars, and add 'do not edit' comment to the Identity.Issuer.CrtExpiryAnnotation value - Added new self-hosted repo - Added option to bin/helm-build - Added DisableHeartBeat to README Ref #3256 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-09-10 14:24:39 -05:00
Andrew Seigner	89deacd8d6	Decrease proxy and web Docker image sizes (#3384 ) The `proxy` and `web` Docker images were 161MB and 186MB, respectively. Most of the space was tools installed into the `linkerd.io/base` image. Decrease `proxy` and `web` Docker images to 73MB and 90MB, respectively. Switch these images to be based off of `debian:stretch-20190812-slim`. Also set `-ldflags "-s -w"` for `proxy-identity` and `web`. Modify `linkerd.io/base` to also be based off of `debian:stretch-20190812-slim`, update tag to `2019-09-04.01`. Fixes #3383 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-09-05 11:28:33 -07:00
Alejandro Pedraza	368d16f23c	Fix auto-injecting pods and integration tests reporting (#3335 ) * Fix auto-injecting pods and integration tests reporting When creating an Event when auto-injection occurs (#3316) we try to fetch the parent object to associate the event to it. If the parent doesn't exist (like in the case of stand-alone pods) the event isn't created. I had missed dealing with one part where that parent was expected. This also adds a new integration test that I verified fails before this fix. Finally, I removed from `_test-run.sh` some `\|\| exit_code=$?` that was preventing the whole suite to report failure whenever one of the tests in `/tests` failed. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-28 15:04:20 -05:00
陈谭军	981f5bc85d	fix-up spelling mistake (#3328 ) Signed-off-by: chentanjun <2799194073@qq.com>	2019-08-27 10:24:53 -07:00
Andrew Seigner	ea27e0ca0e	Introduce integration tests into all ci runs (#3293 ) The integration tests under `/test` were run separately via l5d-bot, lacking the feedback and job management provided by ci. Enable integration tests in ci, via a docker build and kind clusters executed on a remote DOCKER_HOST. CI runs are now broken into two stages, run serially. Each stage is composed of jobs run in parallel: - Setup stage - Validate go deps - Remote docker build - Kind cluster setup (deep) - Kind cluster setup (upgrade) - Kind cluster setup (helm) - Test stage - Go unit tests - Node.js unit tests - Kind integration tests (deep) - Kind integration tests (upgrade) - Kind integration tests (helm) This PR also modifies `bin/test-run.sh` to always set `--failfast` for Go tests. Also introduce `bin/docker` and `bin/kubectl` scripts, to ensure cacheable, pinned executables in ci. The existing integration tests for master merges and docker pushes, running against GKE, remain in place. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-26 11:41:17 -07:00
Andrew Seigner	653ec8c5b7	Refactor bin/test-run for running tests separately (#3304 ) The `bin/test-run` script executed upgrade, helm, and deep integration test in series, but was structured in a way that did not permit running these tests individually. Move most of the logic from `bin/test-run` to a supporting library, `bin/test-run.sh`, which will provide the ability to execute integration tests individually. `bin/test-run`'s behavior is unchanged, it continues to run upgrade, helm, and deep integration tests in series. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-22 11:05:06 -07:00
Andrew Seigner	4e058bfea2	Introduce bin/kind, move executables to target/bin (#3289 ) `bin/helm` and `bin/protoc` were downloading their binaries into `./target`, while `bin/lint` was downloading to the root of the repo. Also travis was caching `./target`, which could become problematic if that part of the test script relied on `target/cli/linux/linkerd`. Standardize helm, kind, lint, and protoc to all download into `./target/bin`, and modify travis to strictly cache that subdirectory. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-21 19:49:21 -07:00
Alejandro Pedraza	879650cef9	Wait for `helm delete` to finish in integration test (#3259 ) * Wait for `helm delete` to finish in integration test Followup to #3251 In `helm_cleanup` block till the linkerd namespace has been deleted Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-14 19:15:34 -05:00
Ivan Sim	e52afc1197	Update the Helm build script (#3248 ) * Update Helm build script to pin the Helm CLI version * Update Linkerd version in the Helm values file Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-14 16:04:56 -07:00
Andrew Seigner	6c0ee2475b	Cleanup helm before full test-cleanup (#3251 ) PR #3247 introduced additional helm cleanup in `bin/test-cleanup`. During the integration tests, `bin/test-cleanup` is called prior to `helm_cleanup` in `bin/test-run`. This causes `helm_cleanup` to fail, as resources have already been deleted by `bin/test-cleanup`, and the integration tests fail with `FAIL: error cleaning up Helm`. Modify the integration tests to first call `helm_cleanup` prior to calling `bin/test-cleanup`. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-14 10:47:17 -07:00
Andrew Seigner	9826cbdfe0	Label and cleanup helm after integration tests (#3247 ) When helm integration tests fail, `bin/test-run` exits prior to calling `helm_cleanup`, leaving behind a helm namespace and clusterrolebinding. Update `bin/test-cleanup` to delete any remaining helm resources. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-14 09:30:48 -07:00
Ivan Sim	4d01e3720e	Update install and upgrade code to use the new helm charts (#3229 ) * Delete symlink to old Helm chart * Update 'install' code to use common Helm template structs * Remove obsolete TLS assets functions. These are now handle by Helm functions inside the templates * Read defaults from values.yaml and values-ha.yaml * Ensure that webhooks TLS assets are retained during upgrade * Fix a few bugs in the Helm templates (see bullet points): * Merge the way the 'install' ha and non-ha options are handled into one function * Honor the 'NoInitContainer' option in the components templates * Control plane mTLS will not be disabled if identity context in the config map is empty. The data plane mTLS will still be automatically disabled if the context is nil. * Resolve test failures from rebase with master * Fix linter issues * Set service account mount path read-only field * Add TLS variables of the webhooks and tap to values.yaml During upgrade, these secrets are preserved to ensure they remain synced wih the CA bundle in the webhook configurations. These Helm variables are used to override the defaults in the templates. * Remove obsolete 'chart' folder * Fix bugs in templates * Handle missing webhooks and tap TLS assets during upgrade When upgrading from an older version that don't have these secrets, fallback to let Helm create them by creating an empty charts.TLS struct. * Revert the selector labels of webhooks to be compatible with that in 2.4 In 2.4, the proxy injector and profile validator webhooks already have their selector labels defined. Since these attributes are immutable, the recent change to these selectors introduced by the Helm chart work will cause upgrade to fail. * Alejandro's feedback * Siggy's feedback * Removed redundant unexported custom types Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-13 14:16:24 -07:00
Alejandro Pedraza	d64a2f3689	Add integration test for `helm install` (#3223 ) Ref #3143 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-13 09:14:32 -05:00
Alejandro Pedraza	0410b772f9	Bash function to setup Helm (#3218 ) * helm binary wrapper Ref #3143 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-12 10:08:12 -05:00
Alejandro Pedraza	3ae653ae92	Refactor proxy injection to use Helm charts (#3200 ) * Refactor proxy injection to use Helm charts Fixes #3128 A new chart `/charts/patch` was created, that generates the JSON patch payload that is to be returned to the k8s API when doing the injection through the proxy injector, and it's also leveraged by the `linkerd inject --manual` CLI. The VFS was used by `linkerd install` to access the old chart under `/chart`. Now the proxy injection also uses the Helm charts to generate the JSON patch (see above) so we've moved the VFS from `cli/static` to a new common place under `/pkg/charts/static`, and the new root for the VFS is now `/charts`. `linkerd install` hasn't yet migrated to use the new charts (that'll happen in #3127), so the only change in that regard was the creation of `/charts/chart` which is a symlink pointing to `/chart` that `install.go` now uses, so that the VFS contains both the old and new charts, as a temporary measure. You can see that `/bin/Dockerfile-bin`, `/controller/Dockerfile` and `/bin/build-cli-bin` do now `go generate` pointing to the new location (and the `go generate` annotation was moved from `/cli/main.go` to `pkg/charts/static/templates.go`). The symlink trick doesn't work when building the binaries through Docker, so `/bin/Dockerfile-bin` replaces the symlink with an actual copy of `/chart`. Also note that in `/controller/Dockerfile` we now need to include the `prod` tag in `go install` like we do in `/bin/Dockerfile-bin` so that the proxy injector does use the VFS instead of the local file system. - The common logic to parse a chart has been moved from `install.go` to `/pkg/charts/util.go`. - The special ENV var in the proxy for "outbound router capacity" that only applies to the Prometheus pod is now handled directly in the proxy partial and all the associated go code could be removed. - The `patch.go` lib for generating the JSON patch in go along with its tests `patch_test.go` are no longer needed. - Lots of functions in `/pkg/inject/inject.go` got removed/simplified with their logic being moved into the charts themselves. As a consequence lots of things in `inject_test.go` became irrelevant. - Moved `template-values.go` from `/pkg/inject` to `pkg/charts` as that contains the go structs representation of the chart variables that will be leveraged in #3127. Don't forget to run `/bin/helm.sh` whenever you make changes to charts ;-) Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-07 17:32:37 -05:00
Alejandro Pedraza	5ae41fe856	Fixd bash checks in `bin/helm.sh` and `bin/test-cleanup` (#3205 ) `bin/helm.sh`: you should see the following only if you have Tiller installed in your cluster (which is installed with `helm init`): ``` Performing dry run installation NAME: linkerd Performing dry run installation (HA mode) NAME: linkerd ``` `bin/test-cleanup`: when linkerd is not installed: Before: ```bash $ bin/test-cleanup cleaning up control-plane namespaces in k8s-context [] cleaning up data-plane namespaces in k8s-context [] cleaning up rolebindings in kube-system namespace in k8s-context [] ``` After this PR's changes: ``` $bin/test-cleanup cleaning up control-plane namespaces in k8s-context [] no control-plane namespaces found cleaning up data-plane namespaces in k8s-context [] no data-plane namespaces found no clusterrolebindings found no clusterroles found no mutatingwebhookconfigurations found no validatingwebhookconfigurations found no podsecuritypolicies found no customresourcedefinitions found no apiservices found cleaning up rolebindings in kube-system namespace in k8s-context [] ``` Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-08-07 12:41:33 -05:00
Ivan Sim	2bbf26748f	Add Control Plane Helm Templates And Proxy Partials (#3146 ) * Updated controller template with proxy partials * Declare dependency in requirements.yaml * Add partial template for proxy's metadata * Add proxy-init partial template * Script to lint Helm charts and update their dependencies * Update partials chart Chart.yaml * Add proxy-init and resource partial templates * Replace hard coded namespace variable in proxy env var * Ignore chart dependencies .tgz files * Add missing fields and re-order YAML elements to match CLI output * Reuse control plane's resource partial template in 'partials' chart * Set the proxy's destination service address env var * Add Grafana's template * Update api version of controller RBAC * Add Heartbeat template * Remove duplicated resources partial template * Add remainder control plane components templates * Add template for the 'linkerd-config' config map * Add debug container template * Update proxy partial with 'disable-identity' and 'disable-tap' variables Note that these are inject-only variables. Also added the LINKERD2_PROXY_TAP_SVC_NAME env var. * Add validation conditions to ensure identity and tap aren't disabled for control plane components * Add partials for service account token mount path and security context capabilities * Change proxy and proxy-init templates to use global scope Some of the nested variables are removed from values.yaml to ensure changes made to root-level variables are propagated directly into the partial templates. The previous approach of using YAML anchors in the values.yaml to share common values can get out-of-sync when values are changed via the Helm's `--set` option. * Update templates and values file to match #3161 * Perform a dry run installation if there is a local Tiller * Reorder JSON elements in linkerd-config * Re-adjust nested partials indentation to work with inject 'patch' chart Previously, the partials will render their content as an element in the list. While it works for installation, the toJson function in the 'inject' patch code ends up converting it into a JSON list, instead of the expected JSON object. * Trap the last fail command in the Helm shell script * Add the identity trust anchor * Address Thomas' feedback on handling HA All the HA-related variables are moved to values-ha.yaml * Convert ignore ports string to JSON list in linkerd-config Also fixed some indentation issues. * Add values-ha.yaml * Include the service account token mount path only if identity is enabled * Fixed malformed JSON in linkerd-config config map * Rename chart to 'linkerd2' * Add NOTES.txt * Fix incorrect variable path in proxy template * Remove fake TLS assets * Add 'required' constraint to identity trust anchors variable * Update tap templates per #3167 * Bump default version to edge-19.8.1 due to dependency on RSA support Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-08-06 09:18:19 -07:00
Andrew Seigner	a59c1dd32d	Introduce tap APIService, update `linkerd tap` (#3167 ) The Tap Service enabled tapping of any meshed pod, regardless of user privilege. This change introduces a new Tap APIService. Kubernetes provides authentication and authorization of Tap requests, and then forwards requests to a new Tap APIServer, which implements a Kubernetes aggregated APIServer. The Tap APIServer authenticates the client TLS from Kubernetes, and authorizes the user via a SubjectAccessReview. This change also modifies the `linkerd tap` command to make requests against the new APIService. The Tap APIService implements these Kubernetes-style endpoints: POST /apis/tap.linkerd.io/v1alpha1/watch/namespaces/:ns/tap POST /apis/tap.linkerd.io/v1alpha1/watch/namespaces/:ns/:res/:name/tap GET /apis GET /apis/tap.linkerd.io GET /apis/tap.linkerd.io/v1alpha1 GET /healthz GET /healthz/log GET /healthz/ping GET /metrics GET /openapi/v2 GET /version Users authorize to the new `tap.linkerd.io/v1alpha1` via RBAC. Only the `watch` verb is supported. Access is also available via subresources such as `deployments/tap` and `pods/tap`. This change introduces the following resources into the default Linkerd install: - Global - APIService/v1alpha1.tap.linkerd.io - ClusterRoleBinding/linkerd-linkerd-tap-auth-delegator - `linkerd` namespace: - Secret/linkerd-tap-tls - `kube-system` namespace: - RoleBinding/linkerd-linkerd-tap-auth-reader Tasks not covered by this PR: - `linkerd top` - `linkerd dashboard` - `linkerd profile --tap` - removal of the unauthenticated tap controller Fixes #2725, #3162, #3172 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-08-01 14:02:45 -07:00
Andrew Seigner	f0f3f8e5c5	Bump golangci-lint to 1.17.1 (#3150 ) Also add `bodyclose` linter Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-29 10:49:03 -07:00
Andrew Seigner	18b74aa8a8	Introduce Go modules support (#2481 ) The repo relied on `dep` for managing Go dependencies. Go 1.11 shipped with Go modules support. Go 1.13 will be released in August 2019 with module support enabled by default, deprecating GOPATH. This change replaces `dep` with Go modules for dependency management. All scripts, including Docker builds and ci, should work without any dev environment changes. To execute `go` commands directly during development, do one of the following: 1. clone this repo outside of `GOPATH`; or 2. run `export GO111MODULE=on` Summary of changes: - Docker build scripts and ci set `-mod=readonly`, to ensure dependencies defined in `go.mod` are exactly what is used for the builds. - Dependency updates to `go.mod` are accomplished by running `go build` and `go test` directly. - `bin/go-run`, `bin/build-cli-bin`, and `bin/test-run` set `GO111MODULE=on`, permitting usage inside and outside of GOPATH. - `gcr.io/linkerd-io/go-deps` tags hashed from `go.mod`. - `bin/update-codegen.sh` still requires running from GOPATH, instructions added to BUILD.md. Fixes #1488 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-25 14:41:38 -07:00
Alex Leong	d6ef9ea460	Update ServiceProfile CRD to version v1alpha2 and remove validation (#3078 ) The openAPIV3Schema validation in the ServiceProfiles CRD is very limited in what it can validate and is obviated by more sophisticated validation done by the validating admission controller. Therefore, we would like to remove the openAPIV3Schema validation to reduce the size and complexity of the CRD object. To do so, we must also bump the version of the ServiceProfile custom resource from v1alpha1 to v1alpha2. This ensures that when the controller is upgraded, it will attempt to watch the v1alpha2 resource. If it cannot (because, for example, the controller pod started before the ServiceProfile CRD was updated and therefore the v1alpha2 version does not exist) then it will go into a crash loop backoff until it can. This essentially means that the controller will wait for the CRD to be upgraded to include v1alpha2 before it will start. Bumping the version is necessary because if we did not, it would be possible for the controller to start before the CRD is updated (removing the validation). In this case, when the CRD is edited, the controller will lose its list watch on ServiceProfiles and will stop getting updates. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-23 11:46:31 -07:00
Ivan Sim	f535c2d3d2	Integration Test Script Pre-/Post-Test Cleanup (#3108 ) * Updates for the integration test script 1. Remove existing resources prior to starting the test 2. Remove existing resources post upgrade test 3. Fail fast if 'install_test.go` fails 4. Don't perform cleanup if any of the tests fail for debugging opportunity * Remove pre-test cleanup from .travis.yaml This is now done in the bin/test-run script so that it can be shared between l5d-bot and staging. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-07-19 11:13:03 -07:00
Alejandro Pedraza	68f2f694e3	Improve object cleanup when integration tests fail (#3080 ) Integration tests may fail and leave behind namespaces that following builds aren't able to clean up because the git sha is being included in the namespace name, and the following builds don't know about those shas. This modifies the `test-cleanup` script to delete based on object labels instead of relying on the objects names, now that after 2.4 all the control plane components are labeled. Note that this will also remove non-testing linkerd namespaces, but we were already kinda doing that partially because we were removing the cluster-level resources (CRDs, webhook configs, clusterroles, clusterrolebindings, psp). `test-cleanup` no longer receives a namespace name as an argument. The data plane namespaces aren't labeled though, so I've added the `linkerd.io/is-test-data-plane` label for them in `CreateNamespaceIfNotExists()`, and making sure all tests that need a data plaine explicitly call that method instead of creating the namespace as a side-effect in `KubectlApply()`. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-07-12 15:01:10 -05:00
Oliver Gould	c1aaaf8114	git-commit-proxy-version: Omit SHAs from commit (#3076 ) The git-commt-proxy-version script attempted to link to the specifc SHAs. GitHub doesn't actually render these links, and the information is redundant since we link the appropriate PR.	2019-07-11 14:48:30 -07:00
Oliver Gould	7699ef256d	git-commit-proxy-version: fixup invalid git invocation (#3075 )	2019-07-11 14:21:03 -07:00
Oliver Gould	38597083eb	Add bin/git-commit-proxy-version (#3071 ) Each time we update the proxy from the linkerd2-proxy repo, we make the change slightly differently. The bin/git-commit-proxy-version does all the steps needed to update the proxy version up to and including making a commit to this repo. The proxy version is now stored in a .proxy-version file and is consumed directly by Dockerfile-proxy, which both simplifies the Dockerfile and the update process. This script formats commit messages and emits output as follows: ``` commit c05198a851f69bdc7007974a0ef1f4c01c98d0ce (HEAD -> ver/proxy-update) Author: Oliver Gould <ver@buoyant.io> Date: Thu Jul 11 17:23:05 2019 +0000 proxy: Update to linkerd/linkerd2-proxy#3a3ec3b * linkerd/linkerd2-proxy#0cc58cd fallback: Clarify fallback layering (linkerd/linkerd2-proxy#288) * linkerd/linkerd2-proxy#b71349a Replace `log` and `env-logger` with `tracing` and `tracing-fmt` (linkerd/linkerd2-proxy#277) * linkerd/linkerd2-proxy#3a3ec3b Use a constant-time load balancer (linkerd/linkerd2-proxy#266) diff --git a/.proxy-version b/.proxy-version index f81f40de..d7faa12d 100644 --- a/.proxy-version +++ b/.proxy-version @@ -1 +1 @@ -05b012d +3a3ec3b ```	2019-07-11 14:04:46 -07:00
Andrew Seigner	50e82de47b	Fix upgrade tests conflicting with integration (#3069 ) Integration tests on master broke following the 2.4 release, caused by the recent disabling of multi control-plane support, coupled with the upgrade integration test (which now upgrades from 2.4 to current sha). The integration tests do the following: 1. install the current sha 2. test the current sha 3. install the latest stable in an `upgrade` namespace 4. in the `upgrade` namespace, upgrade from stable to latest sha 5. test the upgraded installation Step 3 breaks because `linkerd install` with stable-2.4 will fail if existing global resources (from step 1) are present. For now, modify the integration tests to do the following: 1. install the latest stable in an `upgrade` namespace 2. in the `upgrade` namespace, upgrade from stable to latest sha 3. test the upgraded installation 4. upon successful step 3, remove all related resources 5. install the current sha 6. test the current sha Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-11 17:44:18 +02:00
Alex Leong	9a61c2adc2	Bump proxy dep (#3042 ) Pick up the following proxy changes: * Update httparse to v1.3.4 * canonicalize: stop resolving when the receiver is dropped * router: Remove interval from router eviction Signed-off-by: Alex Leong <alex@buoyant.io>	2019-07-05 17:17:16 -07:00
Eliza Weisman	c849eed4a9	proxy: update to linkerd/linkerd2-proxy#0a7e206 (#3024 ) * 0a7e206 Update h2 to v0.1.25 (linkerd2/linkerd2-proxy#282) * 0e3ef79 Propagate HTTP2 errors from client RST_STREAMs (linkerd2/linkerd2-proxy#281) Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2019-07-02 16:21:32 -07:00
Alex Leong	f90a3c09ed	Bump proxy version to pick up traffic split (#3012 ) Signed-off-by: Alex Leong <alex@buoyant.io>	2019-06-28 15:32:14 -07:00
Ivan Sim	866fe6fa5e	Introduce global resources checks to install and multi-stage install (#2987 ) * Introduce new checks to determine existence of global resources and the 'linkerd-config' config map. * Update pre-check to check for existence of global resources This ensures that multiple control planes can't be installed into different namespaces. * Update integration test clean-up script to delete psp and crd Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-06-27 09:59:12 -07:00
Kevin Leimkuhler	64e666fc11	Bump proxy for edge-19.6.3 (#2986 ) * Fixed the proxy rejecting HTTP2 requests that don't have an `:authority` * Improved idle service eviction to reduce resource consumption for clients that send requests to many services Signed-off-by: Kevin Leimkuhler <kleimkuhler@icloud.com>	2019-06-21 14:50:34 -07:00
Dennis Adjei-Baah	84fbd7fc08	delete webhook configs using script (#2966 )	2019-06-20 09:45:11 -07:00
Dennis Adjei-Baah	bd7d567fe1	travis integration test cleanup (#2945 ) * Update travis to clean up cluster level resources	2019-06-18 09:53:21 -07:00
Oliver Gould	374a4dbcb1	proxy: update to linkerd/linkerd2-proxy#35df8ab (#2939 ) 439fbfed Update to rust-1.35.0 (linkerd/linkerd2-proxy#265) db26495e Honor `l5d-override-dst` for inbound service profiles (linkerd/linkerd2-proxy#267) a476e995 metrics: Include the prefix of a Report in log lines (linkerd/linkerd2-proxy#262) 1a52a5e6 discovery: Fall back in MakeService, only on InvalidArgument (linkerd/linkerd2-proxy#268) 35df8ab4 metrics: Classify response errors (linkerd/linkerd2-proxy#269)	2019-06-13 14:15:19 -07:00
Oliver Gould	39b8942095	proxy: Update to linkerd/linkerd2-proxy#790a86a (#2898 ) commit 790a86aa9db463af479647bb91b8b55280d74d4 Author: Sean McArthur <sean@buoyant.io> Date: Tue Jun 4 20:28:05 2019 -0700 Update h2 to v0.1.23 (#264) - Fixes leaked DATA frames if never polled. Signed-off-by: Sean McArthur <sean@buoyant.io>	2019-06-05 08:08:04 -07:00
Alejandro Pedraza	74ca92ea25	Split proxy-init into separate repo (#2824 ) Split proxy-init into separate repo Fixes #2563 The new repo is https://github.com/linkerd/linkerd2-proxy-init, and I tagged the latest there `v1.0.0`. Here, I've removed the `/proxy-init` dir and pinned the injected proxy-init version to `v1.0.0` in the injector code and tests. `/cni-plugin` depends on proxy-init, so I updated the import paths there, and could verify CNI is still working (there is some flakiness but unrelated to this PR). For consistency, I added a `--init-image-version` flag to `linkerd inject` along with its corresponding override config annotation. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-06-03 16:24:05 -05:00
Oliver Gould	20715da2c9	proxy: Update to linkerd2/linkerd2-proxy#ed32e496 (#2868 ) linkerd2/linkerd2-proxy#b3dcc6e0 Use the proxy's log formatting in tests (linkerd2/linkerd2-proxy#258) linkerd2/linkerd2-proxy#1c91a398 Rewrite the destination client and remove DNS fallback (linkerd2/linkerd2-proxy#259) linkerd2/linkerd2-proxy#ed32e496 Update h2 to v0.1.21 (linkerd2/linkerd2-proxy#261)	2019-05-30 13:01:00 -07:00
Andrew Seigner	bd4c2788fa	Modify upgrade integration test to upgrade stable (#2835 ) In #2679 we introduced an upgrade integration test. At the time we only supported upgrading from a recent edge. Since that PR, a stable build was released supporting upgrade. Modify the upgrade integration test to upgrade from the latest stable rather than latest edge. This fulfills the original intent of #2669. Also add some known k8s event warnings. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-05-20 13:36:07 -07:00
Oliver Gould	f4da6c228c	Update the proxy to linkerd/linkerd2-proxy#3e0e00c (#2828 ) commit b27dfb2d21aa8ca5466ea0edce17d27094ace7c1 Author: Takanori Ishibashi <takanori.1112@gmail.com> Date: Wed May 15 05:58:42 2019 +0900 updaes->updates (#250) Signed-off-by: Takanori Ishibashi <takanori.1112@gmail.com> commit 16441c25a9d423a6ab12b689b830d9ae3798fa00 Author: Eliza Weisman <eliza@buoyant.io> Date: Tue May 14 14:40:03 2019 -0700 Pass router::Config directly to router::Layer (#253) Currently, router `layer`s are constructed with a single argument, a type implementing `Recognize`. Then, the entire router stack is built with a `router::Config`. However, in #248, it became necessary to provide the config up front when constructing the `router::layer`, as the layer is used in a fallback layer. Rather than providing a separate type for a preconfigured layer, @olix0r suggested we simply change all router layers to accept the `Config` when they're constructed (see https://github.com/linkerd/linkerd2-proxy/pull/248#discussion_r283575008). This branch changes `router::Layer` to accept the config up front. The `router::Stack` types `make` function now requires no arguments, and the implementation of `Service` for `Stack` can be called with any `T` (as the target is now ignored). Signed-off-by: Eliza Weisman <eliza@buoyant.io> commit b70c68d4504a362eac6a7828039a2e5c7fcd308a Author: Eliza Weisman <eliza@buoyant.io> Date: Wed May 15 13:14:04 2019 -0700 Load balancers fall back to ORIG_DST when no endpoints exist (#248) Currently, when no endpoints exist in the load balancer for a destination, we fail the request. This is because we expect endpoints to be discovered by both destination service queries _and_ DNS lookups, so if there are no endpoints for a destination, it is assumed to not exist. In linkerd/linkerd2#2661, we intend to remove the DNS lookup from the proxy and instead fall back to routing requests for which no endpoints exist in the destination service to their SO_ORIGINAL_DST IP address. This means that the current approach of failing requests when the load balancer has no endpoints will no longer work. This branch introduces a generic `fallback` layer, which composes a primary and secondary service builder into a new layer. The primary service can fail requests with an error type that propages the original request, allowing the fallback middleware to call the fallback service with the same request. Other errors returned by the primary service are still propagated upstream. In contrast to the approach used in #240, this fallback middleware is generic and not tied directly to a load balancer or a router, and can be used for other purposes in the future. It relies on the router cache eviction added in #247 to drain the router when it is not being used, rather than proactively destroying the router when endpoints are available for the lb, and re-creating it when they exist again. A new trait, `HasEndpointStatus`, is added in order to allow the discovery lookup to communicate the "no endpoints" state to the balancer. In addition, we add a new `Update::NoEndpoints` variant to `proxy::resolve::Update`, so that when the control plane sends a no endpoints update, we switch from the balancer to the no endpoints state _immediately_, rather than waiting for all the endpoints to be individually removed. When the balancer has no endpoints, it fails all requests with a fallback error, so that the fallback middleware A subsequent PR (#248) will remove the DNS lookups from the discovery module. Closes #240. Signed-off-by: Eliza Weisman <eliza@buoyant.io> commit 6525b0638ad18e74510f3156269e0613f237e2f5 Author: Zahari Dichev <zaharidichev@gmail.com> Date: Wed May 15 23:35:09 2019 +0300 Allow disabling tap by setting an env var (#252) This PR fixes linkerd/linkerd2#2811. Now if `LINKERD2_PROXY_TAP_DISABLED` is set, the tap is not served at all. The approach taken is that the `ProxyParts` is changed so the `control_listener` is now an `Option` that will be None if tap is disabled as this control_listener seems to be exclusively used to serve the tap. Feel free to suggest a better approach. Signed-off-by: Zahari Dichev <zaharidichev@gmail.com> commit 91f32db2ea6d74470fd689c713ff87dc7586222d Author: Zahari Dichev <zaharidichev@gmail.com> Date: Thu May 16 00:45:23 2019 +0300 Assert that outbound TLS works before identity is certified (#251) This commit introduces TLS capabilities to the support server as well as tests to ensure that outbound TLS works even when there is no verified certificate for the proxy yet. Fixes linkerd/linkerd2#2599 Signed-off-by: Zahari Dichev <zaharidichev@gmail.com> commit 45aadc6b1b28e6daea0c40e694a86ae518887d85 Author: Sean McArthur <sean@buoyant.io> Date: Wed May 15 14:25:39 2019 -0700 Update h2 to v0.1.19 Includes a couple HPACK fixes Signed-off-by: Sean McArthur <sean@buoyant.io> commit 3e0e00c6dfbf5a9155b887cfd594f611edfc135f Author: Oliver Gould <ver@buoyant.io> Date: Thu May 16 08:11:06 2019 -0700 Update mio to 0.6.17 (#257) To pick up https://github.com/tokio-rs/mio/pull/939	2019-05-16 10:19:17 -07:00
Eliza Weisman	18a6b596ee	proxy: Update to linkerd/linkerd2-proxy#5f89351 (#2814 ) commit 5f89351081eff47a4ab8cd88e2e1a69a04f86541 Author: Oliver Gould <ver@buoyant.io> Date: Thu May 9 16:39:24 2019 -0700 Upgrade tower dependencies (#249) Tower must be updated in order to pickup tower-rs/tower#281 to address linkerd/linkerd2#2804. This adopts released crates where possible. commit 5d5eed6f8180b8db4090d995e71fdf7b0890c647 Author: Zahari Dichev <zaharidichev@gmail.com> Date: Thu May 9 01:08:34 2019 +0300 Assert that TLS connection is refused if identity is not certified yet (#243) This branch adds tls capability to the support cient used in tests. In addition to that it adds two tests verifying that a TLS connection is refused in case the identity is not certified yet. This attempts to fix #https://github.com/linkerd/linkerd2/issues/2598 and provide facility to write tests for https://github.com/linkerd/linkerd2/issues/2676. As these are still some of my first lines of Rust code, it is advised to approach everything with a healthy dose of doubt :) Signed-off-by: Zahari Dichev <zaharidichev@gmail.com> commit 1b9bb3745e44c959d1d41d14fed2b2822c82b5ba Author: Oliver Gould <ver@buoyant.io> Date: Wed May 8 14:28:37 2019 -0700 Introduce dispatch timeouts around buffers (#246) The proxy has several buffers, especially where it routes requests over shared stacks. If any of these routes is unavailable, then a request may remain buffered indefinitely. Previously, before service profiles were introduced, there was a default _response_ timeout that would cause these requests to fail; but since this response timeout is now optional (and is only applied once the request has been routed within a proxy), then we need a new mechanism to prevent requests from getting "stuck". This change does the following: - all proxied requests are annotated with a dispatch deadline; - each time a request is bufered, a timeout is registered. - if the timeout fires, the response exception fails, a 503 is returned, and the request is dropped. - if the request is processed into the inner stack, the timeout is ignored. The dispatch timeout limits the _time a request is buffered in a proxy_. This is distinct from the response timeout, as the server's response may naturally be delayed for any number of (non-proxy-related) reasons. The `insert_target` module has been generalized to `insert` to support setting the DispatchDeadline extension. The `buffer` module has been augmented with generic deadline-extraction logic. The `svc` module now exposes its own builder type that notably adds a `buffer_pending` helper. It's helpful to pull a builder type into the proxy to assist debugging type errors when modifying stacks. Fixes linkerd/linkerd2#2779 linkerd/linkerd2#2795 commit caf899557c3b041190f63544da865396231b3e30 Author: Oliver Gould <ver@buoyant.io> Date: Fri May 3 15:55:32 2019 -0700 router: Fail requests when the route is not ready (#241) In linkerd/linkerd2#2779, we plan to expire requests while they are buffered. However, the router _implicitly_ buffers requests in the executor when the inner service is not ready. This change alters the route to wrap all inner layers in a `LoadShed` so it can expect all services to `poll_ready()` immediately. commit 587bad101d9e5daeacb24b6733097c350a798356 Author: Eliza Weisman <eliza@buoyant.io> Date: Fri May 3 14:18:08 2019 -0700 Remove Destination service query concurrency limit (#244) Currently, the proxy enforces a limit on the number of concurrent queries (i.e., the number of gRPC streams) to the Destination service. This limit was added based on information about the behaviour of the Destination service that is now known to be incorrect. This branch removes the limit on concurrent queries from the proxy's `control::destination` module. Although it should now be possible to simplify this code as a result of this change, I've refrained from doing any major refactoring in this branch --- my intention is to do this after the DNS fallback behaviour has also been removed, as together with this change, that will result in a _significant_ simplification of the module. Additionally, I've removed the tests for the concurrency limit, as they are no longer relevant. The `LINKERD2_PROXY_DESTINATION_CLIENT_CONCURRENCY_LIMIT` environment variable was also removed; this is not a breaking change as neither the CLI nor the proxy injector will currently set this env var. Signed-off-by: Eliza Weisman <eliza@buoyant.io> commit cbdf45b44f7e4d852dc0497716062167ab9539fb Author: Sean McArthur <sean@buoyant.io> Date: Thu May 2 11:47:48 2019 -0700 Remove h2::Error requirement from metrics Signed-off-by: Sean McArthur <sean@buoyant.io> commit 3276949d4608dc4344b7bed3de2fc4b3080c2c6e Author: Sean McArthur <sean@buoyant.io> Date: Thu May 2 09:44:00 2019 -0700 delete unused proxy::http::metrics::class module Signed-off-by: Sean McArthur <sean@buoyant.io> Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2019-05-10 10:57:30 -07:00
Andrew Seigner	5ece3430eb	Fix proxy build to build go-deps and set version (#2797 ) The `docker-build-proxy` script builds `Dockerfile-proxy`. That Dockerfile depends on a go-deps image, and takes a `LINKERD_VERSION` arg. The `docker-build-proxy` script was neither ensuring go-deps had been built, nor setting `LINKERD_VERSION`. The former resulted in the build failing if go-deps did not exist. The latter resulted in `dev-undefined` log messages in the `linkerd-proxy` container. Fix `docker-build-proxy` to ensure go-deps are built, and also set the `LINKERD_VERSION`. This brings this script more in-line with the other `docker-build-*` scripts. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-05-07 13:17:18 +02:00
Oliver Gould	3b729ec458	proxy: Update to linkerd/linkerd2-proxy#5018026 (#2777 ) commit 073a1beb4a7cd709c6b1eaa56a319c1829a94d11 Author: Sean McArthur <sean@buoyant.io> Date: Mon Apr 29 17:54:01 2019 -0700 tap: remove need to clone Services (#238) This refactors the tap system to not require intermediary channels to register matches and taps when a request comes through. The Dispatcher that used to exist in order to prevent tapping more requests than the limit asked for has been removed. In its place is a shared atomic counter to keep the count under the limit. The resulting behavior should be the same. There should be improved performance as tap registration doesn't need go through a second channel, and requests don't need to be delayed waiting for the dispatcher to be able to process its queue. Signed-off-by: Sean McArthur <sean@buoyant.io> commit 7a3be8c8737188e5debbc465f9a33da0d79b8b80 Author: Zahari Dichev <zaharidichev@gmail.com> Date: Wed May 1 01:57:01 2019 +0300 Replace fixed reconnect backoff with exponential one (#237) When reconnecting to a destination, use an exponential, jittered backoff strategy. Signed-off-by: Zahari Dichev <zaharidichev@gmail.com> commit 32b813aad4fe2fcf0252e8c2215d6835101d2337 Author: Oliver Gould <ver@buoyant.io> Date: Tue Apr 30 15:58:20 2019 -0700 Support endpoint weights (#230) This change modifies the proxy to honor weights provided by the destination service. When the destination service replies with a weight, this value is divided by 10,000 to produce a weight on [0.0, ~400000.0]. This weight is used by load the load balancer to modify load interpretation and therefore request distribution. A weight of 0.0 will cause the endpoint's load to be effectively infinite so that requests will only be sent to the endpoint when no other endpoints exists or when the other endpoints that were considered had 0-weights. commit 501802671a346250b6dbaae73f29d9be7a4c2086 Author: Sean McArthur <sean@buoyant.io> Date: Wed May 1 13:42:38 2019 -0700 Remove buffers from endpoint stacks (#239) Due to the `http::settings::router`, a `buffer` was needed in each endpoint stack. This meant that the service was always ready, even if the client were falling over (and reconnecting). In turn, this meant that the balancer would pick one of these endpoint stacks, because it was always ready! This change includes a test of a failing endpoint, that the balancer no longer assumes it is ready, and has the following functional changes: - Removed `http::settings::router`, instead the client HTTP settings are detected as part of the `DstAddr`. This means that each balancer only has endpoints with the same HTTP settings. - Removed `buffer` layer from inside the endpoint stacks. Signed-off-by: Sean McArthur <sean@buoyant.io>	2019-05-01 15:00:47 -07:00
Oliver Gould	9ffe8b5966	docker-build: Build the proxy container first (#2769 ) When developing on the proxy, it's convenient to build the proxy while the linkerd2 image is building at a given tag; but because the proxy is built last, it's difficult to build the proxy at the same tag simultaneously. This is made easier by building the proxy first so that the parallel build can be initiated after this. This shouldn't impact other development workflows.	2019-04-29 16:01:31 -07:00
Oliver Gould	bd4aa58e50	proxy: Upgrade the proxy for tower updates (#2758 ) commit 61db2e77a247f7b0235b67581f60e8a92f8543cb Author: Sean McArthur <sean@seanmonstar.com> Date: Tue Apr 23 17:20:43 2019 -0700 Replace linkerd2-stack with tower-layer (#236) Signed-off-by: Sean McArthur <sean@buoyant.io> commit 2d6c7145cadf709832f3507bcefdaee509ebde81 Author: Sean McArthur <sean@seanmonstar.com> Date: Thu Apr 18 12:40:48 2019 -0700 Add load shedding when over max-in-flight requests. (#225) Also adds configuration for inbound and outbound max-in-flight requests. Signed-off-by: Sean McArthur <sean@buoyant.io> commit f4b5cd0b4a25d7d942e018b42af1157ae2e7dbb0 Author: Oliver Gould <ver@buoyant.io> Date: Wed Apr 17 13:53:49 2019 -0700 Upgrade tower (#232) This avails the proxy of newer load balancer features, an updated buffer implementation, etc. The new buffer implementation requires that we implement TypedExecutor for our logging executor; and more error types have been made dynamic.	2019-04-26 08:58:24 -05:00
Alejandro Pedraza	53bb7c47f6	Make the auto-injector required and removed proxy-auto-inject flag (#2733 ) Make the auto-injector required and removed proxy-auto-inject flag Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-04-24 13:06:51 -05:00
Dennis Adjei-Baah	3e5917f7e0	Add the ability to inject a debug sidecar (#2726 ) Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-04-22 16:53:12 -07:00
Ivan Sim	1c0f147718	Integration test for the 'upgrade' command (#2679 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-04-11 19:37:50 -07:00
Oliver Gould	c8a7c0f57f	Update proxy to fix a connection starvation issue (#2689 ) In https://github.com/linkerd/linkerd2-proxy/pull/233, we fixed an issue in the proxy where, when the proxy performed TLS discovery (on inbound connections), detection on a slow or idle connection could block all other connections from being accepted on the listener. Fixes #2581 #2585 #2630	2019-04-11 13:02:06 -07:00
Carol A. Scott	24fa7dd70b	Adding documentation to bin/web --help (#2673 ) Adds documentation for the new dashboard integration tests to bin/web --help.	2019-04-09 10:58:12 -07:00
Carol A. Scott	d4e955f805	Updating webdriverio libraries (#2665 ) Updates the WebdriverIO libraries used in the front-end integration tests so that officially-supported libraries are used where possible.	2019-04-08 13:19:50 -07:00
Kevin Leimkuhler	10f8c786c7	proxy: Bump proxy for edge-19.4.2 (#2654 ) This bump pulls in: * New proxy tests Signed-off-by: Kevin Leimkuhler <kevinl@buoyant.io>	2019-04-05 15:50:19 -07:00
Kevin Leimkuhler	1f2401c7a3	proxy: Bump pinned version to f2d907b (#2609 ) * proxy: Bump pinned version to f2d907b This change picks up: * Added configuration for overriding the connection backoff * Added configuration for overriding the HTTP/2 stream or connection window size * Disable potentially info-leaking header Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2019-04-01 21:31:16 -07:00
Andrew Seigner	38f504beb1	Introduce test-scale script (#2578 ) Introduce a `bin/test-scale` script to deploy Linkerd alongside sample apps at scale. This script deploys the following: - Linkerd control-plane, with service profiles - 5 namespaces x 5 replicas of each: - Emojivoto demo app - Books demo app, with service profiles - Lifecycle / bb test environment Fixes #2517 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-01 12:51:53 -07:00
Andrew Seigner	b454f8fbc1	Introduce auto inject integration tests (#2595 ) The integration tests were not exercising proxy auto inject. Introduce a `--proxy-auto-inject` flag to `install_test.go`, which now exercises install, check, and smoke test deploy for both manual and auto injected use cases. Part of #2569 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-04-01 10:32:56 -07:00
Andrew Seigner	48ddde2146	Introduce script to test multiple cloud providers (#2592 ) Introduce a `bin/test-clouds` and cleanup script, to run integration tests against 4 cloud providers. Also modify the integration tests to accept a `--context` param to specify the Kubernetes context to run the tests against. Fixes #2516 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-29 16:22:30 -07:00
Carol A. Scott	0251f50fa4	Adding local and cloud integration testing for dashboard (#2586 ) Adds local and cloud integration testing for the dashboard using WebdriverIO and SauceLabs. Includes documentation on how to set up and run the Sauce Connect proxy locally. Adds a `bin/web integration` script that takes `local` or `cloud` arguments to run the tests. Note: for web development, the web server launched by `bin/web run` and `bin/web dev` is now 7777, not 8084, because the Sauce Connect proxy can only tunnel to certain ports.	2019-03-29 15:48:00 -07:00
Alex Leong	63996e8b8a	Bump proxy version (#2539 ) Picks up the following proxy change: * Add a oneshot to notify the profiles daemon if the stream is dropped Signed-off-by: Alex Leong <alex@buoyant.io>	2019-03-21 15:17:52 -07:00
Thomas Rampelberg	4eb89bb8c2	Stop background processes on failure (#2478 ) * Stop background processes on failure * Exit successfully * Move trap into dev only * Move install linkerd up * Fold dev into run	2019-03-20 10:25:36 -07:00
Oliver Gould	91c5f07650	proxy: Upgrade to identity-capable proxy (#2524 ) The new proxy has changed its configuration as follows: - `LISTENER` urls are now `LISTEN_ADDR` addresses; - `CONTROL_URL` is now `DESTINATION_SVC_ADDR`; - `_NAMESPACE` vars are no longer needed; - The `PROXY_ID` is now the `DESTINATION_CONTEXT`; - The "metrics" port is now the "admin" port, since it serves more than just metrics; - A readiness probe now checks a dedicated /ready endpoint eagerly. Identity injection is NOT* configured by this branch.	2019-03-19 14:20:39 -07:00
Oliver Gould	81f645da66	Remove `--tls=optional` and `linkerd-ca` (#2515 ) The proxy's TLS implementation has changed to use a new _Identity_ controller. In preparation for this, the `--tls=optional` CLI flag has been removed from install and inject; and the `ca` controller has been deleted. Metrics and UI treatments for TLS have not been removed, as they will continue to be valuable for the new Identity system. With the removal of the old identity scheme, the Destination service's proxy ID field is now set with an opaque string (e.g. `ns:emojivoto`) to enable locality awareness.	2019-03-18 17:40:31 -07:00
Kevin Lingerfelt	e862e98d1a	Bump proxy to 4ed4dcc (#2494 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-03-13 16:57:07 -07:00
Andrew Seigner	155c063348	Faster test cleanup (#2492 ) `bin/test-cleanup` takes 48s on ci. This change sets `kubectl --wait=false`, so the command should return immediately rather than waiting for resources to be fully deleted. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-13 10:07:26 -07:00
Andrew Seigner	d4fdbe4991	Fix web init to not check for ServiceProfiles (#2470 ) linkerd/linkerd2#2428 modified SelfSubjectAccessReview behavior to no longer paper-over failed ServiceProfile checks, assuming that ServiceProfiles will be required going forward. There was a lingering ServiceProfile check in the web's startup that started failing due to this change, as the web component does not have (and should not need) ServiceProfile access. The check was originally implemented to inform the web component whether to expect "single namespace" mode or ServiceProfile support. Modify the web's initialization to always expect ServiceProfile support. Also remove single namespace integration test Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-03-07 15:20:46 -08:00
Kevin Leimkuhler	4fba211b98	proxy: Bump pinned version to 6d10dd6 (#2448 ) This picks up the following: * [dc00685](https://github.com/linkerd/linkerd2-proxy/commit/dc00685) Increase inbound/outbound router capacity * [6d10dd6](https://github.com/linkerd/linkerd2-proxy/commit/6d10dd6) Set `l5d-remote-ip` on inbound requests and outbound responses Signed-off-by: Kevin Leimkuhler <kevinl@buoyant.io>	2019-03-05 15:09:59 -08:00
Eliza Weisman	9c0537c318	Signed-off-by: Eliza Weisman <eliza@buoyant.io> (#2410 ) proxy: bump pinned version to 7e55196 This picks up the following commit: * 7e55196 Bump tower-grpc (linkerd/linkerd2-proxy#202) The new `tower-grpc` version (tower-rs/tower-grpc#115) improves the messages attached to internal gRPC issues. This will aid significantly in debugging the proxy's gRPC communication with the control plane.	2019-02-27 14:17:17 -08:00
Ivan Sim	c5b905281c	Proxy: bump pinned version to 0fe8063 (#2406 ) This picks up the following commits: * 0fe8063 replace `Error::cause` with `Error::source` (#2370) (linkerd/linkerd2-proxy#201) * 1ea7559 Minor cleanup in the config tests (linkerd/linkerd2-proxy#188) * d0ef56b Update ring to 0.14.6 (linkerd/linkerd2-proxy#197) * c54377f fs-watch: Use a properly sized buffer for inotify events (linkerd/linkerd2-proxy#195) * 23e02a6 Update Router to wait for inner poll_ready before calling inner call * 2de8e9b Update metrics quickcheck to 0.8, and hyper to 0.12.24 * d1bbd4b make: Optionally include debug symbols with builds (linkerd/linkerd2-proxy#193) * 738a541 Fix compilation warnings in fs-watch (linkerd/linkerd2-proxy#192) * 6cc7558 Apply rustfmt (linkerd/linkerd2-proxy#191) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-02-27 12:55:01 -08:00
Andrew Seigner	48e161f012	Revert CRD deletion in integration test-cleanup (#2399 ) linkerd/linkerd#2349 introduced ServiceProfile CRD deletion to `bin/test-cleanup`. Unfortunately that CRD is cluster-wide and shared across any Linkerd's currently installed. Revert CRD deletion. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-26 16:37:17 -08:00
Andrew Seigner	ec5a0ca8d9	Authorization-aware control-plane components (#2349 ) The control-plane components relied on a `--single-namespace` param, passed from `linkerd install` into each individual component, to determine which namespaces they were authorized to access, and whether to support ServiceProfiles. This command-line flag was redundant given the authorization rules encoded in the parent `linkerd install` output, via [Cluster]Role[Binding]s. Modify the control-plane components to query Kubernetes at startup to determine which namespaces they are authorized to access, and whether ServiceProfile support is available. This allows removal of the `--single-namespace` flag on the components. Also update `bin/test-cleanup` to cleanup the ServiceProfile CRD. TODO: - Remove `--single-namespace` flag on `linkerd install`, part of #2164 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-26 11:54:52 -08:00
Andrew Seigner	6ef33e8955	Add note about brew dependency in `build-cli-bin` (#2381 ) Homebrew/homebrew-core#36957 introduces a brew formula for the linkerd cli. It depends on `bin/build-cli-bin` to build a local linkerd cli binary. This change adds a note to `bin/build-cli-bin`, to consider brew when making changes to that script. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-25 16:08:32 -08:00
Andrew Seigner	43d29d629e	Bump base Docker images (#2241 ) - `debian:jessie-slim` -> `stretch-20190204-slim` - `golang:1.10.3` -> `1.11.5` - `gcr.io/linkerd-io/base:2017-10-30.01` -> `2019-02-19.01` - bump `golangci-lint` to 1.15.0 - use `GOCACHE` in travis Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-22 15:59:18 -08:00
Andrew Seigner	31f5181492	Make test-cleanup delete clusterrole[binding]s (#2343 ) The `bin/test-cleanup` script was correctly deleting all namespaces created by `bin/test-run`, but was leaving behind clusterroles and clusterrolebindings, defined cluster-wide. Update `test-cleanup` to delete clusterroles and clusterrolebindings created by `test-run`. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-21 11:01:15 -08:00
Andrew Seigner	6a1ca2cc95	Fix build-cli-bin to use generated templates (#2341 ) The `bin/build-cli-bin` script, intended to build a local `linkerd` cli binary, was compiling the binary configured to read template files out of the local machine's GOPATH. This change modifies `build-cli-bin` to build a `linkerd` binary the same way `docker-build-cli-bin` does. Specifically, by generating static template files for inclusion in the build, and adding the `-tags prod` flag to ensure those files are compiled in. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-20 19:02:52 -08:00
Ivan Sim	9084615710	CLI install/inject config protobuf (#2291 ) Define the global and proxy configs protobuf types that will be used by CLI install, inject and the proxy-injector. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-02-19 12:28:30 -08:00
Andrew Seigner	044e0a5bb4	Fix golangci-lint config to use default golint (#2284 ) golangci-lint disables some checks for golint, including checks for well-formed comments on all exported symbols This change disables the golangci-lint's `exclude-use-default` setting, to run golint with default settings. Also introduce a `.golangci.yml` file to centralize config. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-14 13:55:30 -08:00
Alejandro Pedraza	0c4039a671	Add integration tests for single-namespace mode (#2247 ) Add integration tests for single-namespace mode Fixes #2127 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2019-02-14 09:19:11 -05:00
Andrew Seigner	2305974202	Introduce golangci-lint tooling, fixes (#2239 ) `golangci-lint` performs numerous checks on Go code, including golint, ineffassign, govet, and gofmt. This change modifies `bin/lint` to use `golangci-lint`, and replaces usage of golint and govet. Also perform a one-time gofmt cleanup: - `gofmt -s -w controller/` - `gofmt -s -w pkg/` Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-13 11:16:28 -08:00
Oliver Gould	8a8ee649c5	proxy: Log canonicalization warnings on only the first error (#2250 ) commit 59d00f69653730353ec246b8cb2eb39d80a54d3e Author: Oliver Gould <ver@buoyant.io> Date: Mon Feb 11 10:51:37 2019 -0800 Log canonicalization warnings on only the first error (#189) When a canonicalization task fails to resolve a name, our logging is not particularly clear about the current state of the stack. Specifically, it's difficult to know whether the stack has resolved the name successfully before. With this change, canonicalization failures are logged (at warning, not error) only when the task has not previously resolved a name. Subsequent errors are now logged at the debug level (instead of warning).	2019-02-11 12:52:09 -08:00
Andrew Seigner	72812baf99	Introduce Discovery API and endpoints command (#2195 ) The Proxy API service lacked introspection of its internal state. Introduce a new gRPC Discovery API, implemented by two servers: 1) Proxy API Server: returns a snapshot of discovery state 2) Public API Server: pass-through to the Proxy API Server Also wire up a new `linkerd endpoints` command. Fixes #2165 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-07 14:02:21 -08:00
Kevin Leimkuhler	9cca1df3b6	Proxy: bump pinned version to 7add4fc (#2225 ) * Remove destination address from endpoint metric labels (linkerd/linkerd2#187) * Set proxy_id in calls to Get and GetProfile (linkerd/linkerd2#183) * Add l5d-client-id on inbound requests if meshed TLS (linkerd/linkerd2#184) Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2019-02-07 12:17:51 -08:00
Oliver Gould	44e31f0f67	Configure proxy keepalives via the environment (#2193 ) In linkerd/linkerd2-proxy#186, the proxy supports configuration of TCP keepalive values. This change sets `LINKERD2_PROXY_INBOUND_ACCEPT_KEEPALIVE` and `LINKERD2_PROXY_OUTBOUND_CONNECT_KEEPALIVE` to 10s when injecting the proxy, so that remote connections are configured with a keepalive. This configuration is NOT yet exposed through the CLI. This may be done in a followup, if necessary. Fixes #1949	2019-02-04 16:16:43 -08:00

1 2 3 4 5 ...

331 Commits