linkerd2

Commit Graph

Author	SHA1	Message	Date
zak	8c413ca38b	Wire up stats commands for daemonsets (#2006 ) (#2086 ) DaemonSet stats are not currently shown in the cli stat command, web ui or grafana dashboard. This commit adds daemonset support for stat. Update stat command's help message to reference daemonsets. Update the public-api to support stats for daemonsets. Add tests for stat summary and api. Add daemonset get/list/watch permissions to the linkerd-controller cluster role that's created using the install command. Update golden expectation test files for install command yaml manifest output. Update web UI with daemonsets Update navigation, overview and pages to list daemonsets and the pods associated to them. Add daemonset paths to server, and ui apps. Add grafana dashboard for daemonsets; a clone of the deployment dashboard. Update dependencies and dockerfile hashes Add DaemonSet support to tap and top commands Fixes of #2006 Signed-off-by: Zak Knill <zrjknill@gmail.com>	2019-01-24 14:34:13 -08:00
Andrew Seigner	2217a6cd2b	Add cli command to install linkerd service profiles (#2099 ) This introduces a `linkerd install-sp` command to install service profiles into the linkerd control plane. It installs one service profile for each of controller-api, proxy-api, prometheus, and grafana. Fixes #1901 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-24 11:34:19 -08:00
Andrew Seigner	c9ac77cd7c	Introduce version consistency checks (#2130 ) Version checks were not validating that the cli version matched the control plane or data plane versions. Add checks via the `linkerd check` command to validate the cli is running the same version as the control and data plane. Also add types around `channel-version` string parsing and matching. A consequence being that during development `version.Version` changes from `undefined` to `dev-undefined`. Fixes #2076 Depends on linkerd/website#101 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-23 16:54:43 -08:00
Alena Varkockova	28f662c9c6	Introduce resource selector and deprecate namespace field for ListPods (#2025 ) * Introduce resource selector and deprecate namespace field for ListPods * Changes from code review * Properly deprecate the field * Do not check for nil * Fix the mockProm usage * Protoc changes revert * Changed from code review Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-23 10:35:55 -08:00
Andrew Seigner	93322bed6d	Fix Unicode and color output for Windows (#2114 ) The default font in Windows console did not support the Unicode characters recently added to check and inject commands. Also the color library the linkerd cli depends on was not being used in a cross-platform way. Replace the existing Unicode characters used in `check` and `inject` with characters available in most fonts, including Windows console. Similarly replace the spinner used in `check` with one that uses characters available in most fonts. Modify `check` and `inject` to use `color.Output` and `color.Error`, which wrap `os.Stdout` and `os.Stderr`, and perform special tranformations when on Windows. Add a `--no-color` option to `linkerd logs`. While stern uses the same color library that `check`/`inject` use, it is not yet using the `color.Output` API for Windows support. That issue is tracked at: https://github.com/wercker/stern/issues/69 Relates to https://github.com/linkerd/linkerd2/pull/2087 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-23 09:55:44 -08:00
Alejandro Pedraza	eacc09b7ba	Follow up to #2054 : move magic strings into constants (#2122 ) Follow up to #2054: move magic strings into constants Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-22 14:53:24 -05:00
Alejandro Pedraza	a5628780f7	Fix dupe word in example (#2121 ) Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-22 13:43:31 -05:00
Kevin Lingerfelt	1989e28d8f	Add newline separator above uninject reports (#2104 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-17 11:12:54 -08:00
Alejandro Pedraza	bd1955c317	When injecting, perform an uninject as a first step (#2089 ) * When injecting, perform an uninject as a first step Fixes #1970 The fixture `inject_emojivoto_already_injected.input.yml` is no longer rejected, so I created the corresponding golden file. Note that we'll still forbid injection over resources already injected with third party meshes (Istio, Contour), so now we have `HasExisting3rdPartySidecars()` to detect that. * Generalize HasExistingSidecars() to cater for both the auto-injector and * Convert `linkerd uninject` result format to the one used in `linkerd inject`. * More updates to the uninject reports. Revert changes to the HasExistingSidecars func. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-17 10:15:23 -08:00
Dennis Adjei-Baah	f9cd9366d9	Surface logs from control plane pods (#2037 ) When debugging control plane issues or issues pertaining to a linkerd proxy, it can be cumbersome to get logs from affected containers quickly. This PR adds a new `logs` command to the Linkerd CLI to surface log lines from any container within linkerd's control plane. This feature relies heavily on [stern](https://github.com/wercker/stern), which already provides this behavior. This PR integrates this package into the Linkerd CLI to allow users to quickly retrieve logs whenever they run into issues when using Linkerd. Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-01-16 19:24:42 -08:00
Alex Leong	a562f8b9fd	Improve routes command to list all routes (#2066 ) Fixes #1875 This change improves the `linkerd routes` command in a number of important ways: * The restriction on the type of the `--to` argument is lifted and any resource type can now be used. Try `--to ns/books`, `--to po/webapp-ABCDEF`, `--to au/linkerd.io`, or even `--to svc`. * All routes for the target will now be populated in the table, even if there are no Prometheus metrics for that route. * [UNKNOWN] has been renamed to [DEFAULT] * The `Service/Authority` column will now list `Service` in all cases except for when an authority target is explicitly requested. ``` $ linkerd routes deploy/traffic --to deploy/webapp ROUTE SERVICE SUCCESS RPS LATENCY_P50 LATENCY_P95 LATENCY_P99 GET / webapp 100.00% 0.5rps 50ms 180ms 196ms GET /authors/{id} webapp 100.00% 0.5rps 100ms 900ms 980ms GET /books/{id} webapp 100.00% 0.9rps 38ms 93ms 99ms POST /authors webapp 100.00% 0.5rps 35ms 48ms 50ms POST /authors/{id}/delete webapp 100.00% 0.5rps 83ms 180ms 196ms POST /authors/{id}/edit webapp 0.00% 0.0rps 0ms 0ms 0ms POST /books webapp 45.16% 2.1rps 75ms 425ms 485ms POST /books/{id}/delete webapp 100.00% 0.5rps 30ms 90ms 98ms POST /books/{id}/edit webapp 56.00% 0.8rps 92ms 875ms 975ms [DEFAULT] webapp 0.00% 0.0rps 0ms 0ms 0ms ``` This is all made possible by a shift in the way we handle the destination resource. When we get a request with a `ToResource`, we use the k8s API to find all Services which include at least one pod belonging to that resource. We then fetch all service profiles for those services and display the routes from those serivce profiles. This shift in thinking also precipitates a change in the TopRoutes API where we no longer need special cases for `ToAll` (which can be specified by `--to au`) or `ToAuthority` (which can be specified by `--to au/<authority>`) and instead can use a `ToResource` to handle all cases. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-01-16 17:15:35 -08:00
Andrew Seigner	92f2cd9b63	Update check and inject output (#2087 ) The outputs of the `check` and `inject` commands did not vary much between successful and failed executions, and were a bit verbose and challenging to parse. Reorganize output of `check` and `inject` commands, to provide more output when errors occur, and less output when successful. Specific changes: `linkerd check` - visually group checks by category - introduce `hintURL`'s, to provide doc links when checks fail - add spinners when retrying, remove additional retry lines - colored unicode characters to indicate success/warning/failure `linkerd inject` - modify default output to mirror `kubectl apply` - only output non-successful inject reports - support `--verbose` flag to output all inject reports Fixes #1471, #1653, #1656, #1739 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-16 15:14:14 -08:00
Alex Leong	771542dde2	Add support for retries (#2038 )	2019-01-16 14:13:48 -08:00
Alex Leong	3398e9391a	Fix panic when swagger has no paths (#2092 ) Fixes #2059 Fix a panic in linkerd profile --open-api when the swagger spec has no Paths field. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-01-16 13:39:51 -08:00
Andrew Seigner	dacd8819ff	Group checkers by category (#2083 ) The linkerd check command organized the various checks via loosely coupled category IDs, category names, and checkers themselves, all with ordering defined by consumers of this code. This change removes category IDs in favor of category names, groups all checkers by category, and enforces ordering at the HealthChecker level. Part of #1471, depends on #2078. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-14 18:01:32 -08:00
Andrew Seigner	04373414ef	Modify all health checks to be specified via enums (#2078 ) The set of health checks to be executed were dependent on a combination of check enums and boolean options. This change modifies the health checks to be governed strictly by a set of enums. Next steps: - tightly couple category IDs to names - tightly couple checks to their parent categories - programmatic control over check ordering Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-14 17:16:15 -08:00
Kevin Lingerfelt	6a3f4a1a99	Leave dashboard command running when open URL fails (#2081 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-14 16:46:11 -08:00
Alejandro Pedraza	d358605269	Added `linkerd uninject` command (#2054 ) Add `linkerd uninject` command uninject.go iterates through the resources annotations, labels, initContainers and Containers, removing what we know was injected by linkerd. The biggest part of this commit is the refactoring of inject.go, to make it more generic and reusable by uninject. The idea is that in a following PR this functionality will get reused by `linkerd inject` to uninject as as preliminary step to injection, as a solution to #1970. This was tested successfully on emojivoto with: ``` 1) inject: kubectl get -n emojivoto deployment -o yaml \| bin/linkerd inject - \| kubectl apply -f - 2) uninject: kubectl get -n emojivoto deployment -o yaml \| bin/linkerd uninject - \| kubectl apply -f - ``` Also created unit tests for uninject.go. The fixture files from the inject tests could be reused. But as now the input files act as outputs, they represent existing resources and required these changes (that didn't affect inject): - Rearranged fields in alphabetical order. - Added fields that are only relevant for existing resources (e.g. creationTimestamp and status.replicas in StatefulSets) Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-14 17:39:31 -05:00
Kevin Lingerfelt	ed3fbd75f3	Setup port-forwarding for linkerd dashboard command (#2052 ) * Setup port-forwarding for linkerd dashboard command * Output port-forward logs when --verbose flag is set Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-10 10:16:08 -08:00
Alena Varkockova	172398292d	Add validation to CRD for service profiles (#2024 ) * Add validation to CRD for service profiles Signed-off-by: Alena Varkockova <varkockova.a@gmail.com> * Use properties instead of oneof Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-10 07:04:40 -08:00
Alejandro Pedraza	281ba37e6d	More granular control on checks made by CLI commands (#2033 ) Have the CLI commands `get`, `routes`, `stat`, `tap`and `top` perform a more limited set of checks Fixes #1854	2019-01-10 09:13:44 -05:00
Kevin Lingerfelt	a27bb2e0ce	Proxy grafana requests through web service (#2039 ) * Proxy grafana requests through web service * Fix -grafana-addr default, clarify -api-addr flag * Fix version check in grafana dashboards * Fix comment typo Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-04 16:07:57 -08:00
Alejandro Pedraza	8639a8753f	Fix reporting of injected resources (#2031 ) * Fix reporting of injected resources When reporting resources provided as a list, it was picking just one item from the list and ignoring the rest. Also improved test for injecting list of resources. Fixes #1676 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-03 15:14:11 -05:00
Andrew Seigner	a91c77d0bf	Followups from lint/comment changes (#2032 ) This is a followup branch from #2023: - delete `proxy/client.go`, move code to `destination-client` - move `RenderTapEvent` and stat functions from `util` to `cmd` Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-02 15:28:09 -08:00
Alena Varkockova	ef02cd6828	Disallow both pre and proxy flags for check (#2026 ) Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-02 13:36:04 -08:00
Thomas Rampelberg	612ebe2b81	Add rules_file loading into prometheus (#1966 )	2018-12-20 11:47:13 -08:00
Radu M	07cbfe2725	Fix most golint issues that are not comment related (#1982 ) Signed-off-by: Radu Matei <radu@radu-matei.com>	2018-12-20 10:37:47 -08:00
Alex Leong	0a87ebcd5d	Add --routes flag to linkerd top (#1952 ) Fixes #1910 Add a `--routes` flag to the `top` command which show requests by route instead of by path. This also includes an overhaul of the live table rendering code to be more flexible and better accommodate the various combinations of columns that may be visible or hidden. Co-authored-by: Andrew Seigner <siggy@buoyant.io> Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-18 11:25:34 -08:00
Alex Leong	cb3fa1245b	Remove TLS column from routes command output (#1956 ) Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-14 21:52:49 -08:00
Kevin Lingerfelt	b16f4da1be	Re-add RBAC permission to access installed namespace in single ns mode (#1987 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-13 15:32:53 -08:00
Kevin Lingerfelt	0866bb2a41	Remove runAsGroup field from security context settings (#1986 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-13 15:12:13 -08:00
Kevin Lingerfelt	86e95b7ad3	Disable serivce profiles in single-namespace mode (#1980 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-13 14:37:18 -08:00
Cody Vandermyn	d847f66ec5	Create new service accounts for linkerd-web and linkerd-grafana. Chan… (#1978 ) * Create new service accounts for linkerd-web and linkerd-grafana. Change 'serviceAccount:' to 'serviceAccountName:' * Use dynamic namespace name Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2018-12-12 18:10:50 -08:00
Cody Vandermyn	aa5e5f42eb	Use an emptyDir for Prometheus and Grafana (#1971 ) * Allow input of a volume name for prometheus and grafana * Make Prometheus and Grafana volume names 'data' by default and disallow user editing via cli flags * Remove volume name from options Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2018-12-12 15:54:03 -08:00
Kevin Lingerfelt	fd44896644	Remove namespace definition from --single-namespace installs (#1974 ) * Remove namespace definition from --single-namespace installs * DRY up code in healthcheck.go Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-12 14:53:02 -08:00
Cody Vandermyn	8e4d9d2ef6	add securityContext with runAsUser: {{.ControllerUID}} to the various cont… (#1929 ) * add securityContext with runAsUser: {{.ProxyUID}} to the various containers in the install template * Update golden to reflect new additions * changed to a different user id than the proxy user id * Added a controller-uid install option * change the port that the proxy-injector runs * The initContainers needs to be run as the root user. * move security contexts to container level Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2018-12-11 11:51:28 -08:00
Alejandro Pedraza	8c67bfbcc6	Add parameter to stats API to skip retrieving Prometheus stats (#1871 ) * Add parameter to stats API to skip retrieving Prometheus stats Used by the dashboard to populate list of resources. Fixes #1022 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Prometheus queries check results were being ignored * Refactor verifyPromQueries() to also test when no prometheus queries should be generated * Add test for SkipStats=true Includes adding ability to public.GenStatSummaryResponse to not generate basicStats * Fix previous test	2018-12-10 16:48:12 -08:00
Andrew Seigner	bef9479f57	Add input validation for profile command (#1934 ) Fixes #1878 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-05 15:13:10 -08:00
Alex Leong	7169eaef27	Stop routes with the same name from different services from clobbering each other (#1936 ) If the `linkerd routes` command gets two routes with the same name, it will only display one of them, even if the routes are from different services. This is particularly obvious with the default `[UNKNOWN]` route. We now display all routes, even if they have the same name. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 15:05:19 -08:00
Alex Leong	cbb196066f	Support service profiles for external authorities (#1928 ) Add support for service profiles created on external (non-service) authorities. For example, this allows you to create a service profile named `linkerd.io` which will apply to calls made to `linkerd.io`. This is done by changing the `LINKERD2_PROXY_DESTINATION_PROFILE_SUFFIXES` to `.` so that the proxy will attempt to lookup a service profile for any authority. We provide the `--disable-external-profiles` proxy flag to revert this behavior in case it is a problem. We also refactor the proxy-api implementation of GetProfiles so that it does the profile lookup, regardless of if the authority looks like a Kubernetes service name or not. To simplify this, support for multiple resolves (which was unused) was removed. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 14:32:59 -08:00
Oliver Gould	12ec5cf922	install: Add a -disable-h2-upgrade flag (#1926 ) The proxy-api service _always_ suggests that two meshed pods communicate via HTTP/2 (i.e. via transparent protocol upgrading, if necessary). This can complicate debugging and diagnostics at times, so it's important that we have a way to deploy linkerd without this auto-upgrade behavior. This change adds a `-disable-h2-upgrade` flag to the `linkerd install` command that disables transparent upgrading for the whole cluster.	2018-12-05 12:50:47 -08:00
Alex Leong	380ec52a39	Rework routes command to accept any resource (#1921 ) We rework the routes command so that it can accept any Kubernetes resource, making it act much more similarly to the stat command. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 11:11:34 -08:00
Alex Leong	4f3e55e937	Rename path to path_regex in ServiceProfile CRD (#1923 ) We rename path to path_regex in the ServiceProfile CRD to make it clear that this field accepts a regular expression. We also take this opportunity to remove unnecessary line anchors from regular expressions now that these anchors are added in the proxy. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 10:42:47 -08:00
Kevin Lingerfelt	37ae423bb3	Add linkerd- prefix to all objects in linkerd install (#1920 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-04 15:41:47 -08:00
Andrew Seigner	ad2366f208	Revert proxy readiness initialDelaySeconds change (#1912 ) Reverts part of #1899 to workaround readiness failures. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-04 14:27:55 -08:00
Risha Mars	e8a39cd17e	Add ability to download a service profile template from the web UI (#1893 ) Adds an endpoint, at /profiles/new that allows you to input a service name and namespace, and download a service profile yaml template. This will enable future work, where we can add more of the yaml customization via a form in the dashboard, and use that data to help the user configure routes.	2018-12-03 16:48:43 -08:00
Andrew Seigner	37a5455445	Add filtering by job in stat, tap, top; fix panic (#1904 ) Filtering by Kubernetes job was not supported. Also filtering by any unknown type caused a panic. Add filtering support by Kubernetes job, with special case mapping `job` to `k8s_job`, to not conflict with Prometheus' job label. Fix panic when unknown type specified as a `--from` or `--to` flag. Fix `job` label from `linkerd-proxy` overwriting Prometheus `job` label at collection time. This caused all metrics collected by proxy sidecars in Kubernetes jobs to be collected into an incorrect Prometheus job, rather than the expected `linkerd-proxy` Prometheus job. Fix `unsupported resource type` tap error message incorrectly printing the target resource rather than the destination. Set `--controller-log-level debug` in `install_test.go` for easier debugging. Expose `slow-cooker`'s metrics via a k8s service in the tap integration test, to validate proxy requests with a job as destination. Fixes #1872 Part of #627 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-03 15:34:49 -08:00
Andrew Seigner	d121071f87	Adjust proxy, Prometheus, and Grafana probes (#1899 ) * Adjust proxy, Prometheus, and Grafana probes High `readinessProbe.initialDelaySeconds` values delayed the controller's readiness by up to 30s, preventing cli commands from succeeding shortly after control plane deployment. Decrease `readinessProbe.initialDelaySeconds` in the proxy, Prometheus, and Grafana to the default 0s. Also change `linkerd check` controller pod ordering to: controller, prometheus, web, grafana. Detailed probe changes: - proxy - decrease `readinessProbe.initialDelaySeconds` from 10s to 0s - prometheus - decrease `readinessProbe.initialDelaySeconds` from 30s to 0s - decrease `readinessProbe.timeoutSeconds` from 30s to 1s - decrease `livenessProbe.timeoutSeconds` from 30s to 1s - grafana - decrease `readinessProbe.initialDelaySeconds` from 30s to 0s - decrease `readinessProbe.timeoutSeconds` from 30s to 1s - decrease `readinessProbe.failureThreshold` from 10 to 3 - increase `livenessProbe.initialDelaySeconds` from 0s to 30s Fixes #1804 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-03 10:41:11 -08:00
Alex Leong	f9d66cf4de	Add --open-api option to linkerd profiles command (#1867 ) The `--open-api` flag is an alternative to the `--template` flag for the `linkerd profile` command. It reads an OpenAPI specification file (also called a swagger file) and uses it to generate a corresponding service profile. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-30 09:25:19 -08:00
Alex Leong	835e34b500	Left-align the routes column and sort by route name (#1879 ) Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-28 09:32:59 -08:00
Ben Lambert	297cb570f2	Added a --ha flag to install CLI (#1852 ) This change allows some advised production config to be applied to the install of the control plane. Currently this runs 3x replicas of the controller and adds some pretty sane requests to each of the components + containers of the control plane. Fixes #1101 Signed-off-by: Ben Lambert <ben@blam.sh>	2018-11-20 23:03:59 -05:00
Alex Leong	7a7f6b6ecb	Add TopRoutes method the the public api and route CLI command to consume it (#1860 ) Add a routes command which displays per-route stats for services that have service profiles defined. This change has three parts: * A new public-api RPC called `TopRoutes` which serves per-route stat data about a service * An implementation of TopRoutes in the public-api service. This implementation reads per-route data from Prometheus. This is very similar to how the StatSummaries RPC and much of the code was able to be refactored and shared. * A new CLI command called `routes` which displays the per-route data in a tabular or json format. This is very similar to the `stat` command and much of the code was able to be refactored and shared. Note that as of the currently targeted proxy version, only outbound route stats are supported so the `--from` flag must be included in order to see data. This restriction will be lifted in an upcoming change once we add support for inbound route stats as well. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-19 12:20:30 -08:00
Alejandro Pedraza	bbcf5a8c9f	Allow stat summary to query for multiple resources (#1841 ) * Refactor util.BuildResource so it can deal with multiple resources First step to address #1487: Allow stat summary to query for multiple resources Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Update the stat cli help text to explain the new multi resource querying ability Propsal for #1487: Allow stat summary to query for multiple resources Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Allow stat summary to query for multiple resources Implement this ability by issuing parallel requests to requestStatsFromAPI() Proposal for #1487 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Update tests as part of multi-resource support in `linkerd stat` (#1487) - Refactor stat_test.go to reuse the same logic in multiple tests, and add cases and files for json output. - Add a couple of cases to api_utils_test.go to test multiple resources validation. Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * `linkerd stat` called with multiple resources should keep an ordering (#1487) Add SortedRes holding the order of resources to be followed when querying `linkerd stat` with multiple resources Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Extra validations for `linkerd stat` with multiple resources (#1487) Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * `linkerd stat` resource grouping, ordering and name prefixing (#1487) - Group together stats per resource type. - When more than one resource, prepend name with type. - Make sure tables always appear in the same order. Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Allow `linkerd stat` to be called with multiple resources A few final refactorings as per code review. Fixes #1487 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2018-11-14 10:44:04 -08:00
Alex Leong	32d556e732	Improve ergonomics of service profile spec (#1828 ) We make several changes to the service profile spec to make service profiles more ergonomic and to make them more consistent with the destination profile API. * Allow multiple fields to be simultaneously set on a RequestMatch or ResponseMatch condition. Doing so is equivalent to combining the fields with an "all" condition. * Rename "responses" to "response_classes" * Change "IsSuccess" to "is_failure" Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-31 12:00:22 -07:00
Alex Leong	d8b5ebaa6d	Remove the proxy-api container (#1813 ) A container called `proxy-api` runs in the Linkerd2 controller pod. This container listens on port 8086 and serves the proxy-api but does nothing other than forward gRPC requests to the destination container which listens on port 8089. We remove the proxy-api container altogether and change the destination container to listen on port 8086 instead of 8089. The result is that clients still use the proxy-api by connecting to `proxy-api.<ns>.svc.cluster.local:8086` but the controller has one fewer containers. This results in a simpler system that is easier to reason about. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-29 16:31:43 -07:00
Alex Leong	82ca821e62	Use fqdn for service profile name (#1808 ) Service profiles must be named in the form `"<service>.<namespace>"`. This is inconsistent with the fully normalized domain name that the proxy sends to the controller. It also does not permit creating service profiles for non-Kubernetes services. We switch to requiring that service profiles must be named with the FQDN of their service. For Kubernetes services, this is `"<service>.<namespace>.svc.cluster.local"`. This change alone is not sufficient for allowing service profile for non-Kubernetes services because the k8s resolver will ignore any DNS names which are not Kubernetes services. Further refactoring of the resolver will be required to allow looking up non-Kubernetes service profiles in Kuberenetes. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-29 14:35:42 -07:00
Alex Leong	6cffad277b	Make service profile validation a warning instead of an error (#1807 ) The existence of an invalid service profile causes `linkerd check` to fail. This means that it is not possible to open the Linkerd dashboard with the `linkerd dashboard` command. While service profile validation is useful, it should not lock users out. Add the ability to designate health checks as warnings. A failed warning health check will display a warning output in `linkerd check` but will not affect the overall success of the command. Switch the service profile validation to be a warning. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-26 13:28:10 -07:00
Alex Leong	652ca161ef	Add linkerd profile --template command (#1773 ) Add a new CLI command: `linkerd profile --template` which outputs a sample service profile yaml. Users can edit this sample and then `kubectl apply` it to add a service profile. The sample serves as "documentation by example" of what service profiles may contain. Example usage: ```bash linkerd profile -n emojivoto --template web-svc > web-svc-profile.yaml # edit web-svc-profile.yaml in your favorite editor kubectl apply -f web-svc-profile.yaml ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-19 13:34:54 -07:00
Alena Varkockova	87b2773930	Fix the validation of docker registry, improve the error message (#1780 ) Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-10-18 15:36:28 -07:00
Alex Leong	43c22fe967	Implement getProfiles method in destination service (#1759 ) We implement the getProfiles method in the destination service. This method returns a stream of destination profiles for a given authority. It does this by looking up the ServiceProfile resource in the controller namespace named `<svc>.<ns>` where `<svc>` is the name of the service and `<ns>` is the namespace of the service. This PR includes: * Adding a ServiceProfile Custom Resource Definition to linkerd install * A watch based implementation of the getProfiles method in the destination service, similar to the implementation of get. * An update to the destination client script that allows querying the getProfiles method. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-16 15:39:12 -07:00
Kevin Lingerfelt	e9874b9c3e	Improve docker layer caching for web image (#1757 ) * Improve docker layer caching for web image * Move all web files to /linkerd dir Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-10-16 11:10:52 -07:00
Alejandro Pedraza	37bc8a69db	Added support for json output in `linkerd stat` (#1749 ) Added support for json output in `linkerd stat` through a new (-o\|--output)=json option. Fixes #1417 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2018-10-15 14:10:48 -07:00
Alejandro Pedraza	2d6fde274c	Make room for columns in `linkerd top` (#1750 ) * Make room for columns in `linkerd top` Make room for columns in `linkerd top`. Columns with data longer than some predetermined minimum length were stepping over each other. Proposal for #1728 * Removed unneeded truncations Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2018-10-11 13:08:46 -07:00
Kevin Lingerfelt	46c887ca00	Add --single-namespace install flag for restricted permissions (#1721 ) * Add --single-namespace install flag for restricted permissions * Better formatting in install template * Mark --single-namespace and --proxy-auto-inject as experimental * Fix wording of --single-namespace check flag * Small healthcheck refactor Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-10-11 10:55:57 -07:00
Ivan Sim	4fba6aca0a	Proxy init and sidecar containers auto-injection (#1714 ) * Support auto sidecar-injection 1. Add proxy-injector deployment spec to cli/install/template.go 2. Inject the Linkerd CA bundle into the MutatingWebhookConfiguration during the webhook's start-up process. 3. Add a new handler to the CA controller to create a new secret for the webhook when a new MutatingWebhookConfiguration is created. 4. Declare a config map to store the proxy and proxy-init container specs used during the auto-inject process. 5. Ignore namespace and pods that are labeled with linkerd.io/auto-inject: disabled or linkerd.io/auto-inject: completed 6. Add new flag to `linkerd install` to enable/disable proxy auto-injection Proposed implementation for #561. * Resolve missing packages errors * Move the auto-inject label to the pod level * PR review items * Move proxy-injector to its own deployment * Ignore pods that already have proxy injected This ensures the webhook doesn't error out due to proxy that are injected using the command * PR review items on creating/updating the MWC on-start * Replace API calls to ConfigMap with file reads * Fixed post-rebase broken tests * Don't mutate the auto-inject label Since we started using healhcheck.HasExistingSidecars() to ensure pods with existing proxies aren't mutated, we don't need to use the auto-inject label as an indicator. This resolves a bug which happens with the kubectl run command where the deployment is also assigned the auto-inject label. The mutation causes the pod auto-inject label to not match the deployment label, causing kubectl run to fail. * Tidy up unit tests * Include proxy resource requests in sidecar config map * Fixes to broken YAML in CLI install config The ignore inbound and outbound ports are changed to string type to avoid broken YAML caused by the string conversion in the uint slice. Also, parameterized the proxy bind timeout option in template.go. Renamed the sidecar config map to 'linkerd-proxy-injector-webhook-config'. Signed-off-by: ihcsim <ihcsim@gmail.com>	2018-10-10 12:09:22 -07:00
Darko Radisic	6fee0f3c2b	Added --context flag to specify the context to use to talk to the Kubernetes apiserver (#1743 ) * Added --context flag to specify the context to use to talk to the Kubernetes apiserver * Fix tests that are failing * Updated context flag description Signed-off-by: Darko Radisic <ffd2subroutine@users.noreply.github.com>	2018-10-08 12:37:35 -07:00
Ben Lambert	69cebae1a2	Added ability to configure sidecar CPU + Memory requests (#1731 ) Horizontal Pod Autoscaling does not work when container definitions in pods do not all have resource requests, so here's the ability to add CPU + Memory requests to install + inject commands by proving proxy options --proxy-cpu + --proxy-memory Fixes #1480 Signed-off-by: Ben Lambert <ben@blam.sh>	2018-10-08 10:51:29 -07:00
Oliver Gould	eaec37c64f	cli: Use updated proxy config environment vars In linkerd/linkerd2-proxy#99, several proxy configuration variables were deprecated. This change updates the CLI to use the updated names to avoid deprecation warnings during startup.	2018-10-03 11:15:39 -07:00
Risha Mars	427078844d	Add HTTP method to Top display in the CLI and Web UI (#1709 ) * Add Method column to Top tables in Web UI * Add method to CLI top table	2018-09-25 11:15:11 -07:00
Rodrigo Chacon	783bb1c3a7	cli: add support for LINKERD_NAMESPACE environment variable (#1695 ) Signed-off-by: Rodrigo Chacon <rochacon@gmail.com>	2018-09-21 17:24:10 -07:00
Alena Varkockova	8ab9b4981b	Make wait flag configurable for check and dashboard (#1654 ) Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-09-19 10:42:29 -07:00
Kevin Lingerfelt	f1b3827194	Bump default check retry time to 5 minutes (#1645 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-14 10:58:03 -07:00
Alena Varkockova	169dcf4e70	Make wait=true a default option for check and dashboard (#1640 ) * Remove wait option and make it a default for check * Switch the wait default to true * Wait by default also for dashboard Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-09-14 09:59:04 -07:00
Andrew Seigner	5d85680ec1	Introduce inject check for known sidecars (#1619 ) `linkerd inject` was not checking its input for known sidecars and initContainers. Modify `linkerd inject` to check for existing sidecars and initContainers, specifically, Linkerd, Istio, and Contour. Part of #1516 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-11 15:09:19 -07:00
Andrew Seigner	bae05410fd	Bump Prometheus to v2.4.0, Grafana to 5.2.4 (#1625 ) Prometheus v2.3.1 -> v2.4.0 Grafana 5.1.3 -> 5.2.4 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-11 14:45:55 -07:00
Alex Leong	bd15482329	Add with-source flag to top (#1614 ) Fixes #1593 Add a `--hide-sources` flag to `linkerd top`. Setting this removes the source column from the output. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-09-11 14:21:36 -07:00
Andrew Seigner	7eec5f181d	Inject warns on UDP ports (#1617 ) linkerd only routes TCP data, but `linkerd inject` does not warn when it injects into pods with ports set to `protocol: UDP`. Modify `linkerd inject` to warn when injected into a pod with `protocol: UDP`. The Linkerd sidecar will still be injected, but the stderr output will include a warning. Also add stderr checking on all inject unit tests. Part of #1516. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-11 10:12:45 -07:00
Andrew Seigner	c5a719da47	Modify inject to warn when file is un-injectable (#1603 ) If an input file is un-injectable, existing inject behavior is to simply output a copy of the input. Introduce a report, printed to stderr, that communicates the end state of the inject command. Currently this includes checking for hostNetwork and unsupported resources. Malformed YAML documents will continue to cause no YAML output, and return error code 1. This change also modifies integration tests to handle stdout and stderr separately. example outputs... some pods injected, none with host networking: ``` hostNetwork: pods do not use host networking...............................[ok] supported: at least one resource injected..................................[ok] Summary: 4 of 8 YAML document(s) injected deploy/emoji deploy/voting deploy/web deploy/vote-bot ``` some pods injected, one host networking: ``` hostNetwork: pods do not use host networking...............................[warn] -- deploy/vote-bot uses "hostNetwork: true" supported: at least one resource injected..................................[ok] Summary: 3 of 8 YAML document(s) injected deploy/emoji deploy/voting deploy/web ``` no pods injected: ``` hostNetwork: pods do not use host networking...............................[warn] -- deploy/emoji, deploy/voting, deploy/web, deploy/vote-bot use "hostNetwork: true" supported: at least one resource injected..................................[warn] -- no supported objects found Summary: 0 of 8 YAML document(s) injected ``` TODO: check for UDP and other init containers Part of #1516 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-10 10:34:25 -07:00
Kevin Lingerfelt	b5ff29c8aa	Add data plane check to validate proxy version (#1574 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-04 15:22:38 -07:00
Kevin Lingerfelt	c7a79da89c	Add data plane check to validate proxies are ready (#1570 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-31 15:51:57 -07:00
Alex Leong	0f7d684ca9	Increase default max-rps for tap and top (#1531 ) The default value for the max-rps argument to the tap and top commands is an overly conservative 1rps. This causes the data to come in very slowly and much data to be discarded. Furthermore, because tap requests are windowed to 10 seconds, this causes long pauses between updates. We fix this in two ways. Firstly we reduce the window size to 1s so that updates will come in at least once per second, even when the actual RPS of the data path is extremely high. Secondly, we increase the default max-rps parameter from 1 to 100. This allows tap to paint an accurate picture of the data much more quickly and sidesteps some sampling bias that happens when the max-rps is low. In general, tap events tend to happen in bursts. For example, one request in may trigger one or more requests out. Likewise, a single upstream event may trigger several requests to the tapped pod in quick succession. Sampling bias will occur when the max-rps is less than the actual rps and when the tap event limit subdivides these event bursts (biasing towards the first few events in the burst). The greater the max-rps, the less the effects of this bias. Fixes #1525 Signed-off-by: Alex Leong <alex@buoyant.io>	2018-08-28 14:16:39 -07:00
Risha Mars	136b9cc7c1	Add linkerd check flag to run data plane checks (#1528 ) Adds a --proxy flag to the linkerd check CLI command which will run to-be-implemented data plane checks	2018-08-28 10:16:24 -07:00
Risha Mars	fff09c5d06	Only tap pods that are meshed (#1535 ) Previously, we would tap any resource's pods, regardless of whether the pods were meshed or not. We can't actually tap non-meshed pods, so I'm adding a check that will filter out non-meshed pods from the pods that tap watches. Previous behaviour: When attempting to hang a non meshed pod, it would establish a watch on the pods, but then never return any results. In the CLI you could just cancel it with Ctrl-C. In the web, clicking Stop would send a WebSocket.close(1000) but wouldn't actually close the connection... Behaviour after change : If no pods under the specified resource are meshed, it'll return an error of no pods being found to tap	2018-08-28 09:59:52 -07:00
Risha Mars	27e52a6cc0	Add ReadinessProbe and LivenessProbe to injected proxy containers (#1530 ) Adds basic probes to the linkerd-proxy containers injected by linkerd inject. - Currently the Readiness and Liveness probes are configured to be the same. - I haven't supplied a periodSeconds, but the default is 10. - I also set the initialDelaySeconds to 10, but that might be a bit high. https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-probes/	2018-08-27 11:55:17 -07:00
Kevin Lingerfelt	4450a7536d	Add --wait flag for CLI check and dashboard commands (#1503 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-22 12:56:42 -07:00
Kevin Lingerfelt	49f6c4c770	Refactor healthcheck init and observe setup (#1502 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-22 12:30:45 -07:00
Kevin Lingerfelt	53cd3b50d5	Add --pre flag for linkerd check command (#1497 ) * Add --pre flag for linkerd check command * Small adjustments to check help text Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-20 17:09:43 -07:00
Kevin Lingerfelt	e97be1f5da	Move all healthcheck-related code to pkg/healthcheck (#1492 ) * Move all healthcheck-related code to pkg/healthcheck * Fix failed check formatting * Better version check wording Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-20 16:50:22 -07:00
Eliza Weisman	b8434d60d4	Add resource metadata to Tap CLI output (#1437 ) Closes #1170. This branch adds a `-o wide` (or `--output wide`) flag to the Tap CLI. Passing this flag adds `src_res` and `dst_res` elements to the Tap output, as described in #1170. These use the metadata labels in the tap event to describe what Kubernetes resource the source and destination peers belong to, based on what resource type is being tapped, and fall back to pods if either peer is not a member of the specified resource type. In addition, when the resource type is not `namespace`, `src_ns` and `dst_ns` elements are added, which show what namespaces the the source and destination peers are in. For peers which are not in the Kubernetes cluster, none of these labels are displayed. The source metadata added in #1434 is used to populate the `src_res` and `src_ns` fields. Also, this branch includes some refactoring to how tap output is formatted. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-08-20 14:25:26 -07:00
Alex Leong	094a375015	[RFC] linkerd top (#1435 ) This an initial implementation of the `linkerd top` command. This command launches an ncurses style tabular view of current requests (using data from tap). Most of the command line arguments are the same as tap and allow selecting the resource to inspect and filtering which requests to view. Fixes #1283 Signed-off-by: Alex Leong <alex@buoyant.io>	2018-08-15 18:10:23 -07:00
Kevin Lingerfelt	00a0572098	Better CLI error messages when control plane is unavailable (#1428 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-09 15:40:41 -07:00
Eliza Weisman	9d8f58cb16	Add additional validation for stat command-line arguments (#1415 ) Closes #776. This branch adds the following validation to the `linkerd stat` command: * The `--to` and `--from` flags are now mutually exclusive * The `--to-namespace` and `--from-namespace` commands are also mutually exclusive. * The `namespace` resource type conflicts with the `--namespace`, `--to-namespace`, and `--from-namespace` flags. Examples: ``` $ bin/go-run cli/main.go stat deploy --to deploy/foo --from deploy/bar Error: --to and --from flags are mutually exclusive Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat deploy --to-namespace foo --from-namespace bar Error: --to-namespace and --from-namespace flags are mutually exclusive Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat namespace foo --namespace bar Error: --namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat ns --to-namespace bar Error: --to-namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat namespace --from-namespace bar Error: --from-namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat ns/foo --from-namespace bar Error: --from-namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-08-08 15:35:47 -07:00
Kevin Lingerfelt	82940990e9	Rename mailing lists, remove all remaining conduit references (#1416 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 17:00:55 -07:00
Kevin Lingerfelt	4845b4ec04	Restore linkerd.io/control-plane* labels (#1411 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 13:53:29 -07:00
Kevin Lingerfelt	e0a01c5dd8	Remove node scrape target, kubernetes grafana dashboard (#1410 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 13:41:38 -07:00
Kevin Lingerfelt	bd19e8aaff	Update prometheus to only scrape proxies in the same mesh (#1402 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-06 12:05:55 -07:00
Eliza Weisman	01cc30d102	Increase outbound router capacity for Prometheus pod's proxy (#1358 ) Currently, when a cluster has over 100 pods injected with the Linkerd2 proxy, Prometheus metrics are not collected correctly. This is because Prometheus appears to be making more concurrent requests than its' proxy's outbound router cache can handle See issue #1322 for further details. This branch introduces a workaround for this issue, by increasing the outbound router cache capacity to 10000 routes for the Prometheus pod's proxy only. The router capacity limit of 100 active routes is primarily due to the limitation of the number of active Destination service lookups, so increasing the capacity for the Prometheus pod specifically is probably okay, as the scrape requests are made to IP addresses directly and therefore will not cause service discovery lookups. This change was originally implemented and tested in @siggy's PR #1228. I've rebased his branch onto the current `master`, and updated the code to reflect the project name change. Signed-off-by: Eliza Weisman <eliza@buoyant.io> Co-authored-by: Andrew Seigner <siggy@buoyant.io>	2018-08-02 16:44:11 -07:00
Ivan Sim	eb04217a12	Update inject cmd to read from folder (#1377 ) This change is a simplified implementation of the Builder.Path() and Visitor().ExpandPathsToFileVisitors() functions used by kubectl to parse files and directories. The filepath.Walk() function is used to recursively traverse directories. Every .yaml or .json resource file in the directory is read into its own io.Reader. All the readers are then passed to the YAMLDecoder in the InjectYAML() function. Fixes #1376 Signed-off-by: ihcsim <ihcsim@gmail.com>	2018-08-01 17:12:00 -07:00
Kevin Lingerfelt	8fe9e53f67	Remove remaining conduit references in codebase (#1381 ) * Remove remaining conduit references in codebase * Shorten emojivoto config url Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-31 11:19:34 -07:00
Kevin Lingerfelt	51848230a0	Send glog logs to stderr by default (#1367 ) * Send glog logs to stderr by default * Factor out more shared flag parsing code Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-25 12:59:24 -07:00

1 2 3 4 5 ...

309 Commits