linkerd2

Commit Graph

Author	SHA1	Message	Date
Alejandro Pedraza	ddf2e729ac	Injection consolidation (#2334 ) - Created the pkg/inject package to hold the new injection shared lib. - Extracted from `/cli/cmd/inject.go` and `/cli/cmd/inject_util.go` the core methods doing the workload parsing and injection, and moved them into `/pkg/inject/inject.go`. The CLI files should now deal only with strictly CLI concerns, and applying the json patch returned by the new lib. - Proceeded analogously with `/cli/cmd/uninject.go` and `/pkg/inject/uninject.go`. - The `InjectReport` struct and helping methods were moved into `/pkg/inject/report.go` - Refactored webhook to use the new injection lib - Removed linkerd-proxy-injector-sidecar-config ConfigMap - Added the ability to add pod labels and annotations without having to specify the already existing ones Fixes #1748, #2289 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2019-03-05 08:38:56 -05:00
Ivan Sim	28fb725901	Revert "Mark HA as non-experimental (#2430 )" (#2436 ) This reverts commit `ec51ccf317`. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-03-04 08:27:43 -08:00
Aditya Sharma	ec51ccf317	Mark HA as non-experimental (#2430 ) Closes #2419 Signed-off-by: Aditya Sharma <hello@adi.run>	2019-03-02 23:02:20 -08:00
Tarun Pothulapati	2184928813	Wire up stats for Jobs (#2416 ) Support for Jobs in stat/tap/top cli commands Part of #2007 Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-03-01 17:16:54 -08:00
Andrew Seigner	d08dcb0a37	Skip outbound port 443 in control-plane (#2411 ) linkerd/linkerd2#2349 introduced a `SelfSubjectAccessReview` check at startup, to determine whether each control-plane component should establish Kubernetes watches cluster-wide or namespace-wide. If this check occurs before the linkerd-proxy sidecar is ready, it fails, and the control-plane component restarts. This change configures each control-plane pod to skip outbound port 443 when injecting the proxy, allowing the control-plane to connect to Kubernetes regardless of the `linkerd-proxy` state. A longer-term fix should involve a more robust control-plane startup, that is resilient to failed Kubernetes API requests. An even longer-term fix could involve injecting `linkerd-proxy` as a Kubernetes "sidecar" container, when that becomes available. Workaround for #2407 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-27 15:23:19 -08:00
Kevin Lingerfelt	40076c4de2	Remove namespace from serviceprofile CRD in install config (#2409 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-27 14:29:47 -08:00
Oliver Gould	ab90263461	destination: Only return TLS identities when appropriate (#2371 ) As described in #2217, the controller returns TLS identities for results even when the destination pod may not be able to participate in identity requester: specifically, the other pod may not have the same controller namespace or it may not be injected with identity. This change introduces a new annotation, linkerd.io/identity-mode that is set when injecting pods (via both CLI and webhook). This annotation is always added. The destination service now only returns TLS identities when this annotation is set to optional on a pod and the destination pod uses the same controller. These semantics are expected to change before the 2.3 release. Fixes #2217	2019-02-27 12:18:39 -08:00
Andrew Seigner	9f748d2d2e	lint: Enable unparam (#2369 ) unparam reports unused function parameters: https://github.com/mvdan/unparam Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-27 10:34:02 -08:00
Andrew Seigner	ec5a0ca8d9	Authorization-aware control-plane components (#2349 ) The control-plane components relied on a `--single-namespace` param, passed from `linkerd install` into each individual component, to determine which namespaces they were authorized to access, and whether to support ServiceProfiles. This command-line flag was redundant given the authorization rules encoded in the parent `linkerd install` output, via [Cluster]Role[Binding]s. Modify the control-plane components to query Kubernetes at startup to determine which namespaces they are authorized to access, and whether ServiceProfile support is available. This allows removal of the `--single-namespace` flag on the components. Also update `bin/test-cleanup` to cleanup the ServiceProfile CRD. TODO: - Remove `--single-namespace` flag on `linkerd install`, part of #2164 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-26 11:54:52 -08:00
Andrew Seigner	25e462352d	lint: Enable goimports (#2366 ) goimports checks import lines, adding missing ones and removing unreferenced ones: https://godoc.org/golang.org/x/tools/cmd/goimports It also requires named imports for packages whose import paths don't match their package names: - https://github.com/golang/go/issues/28428 - https://go-review.googlesource.com/c/tools/+/145699/ Also standardized named imports of common Kubernetes packaages. Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-25 15:51:10 -08:00
Oliver Gould	9e67bcb1bc	inject: Refactor report-checking from inject logic (#2379 ) The inject logic combines the modification of a pod spec and the creation of a "report" detailing problems with the pod spec. This change extracts the report-creation-and-checking logic from the injection logic to make the contracts of each of these functions clearer. No functional changes are intended.	2019-02-25 14:18:22 -08:00
Andrew Seigner	35a0b652f2	lint: Enable goconst (#2365 ) goconst finds repeated strings that could be replaced by a constant: https://github.com/jgautheron/goconst Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-25 12:00:03 -08:00
Risha Mars	80b6e41d5d	Modify StatSummary to also return TCP stats (#2262 ) Adds a flag, tcp_stats to the StatSummary request, which queries prometheus for TCP stats. This branch returns TCP stats at /api/tps-reports when this flag is true. TCP stats are now displayed on the Resource Detail pages. The current queried TCP stats are: tcp_open_connections tcp_read_bytes_total tcp_write_bytes_total	2019-02-25 10:37:39 -08:00
Oliver Gould	f7435800da	lint: Enable scopelint (#2364 ) [scopelint][scopelint] detects a nasty reference-scoping issue in loops. [scopelint]: https://github.com/kyoh86/scopelint	2019-02-24 08:59:51 -08:00
Andrew Seigner	cc3ff70f29	Enable `unused` linter (#2357 ) `unused` checks Go code for unused constants, variables, functions, and types. Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-23 11:05:39 -08:00
Andrew Seigner	4b6f6aeedd	Enable gosimple linter, fix issues (#2356 ) gosimple is a Go linter that specializes in simplifying code Also fix one spelling error in `cred_test.go` Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-22 17:19:07 -08:00
Ivan Sim	1e2e2bf53c	Install the Linkerd global and proxy config maps (#2344 ) Also, some protobuf updates: * Rename `api_port` to match recent changes in CLI code. * Remove the `cni` message because it won't be used. * Remove `registry` field from proto types. This helps to avoid having to workaround edge cases like fully-qualified image name in different format, and overriding user-specified Linkerd version etc. Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-02-22 15:28:21 -08:00
TwiN	5dcb9086f5	Implement resource limit (#2307 ) Add options in CLI for setting proxy CPU and memory limits - Deprecated `proxy-cpu` and `proxy-memory` in favor of `proxy-cpu-limit` and `proxy-memory-limit` - Updated validations and tests to reflect new options Signed-off-by: TwinProduction <twin@twinnation.org>	2019-02-20 15:07:10 -05:00
Oliver Gould	fc82e2caf5	test: Support the -update and -pretty-diff flags (#2315 ) When changing templates, it's can be pretty time-intensive to repair all test fixtures. This change instruments CLI tests with two flags, `-update` and `-pretty-diff` that control how test fixtures are diffed. When the `-update` flag is set, the tests fixtures are overwritten as tests execute. The `-pretty-diff` flag causes the full text of the fixture to be printed on mismatch.	2019-02-19 13:06:12 -08:00
Tarun Pothulapati	42b3595f57	Ignoring resources with empty kind field. (#2292 ) Update CLI inject/uninject commands to report warnings when resource `kind` is missing. Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-02-19 11:44:28 -08:00
Kevin Lingerfelt	98c2686c95	CLI help updates: unhide install-cni (#2321 ) * Un-hide linkerd-cni command and flags * Tweak install-cni and install-sp descriptions Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-18 18:13:26 -08:00
Kevin Lingerfelt	5384ca8c97	Add discovery package for managing discovery API (#2317 ) * Add discovery package for managing discovery API * Fix typo in destination server comment Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-18 16:38:04 -08:00
Kevin Lingerfelt	25e46a3e74	CLI help updates: non-experimental auto-inject; unhide install-cni (#2319 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-18 15:32:46 -08:00
Oliver Gould	4ed84f0c0a	Split install template into component-specific files (#2313 ) chart/templates/base.yaml is nearly 800 lines and contains the kubernetes configurations for the marjority of the control plane. Furthermore, its contents are not particularly organized (for example, the prometheus RBAC bindings are in the middle of the controller's configuration). The size and complexity of this file makes it especially daunting to introduce new functionality. In order to make the situation easier to understand and change, this splits base.yaml into several new template files: namespace, controller, serviceprofile, and prometheus, and grafana. The `tls.yaml` template has been renamed `ca.yaml`, since it installs the `linkerd-ca` resources. This change also makes the comments uniform, adding a "header" to each logical component. Fixes #2154	2019-02-18 15:31:17 -08:00
Thomas Rampelberg	12527169b8	Update profile to match other CLI doc outputs (#2318 ) * Update profile to match other CLI doc outputs * Fix typo	2019-02-18 13:47:41 -08:00
Oliver Gould	71ce786dd3	Rename linkerd-proxy-api to linkerd-destination (#2281 ) Up until now, the proxy-api controller service has been the sole service that the proxy communicates with, implementing the majoriry of the API defined in the `linkerd2-proxy-api` repo. But this is about to change: linkerd/linkerd2-proxy-api#25 introduces a new Identity service; and this service must be served outside of the existing proxy-api service in the linkerd-controller deployment (so that it may run under a distinct service account). With this change, the "proxy-api" name becomes less descriptive. It's no longer "the service that serves the API for the proxy," it's "the service that serves the Destination API to the proxy." Therefore, it seems best to bite the bullet and rename this to be the "destination" service (i.e. because it only serves the `io.linkerd.proxy.destination.Destination` service). Co-authored-by: Kevin Lingerfelt <kl@buoyant.io> Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-15 15:11:04 -08:00
Thomas Rampelberg	f9d353ea22	Generate CLI docs for usage by the website (#2296 ) * Generate CLI docs for usage by the website * Update description to match existing commands * Remove global	2019-02-15 13:28:31 -08:00
Tarun Pothulapati	df0357505c	Added ep Alias for endpoints command (#2293 ) Signed-off-by: Tarun Pothulapati <tarunpothulapati@outlook.com>	2019-02-14 17:36:55 -08:00
Alejandro Pedraza	0c50749990	Add validator for proxy log-level (#2256 ) Add validator for proxy log-level Follow-up to #2249 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-02-13 15:33:22 -05:00
Andrew Seigner	a9b9908908	Bump Prometheus to v2.7.1, Grafana to 5.4.3 (#2242 ) Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-13 11:27:15 -08:00
Andrew Seigner	2305974202	Introduce golangci-lint tooling, fixes (#2239 ) `golangci-lint` performs numerous checks on Go code, including golint, ineffassign, govet, and gofmt. This change modifies `bin/lint` to use `golangci-lint`, and replaces usage of golint and govet. Also perform a one-time gofmt cleanup: - `gofmt -s -w controller/` - `gofmt -s -w pkg/` Part of #217 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-13 11:16:28 -08:00
Alejandro Pedraza	4dc204ae51	Honour install's `proxy-log-level` flag when autoinjecting proxies (#2249 ) Fixes #2248 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-02-11 14:23:09 -05:00
Kevin Lingerfelt	26aa771482	Fix auto-inject config when TLS is disabled (#2246 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-11 11:01:18 -08:00
Andrew Seigner	1a60237a94	Update check command hint URLs to new alias (#2245 ) The existing hint URLs printing by `linkerd check` pointed to locations that would change if the linkerd.io website was reorganized. linkerd/website#148 introduces an alias for hint URLs at https://linkerd.io/checks/. This is the corresponding change to update `linkerd check` output. Depends on linkerd/website#148, relates to linkerd/website#146. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-11 11:00:16 -08:00
Ivan Sim	f6e75ec83a	Add statefulsets to the dashboard and CLI (#2234 ) Fixes #1983 Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-02-08 15:37:44 -08:00
Alex Leong	5b054785e5	Read service profiles from client or server namespace instead of control namespace (#2200 ) Fixes #2077 When looking up service profiles, Linkerd always looks for the service profile objects in the Linkerd control namespace. This is limiting because service owners who wish to create service profiles may not have write access to the Linkerd control namespace. Instead, we have the control plane look for the service profile in both the client namespace (as read from the proxy's `proxy_id` field from the GetProfiles request and from the service's namespace. If a service profile exists in both namespaces, the client namespace takes priority. In this way, clients may override the behavior dictated by the service. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-02-07 14:51:43 -08:00
Andrew Seigner	907f01fba6	Improve ServiceProfile validation in linkerd check (#2218 ) The `linkerd check` command was doing limited validation on ServiceProfiles. Make ServiceProfile validation more complete, specifically validate: - types of all fields - presence of required fields - presence of unknown fields - recursive fields Also move all validation code into a new `Validate` function in the profiles package. Validation of field types and required fields is handled via `yaml.UnmarshalStrict` in the `Validate` function. This motivated migrating from github.com/ghodss/yaml to a fork, sigs.k8s.io/yaml. Fixes #2190	2019-02-07 14:35:47 -08:00
Andrew Seigner	72812baf99	Introduce Discovery API and endpoints command (#2195 ) The Proxy API service lacked introspection of its internal state. Introduce a new gRPC Discovery API, implemented by two servers: 1) Proxy API Server: returns a snapshot of discovery state 2) Public API Server: pass-through to the Proxy API Server Also wire up a new `linkerd endpoints` command. Fixes #2165 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-07 14:02:21 -08:00
Cody Vandermyn	74eac764d8	CNI: Removed unnecessary tolerations; added created-by annotation. (#2222 ) Removed unnecessary tolerations; added created-by annotation. Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2019-02-07 13:36:00 -08:00
Risha Mars	e531655d26	Add a --tap flag to the linkerd profile command (#2139 ) Adds the ability to generate a service profile by running a tap for a configurable amount of time, and using the route results from the routes seen during the tap. e.g. `linkerd profile web --tap deploy/web -n emojivoto --tap-duration 2s`	2019-02-06 12:43:16 -08:00
Kevin Lingerfelt	02f128ec5a	Reorganize install/inject config structs (#2202 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-05 12:50:42 -08:00
Alejandro Pedraza	2a7654ce78	Consolidate timeouts for `linkerd check` (#2191 ) Consolidate timeouts for `linkerd check` - Moved the creation of contexts from inside the methods targeted by the checks into a single place in the runCheck() and runCheckRPC() methods where the context is built using a hard-coded timeout of 30 seconds. - k8s' client-go doesn't allow passing along contexts, but it let's us setting the Timeout manually. - Reworded the description for the --wait option. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-02-05 11:38:30 -05:00
Kevin Leimkuhler	66070c26f4	Introduce go generate to embed static templates (#2189 ) # Problem In order to switch Linkerd template rendering to use `.yaml` files, static assets must be bundled in the Go binary for use by `linkerd install`. # Solution The solution should not affect the local development process of building and testing. [vfsgen](https://github.com/shurcooL/vfsgen) generates Go code that statically implements the provided `http.FileSystem`. Paired with `go generate` and Go [build tags](https://golang.org/pkg/go/build/), we can continue to use the template files on disk when developing with no change required. In `!prod` Go builds, the `cli/static/templates.go` file provides a `http.FileSystem` to the local templates. In `prod` Go builds, `go generate ./cli` generates `cli/static/generated_templates.gogen.go` that statically provides the template files. When built with `-tags prod`, the executable will be built with the staticlly generated file instead of the local files. # Validation The binaries were compiled locally with `bin/docker-build`. The binaries were then tested with `bin/test-run (pwd)/target/cli/darwin/linkerd`. All tests passed. No change was required to successfully run `bin/go-run cli install`. No change was required to run `bin/linkerd install`. Fixes #2153 Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2019-02-04 18:09:47 -08:00
Oliver Gould	44e31f0f67	Configure proxy keepalives via the environment (#2193 ) In linkerd/linkerd2-proxy#186, the proxy supports configuration of TCP keepalive values. This change sets `LINKERD2_PROXY_INBOUND_ACCEPT_KEEPALIVE` and `LINKERD2_PROXY_OUTBOUND_CONNECT_KEEPALIVE` to 10s when injecting the proxy, so that remote connections are configured with a keepalive. This configuration is NOT yet exposed through the CLI. This may be done in a followup, if necessary. Fixes #1949	2019-02-04 16:16:43 -08:00
Oliver Gould	4798ad3f44	Use the proper controller identity when configuring pods with TLS (#2196 ) Since `37ae423`, deployments have been prefixed with linkerd-; however the inject logic was not changed to take this into consideration when constructing the controller's identity. This means that the proxy's client to the control plane has been unable to establish TLS'd communcation to the proxy-api. Previously, the proxy would silently fall back to plaintext, but in master this behavior recently changed to be stricter, so this bug will prevent the proxy from connecting to proxy-api in any way.	2019-02-04 14:59:03 -08:00
Ye Ben	f2ba17d366	fix some typos (#2194 ) Signed-off-by: yeya24 <ben.ye@daocloud.io>	2019-02-02 23:03:54 -08:00
Kevin Lingerfelt	4c019c27c1	Add pod spec annotation to disable injection in CLI and auto-injector (#2187 ) * Add pod spec annotation to disable injection in CLI and auto-injector * Remove support for linkerd.io/auto-inject label entirely * Update based on review feedback * Fix issue with finding the namespace of deployments applied to the default ns Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-02-01 16:57:06 -08:00
Cody Vandermyn	a283dabbed	Added flags to allow further configuration of destination cni bin and cni conf directories; fixed up spacing in template. (#2181 ) Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2019-01-31 13:52:39 -08:00
Andrew Seigner	5651e02496	Add hint URLs for all checks (#2159 ) linkerd/website#105 introduced a FAQ page, providing resolutions for all `linkerd check` failures. Update each check to reference its corresponding section in the FAQ. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-30 16:26:57 -08:00
Alejandro Pedraza	fe234cade1	Use `ca.NewCA()` for generating certs and keys for the proxy injector (#2163 ) Use `ca.NewCA()` for generating certs and keys for the proxy injector - Remove from CA controller everything that dealt with the webhook/proxy-injector - Remove no longer needed proxy-injector volumes for 'trust-anchors' and 'webhook-secrets' - Remove from the proxy-injector the retrieval of the trust anchor and secrets - tls flag during install is no longer needed for auto-inject to work Fixes #2095 and fixes #2166 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-30 16:04:33 -05:00
Cody Vandermyn	906c3cbfc5	WIP: CNI Plugin (#2071 ) * Export RootOptions and BuildFirewallConfiguration so that the cni-plugin can use them. * Created the cni-plugin based on istio-cni implementation * Create skeleton files that need to be filled out. * Create the install scripts and finish up plugin to write iptables * Added in an integration test around the install_cni.sh and updated the script to handle the case where it isn't the only plugin. Removed the istio kubernetes.go file in favor of pkg/k8s; initial usage of this package; found and fixed the typo in the ClusterRole and ClusterRoleBinding; found the docker-build-cni-plugin script * Corrected an incorrect name in the docker build file for cni-plugin * Rename linkerd2-cni to linkerd-cni * Fixup Dockerfile and clean up code a bit as well as logging statements. * Update Gopkg.lock after master merge. * Update test file to remove temporary tag. * Fixed the command to run during the test while building up the docker run. * Added attributions to applicable files; in the test file, use a different container for each test scenario and also print the docker logs to stdout when there is an error; * Add the --no-init-container flag to install and inject. This flag will not output the initContainer and will add an annotation assuming that the cni will be used in this case. * Update .travis.yml to build the cni-plugin docker image before running the tests. * Workaround golint warnings. * Create a new command to install the linkerd-cni plugin. * Add the --no-init-container option to linkerd inject * Use the setup ip tables annotation during the proxy auto inject webhook prevent/allow addition of an init container; move cni-plugin tests to the integration-test section of travis * gate the cni-plugin tests with the -integration-tests flag; remove unnecessary deployment .yaml file. * Incorporate PR Cleanup suggestions. * Remove the SetupIPTablesLabel annotation and use config flags and the presence of the init container to determine whether the cni-plugin writes ip tables. * Fix a logic bug in the cni-plugin code that prevented the iptables from being written; Address PR comments; make tests pass. * Update go deps shas * Changed the single file install-cni plugin filename to be .conf vs .conflist; Incorporated latest PR comments around spacing with the new renderer among others. * Fix an issue with renaming .conf to .conflist when needed. * Renamed some of the variables to try to make it more clear what is going on. * Address final PR comments. * Hide cni flags for the time being. Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2019-01-30 11:51:34 -08:00
Alex Leong	872e1bb026	Add --proto flag to linkerd profile command to read protobuf files (#2128 ) Fixes #1425 Signed-off-by: Alex Leong <alex@buoyant.io>	2019-01-25 11:15:20 -08:00
Kevin Leimkuhler	f3325e7d81	Refactor `install`'s `render` output to be helm compatible (#2098 ) # Problem In order to refactor `install` to allow for a more flexible configuration, we should start with the format of the YAML that it renders. Using the Helm YAML format will make it easier add flexible configuration options in the future. Currently, the rendered template that `install` produces does not follow this format. # Solution Use the internals that Helm itself uses to render an inject template that follows the same formatting rules. Helm's `template` cmd provides a good outline of what is needed to make Linkerd's `install` cmd work as if it was a Chart. # Validation There are no new tests, but there may not be anything to test at this stage. This is a WIP PR towards the ultimate goal of `install` allowing a more flexible configuration. However, `install` now uses all the Helm `template` internals and therefore satisfies the needed properties for Helm Charts. Signed-off-by: Kevin Leimkuhler <kevin@kleimkuhler.com>	2019-01-25 10:53:35 -08:00
Eliza Weisman	846975a190	Remove proxy bind timeout from CLIs (#2017 ) This branch removes the `--proxy-bind-timeout` flag from the `linkerd inject` and `linkerd install` CLI commands, and the `LINKERD2_PROXY_BIND_TIMEOUT` environment variable from their output. This is in preparation for removing that timeout from the proxy (as described in #2013). I thought it was prudent to remove this from the CLIs before removing it from the proxy, so we can't create a situation where the CLIs produce output that results in broken proxy containers. Fixes #2013 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2019-01-24 15:34:09 -08:00
zak	8c413ca38b	Wire up stats commands for daemonsets (#2006 ) (#2086 ) DaemonSet stats are not currently shown in the cli stat command, web ui or grafana dashboard. This commit adds daemonset support for stat. Update stat command's help message to reference daemonsets. Update the public-api to support stats for daemonsets. Add tests for stat summary and api. Add daemonset get/list/watch permissions to the linkerd-controller cluster role that's created using the install command. Update golden expectation test files for install command yaml manifest output. Update web UI with daemonsets Update navigation, overview and pages to list daemonsets and the pods associated to them. Add daemonset paths to server, and ui apps. Add grafana dashboard for daemonsets; a clone of the deployment dashboard. Update dependencies and dockerfile hashes Add DaemonSet support to tap and top commands Fixes of #2006 Signed-off-by: Zak Knill <zrjknill@gmail.com>	2019-01-24 14:34:13 -08:00
Andrew Seigner	2217a6cd2b	Add cli command to install linkerd service profiles (#2099 ) This introduces a `linkerd install-sp` command to install service profiles into the linkerd control plane. It installs one service profile for each of controller-api, proxy-api, prometheus, and grafana. Fixes #1901 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-24 11:34:19 -08:00
Andrew Seigner	c9ac77cd7c	Introduce version consistency checks (#2130 ) Version checks were not validating that the cli version matched the control plane or data plane versions. Add checks via the `linkerd check` command to validate the cli is running the same version as the control and data plane. Also add types around `channel-version` string parsing and matching. A consequence being that during development `version.Version` changes from `undefined` to `dev-undefined`. Fixes #2076 Depends on linkerd/website#101 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-23 16:54:43 -08:00
Alena Varkockova	28f662c9c6	Introduce resource selector and deprecate namespace field for ListPods (#2025 ) * Introduce resource selector and deprecate namespace field for ListPods * Changes from code review * Properly deprecate the field * Do not check for nil * Fix the mockProm usage * Protoc changes revert * Changed from code review Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-23 10:35:55 -08:00
Andrew Seigner	93322bed6d	Fix Unicode and color output for Windows (#2114 ) The default font in Windows console did not support the Unicode characters recently added to check and inject commands. Also the color library the linkerd cli depends on was not being used in a cross-platform way. Replace the existing Unicode characters used in `check` and `inject` with characters available in most fonts, including Windows console. Similarly replace the spinner used in `check` with one that uses characters available in most fonts. Modify `check` and `inject` to use `color.Output` and `color.Error`, which wrap `os.Stdout` and `os.Stderr`, and perform special tranformations when on Windows. Add a `--no-color` option to `linkerd logs`. While stern uses the same color library that `check`/`inject` use, it is not yet using the `color.Output` API for Windows support. That issue is tracked at: https://github.com/wercker/stern/issues/69 Relates to https://github.com/linkerd/linkerd2/pull/2087 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-23 09:55:44 -08:00
Alejandro Pedraza	eacc09b7ba	Follow up to #2054 : move magic strings into constants (#2122 ) Follow up to #2054: move magic strings into constants Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-22 14:53:24 -05:00
Alejandro Pedraza	a5628780f7	Fix dupe word in example (#2121 ) Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-22 13:43:31 -05:00
Kevin Lingerfelt	1989e28d8f	Add newline separator above uninject reports (#2104 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-17 11:12:54 -08:00
Alejandro Pedraza	bd1955c317	When injecting, perform an uninject as a first step (#2089 ) * When injecting, perform an uninject as a first step Fixes #1970 The fixture `inject_emojivoto_already_injected.input.yml` is no longer rejected, so I created the corresponding golden file. Note that we'll still forbid injection over resources already injected with third party meshes (Istio, Contour), so now we have `HasExisting3rdPartySidecars()` to detect that. * Generalize HasExistingSidecars() to cater for both the auto-injector and * Convert `linkerd uninject` result format to the one used in `linkerd inject`. * More updates to the uninject reports. Revert changes to the HasExistingSidecars func. Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-17 10:15:23 -08:00
Dennis Adjei-Baah	f9cd9366d9	Surface logs from control plane pods (#2037 ) When debugging control plane issues or issues pertaining to a linkerd proxy, it can be cumbersome to get logs from affected containers quickly. This PR adds a new `logs` command to the Linkerd CLI to surface log lines from any container within linkerd's control plane. This feature relies heavily on [stern](https://github.com/wercker/stern), which already provides this behavior. This PR integrates this package into the Linkerd CLI to allow users to quickly retrieve logs whenever they run into issues when using Linkerd. Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2019-01-16 19:24:42 -08:00
Alex Leong	a562f8b9fd	Improve routes command to list all routes (#2066 ) Fixes #1875 This change improves the `linkerd routes` command in a number of important ways: * The restriction on the type of the `--to` argument is lifted and any resource type can now be used. Try `--to ns/books`, `--to po/webapp-ABCDEF`, `--to au/linkerd.io`, or even `--to svc`. * All routes for the target will now be populated in the table, even if there are no Prometheus metrics for that route. * [UNKNOWN] has been renamed to [DEFAULT] * The `Service/Authority` column will now list `Service` in all cases except for when an authority target is explicitly requested. ``` $ linkerd routes deploy/traffic --to deploy/webapp ROUTE SERVICE SUCCESS RPS LATENCY_P50 LATENCY_P95 LATENCY_P99 GET / webapp 100.00% 0.5rps 50ms 180ms 196ms GET /authors/{id} webapp 100.00% 0.5rps 100ms 900ms 980ms GET /books/{id} webapp 100.00% 0.9rps 38ms 93ms 99ms POST /authors webapp 100.00% 0.5rps 35ms 48ms 50ms POST /authors/{id}/delete webapp 100.00% 0.5rps 83ms 180ms 196ms POST /authors/{id}/edit webapp 0.00% 0.0rps 0ms 0ms 0ms POST /books webapp 45.16% 2.1rps 75ms 425ms 485ms POST /books/{id}/delete webapp 100.00% 0.5rps 30ms 90ms 98ms POST /books/{id}/edit webapp 56.00% 0.8rps 92ms 875ms 975ms [DEFAULT] webapp 0.00% 0.0rps 0ms 0ms 0ms ``` This is all made possible by a shift in the way we handle the destination resource. When we get a request with a `ToResource`, we use the k8s API to find all Services which include at least one pod belonging to that resource. We then fetch all service profiles for those services and display the routes from those serivce profiles. This shift in thinking also precipitates a change in the TopRoutes API where we no longer need special cases for `ToAll` (which can be specified by `--to au`) or `ToAuthority` (which can be specified by `--to au/<authority>`) and instead can use a `ToResource` to handle all cases. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-01-16 17:15:35 -08:00
Andrew Seigner	92f2cd9b63	Update check and inject output (#2087 ) The outputs of the `check` and `inject` commands did not vary much between successful and failed executions, and were a bit verbose and challenging to parse. Reorganize output of `check` and `inject` commands, to provide more output when errors occur, and less output when successful. Specific changes: `linkerd check` - visually group checks by category - introduce `hintURL`'s, to provide doc links when checks fail - add spinners when retrying, remove additional retry lines - colored unicode characters to indicate success/warning/failure `linkerd inject` - modify default output to mirror `kubectl apply` - only output non-successful inject reports - support `--verbose` flag to output all inject reports Fixes #1471, #1653, #1656, #1739 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-16 15:14:14 -08:00
Alex Leong	771542dde2	Add support for retries (#2038 )	2019-01-16 14:13:48 -08:00
Alex Leong	3398e9391a	Fix panic when swagger has no paths (#2092 ) Fixes #2059 Fix a panic in linkerd profile --open-api when the swagger spec has no Paths field. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-01-16 13:39:51 -08:00
Andrew Seigner	dacd8819ff	Group checkers by category (#2083 ) The linkerd check command organized the various checks via loosely coupled category IDs, category names, and checkers themselves, all with ordering defined by consumers of this code. This change removes category IDs in favor of category names, groups all checkers by category, and enforces ordering at the HealthChecker level. Part of #1471, depends on #2078. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-14 18:01:32 -08:00
Andrew Seigner	04373414ef	Modify all health checks to be specified via enums (#2078 ) The set of health checks to be executed were dependent on a combination of check enums and boolean options. This change modifies the health checks to be governed strictly by a set of enums. Next steps: - tightly couple category IDs to names - tightly couple checks to their parent categories - programmatic control over check ordering Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-14 17:16:15 -08:00
Kevin Lingerfelt	6a3f4a1a99	Leave dashboard command running when open URL fails (#2081 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-14 16:46:11 -08:00
Alejandro Pedraza	d358605269	Added `linkerd uninject` command (#2054 ) Add `linkerd uninject` command uninject.go iterates through the resources annotations, labels, initContainers and Containers, removing what we know was injected by linkerd. The biggest part of this commit is the refactoring of inject.go, to make it more generic and reusable by uninject. The idea is that in a following PR this functionality will get reused by `linkerd inject` to uninject as as preliminary step to injection, as a solution to #1970. This was tested successfully on emojivoto with: ``` 1) inject: kubectl get -n emojivoto deployment -o yaml \| bin/linkerd inject - \| kubectl apply -f - 2) uninject: kubectl get -n emojivoto deployment -o yaml \| bin/linkerd uninject - \| kubectl apply -f - ``` Also created unit tests for uninject.go. The fixture files from the inject tests could be reused. But as now the input files act as outputs, they represent existing resources and required these changes (that didn't affect inject): - Rearranged fields in alphabetical order. - Added fields that are only relevant for existing resources (e.g. creationTimestamp and status.replicas in StatefulSets) Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-14 17:39:31 -05:00
Kevin Lingerfelt	ed3fbd75f3	Setup port-forwarding for linkerd dashboard command (#2052 ) * Setup port-forwarding for linkerd dashboard command * Output port-forward logs when --verbose flag is set Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-10 10:16:08 -08:00
Alena Varkockova	172398292d	Add validation to CRD for service profiles (#2024 ) * Add validation to CRD for service profiles Signed-off-by: Alena Varkockova <varkockova.a@gmail.com> * Use properties instead of oneof Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-10 07:04:40 -08:00
Alejandro Pedraza	281ba37e6d	More granular control on checks made by CLI commands (#2033 ) Have the CLI commands `get`, `routes`, `stat`, `tap`and `top` perform a more limited set of checks Fixes #1854	2019-01-10 09:13:44 -05:00
Kevin Lingerfelt	a27bb2e0ce	Proxy grafana requests through web service (#2039 ) * Proxy grafana requests through web service * Fix -grafana-addr default, clarify -api-addr flag * Fix version check in grafana dashboards * Fix comment typo Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-04 16:07:57 -08:00
Alejandro Pedraza	8639a8753f	Fix reporting of injected resources (#2031 ) * Fix reporting of injected resources When reporting resources provided as a list, it was picking just one item from the list and ignoring the rest. Also improved test for injecting list of resources. Fixes #1676 Signed-off-by: Alejandro Pedraza <alejandro@buoyant.io>	2019-01-03 15:14:11 -05:00
Andrew Seigner	a91c77d0bf	Followups from lint/comment changes (#2032 ) This is a followup branch from #2023: - delete `proxy/client.go`, move code to `destination-client` - move `RenderTapEvent` and stat functions from `util` to `cmd` Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-01-02 15:28:09 -08:00
Alena Varkockova	ef02cd6828	Disallow both pre and proxy flags for check (#2026 ) Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-02 13:36:04 -08:00
Thomas Rampelberg	612ebe2b81	Add rules_file loading into prometheus (#1966 )	2018-12-20 11:47:13 -08:00
Radu M	07cbfe2725	Fix most golint issues that are not comment related (#1982 ) Signed-off-by: Radu Matei <radu@radu-matei.com>	2018-12-20 10:37:47 -08:00
Alex Leong	0a87ebcd5d	Add --routes flag to linkerd top (#1952 ) Fixes #1910 Add a `--routes` flag to the `top` command which show requests by route instead of by path. This also includes an overhaul of the live table rendering code to be more flexible and better accommodate the various combinations of columns that may be visible or hidden. Co-authored-by: Andrew Seigner <siggy@buoyant.io> Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-18 11:25:34 -08:00
Alex Leong	cb3fa1245b	Remove TLS column from routes command output (#1956 ) Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-14 21:52:49 -08:00
Kevin Lingerfelt	b16f4da1be	Re-add RBAC permission to access installed namespace in single ns mode (#1987 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-13 15:32:53 -08:00
Kevin Lingerfelt	0866bb2a41	Remove runAsGroup field from security context settings (#1986 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-13 15:12:13 -08:00
Kevin Lingerfelt	86e95b7ad3	Disable serivce profiles in single-namespace mode (#1980 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-13 14:37:18 -08:00
Cody Vandermyn	d847f66ec5	Create new service accounts for linkerd-web and linkerd-grafana. Chan… (#1978 ) * Create new service accounts for linkerd-web and linkerd-grafana. Change 'serviceAccount:' to 'serviceAccountName:' * Use dynamic namespace name Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2018-12-12 18:10:50 -08:00
Cody Vandermyn	aa5e5f42eb	Use an emptyDir for Prometheus and Grafana (#1971 ) * Allow input of a volume name for prometheus and grafana * Make Prometheus and Grafana volume names 'data' by default and disallow user editing via cli flags * Remove volume name from options Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2018-12-12 15:54:03 -08:00
Kevin Lingerfelt	fd44896644	Remove namespace definition from --single-namespace installs (#1974 ) * Remove namespace definition from --single-namespace installs * DRY up code in healthcheck.go Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-12 14:53:02 -08:00
Cody Vandermyn	8e4d9d2ef6	add securityContext with runAsUser: {{.ControllerUID}} to the various cont… (#1929 ) * add securityContext with runAsUser: {{.ProxyUID}} to the various containers in the install template * Update golden to reflect new additions * changed to a different user id than the proxy user id * Added a controller-uid install option * change the port that the proxy-injector runs * The initContainers needs to be run as the root user. * move security contexts to container level Signed-off-by: Cody Vandermyn <cody.vandermyn@nordstrom.com>	2018-12-11 11:51:28 -08:00
Alejandro Pedraza	8c67bfbcc6	Add parameter to stats API to skip retrieving Prometheus stats (#1871 ) * Add parameter to stats API to skip retrieving Prometheus stats Used by the dashboard to populate list of resources. Fixes #1022 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Prometheus queries check results were being ignored * Refactor verifyPromQueries() to also test when no prometheus queries should be generated * Add test for SkipStats=true Includes adding ability to public.GenStatSummaryResponse to not generate basicStats * Fix previous test	2018-12-10 16:48:12 -08:00
Andrew Seigner	bef9479f57	Add input validation for profile command (#1934 ) Fixes #1878 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-05 15:13:10 -08:00
Alex Leong	7169eaef27	Stop routes with the same name from different services from clobbering each other (#1936 ) If the `linkerd routes` command gets two routes with the same name, it will only display one of them, even if the routes are from different services. This is particularly obvious with the default `[UNKNOWN]` route. We now display all routes, even if they have the same name. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 15:05:19 -08:00
Alex Leong	cbb196066f	Support service profiles for external authorities (#1928 ) Add support for service profiles created on external (non-service) authorities. For example, this allows you to create a service profile named `linkerd.io` which will apply to calls made to `linkerd.io`. This is done by changing the `LINKERD2_PROXY_DESTINATION_PROFILE_SUFFIXES` to `.` so that the proxy will attempt to lookup a service profile for any authority. We provide the `--disable-external-profiles` proxy flag to revert this behavior in case it is a problem. We also refactor the proxy-api implementation of GetProfiles so that it does the profile lookup, regardless of if the authority looks like a Kubernetes service name or not. To simplify this, support for multiple resolves (which was unused) was removed. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 14:32:59 -08:00
Oliver Gould	12ec5cf922	install: Add a -disable-h2-upgrade flag (#1926 ) The proxy-api service _always_ suggests that two meshed pods communicate via HTTP/2 (i.e. via transparent protocol upgrading, if necessary). This can complicate debugging and diagnostics at times, so it's important that we have a way to deploy linkerd without this auto-upgrade behavior. This change adds a `-disable-h2-upgrade` flag to the `linkerd install` command that disables transparent upgrading for the whole cluster.	2018-12-05 12:50:47 -08:00
Alex Leong	380ec52a39	Rework routes command to accept any resource (#1921 ) We rework the routes command so that it can accept any Kubernetes resource, making it act much more similarly to the stat command. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 11:11:34 -08:00
Alex Leong	4f3e55e937	Rename path to path_regex in ServiceProfile CRD (#1923 ) We rename path to path_regex in the ServiceProfile CRD to make it clear that this field accepts a regular expression. We also take this opportunity to remove unnecessary line anchors from regular expressions now that these anchors are added in the proxy. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 10:42:47 -08:00
Kevin Lingerfelt	37ae423bb3	Add linkerd- prefix to all objects in linkerd install (#1920 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-04 15:41:47 -08:00
Andrew Seigner	ad2366f208	Revert proxy readiness initialDelaySeconds change (#1912 ) Reverts part of #1899 to workaround readiness failures. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-04 14:27:55 -08:00
Risha Mars	e8a39cd17e	Add ability to download a service profile template from the web UI (#1893 ) Adds an endpoint, at /profiles/new that allows you to input a service name and namespace, and download a service profile yaml template. This will enable future work, where we can add more of the yaml customization via a form in the dashboard, and use that data to help the user configure routes.	2018-12-03 16:48:43 -08:00
Andrew Seigner	37a5455445	Add filtering by job in stat, tap, top; fix panic (#1904 ) Filtering by Kubernetes job was not supported. Also filtering by any unknown type caused a panic. Add filtering support by Kubernetes job, with special case mapping `job` to `k8s_job`, to not conflict with Prometheus' job label. Fix panic when unknown type specified as a `--from` or `--to` flag. Fix `job` label from `linkerd-proxy` overwriting Prometheus `job` label at collection time. This caused all metrics collected by proxy sidecars in Kubernetes jobs to be collected into an incorrect Prometheus job, rather than the expected `linkerd-proxy` Prometheus job. Fix `unsupported resource type` tap error message incorrectly printing the target resource rather than the destination. Set `--controller-log-level debug` in `install_test.go` for easier debugging. Expose `slow-cooker`'s metrics via a k8s service in the tap integration test, to validate proxy requests with a job as destination. Fixes #1872 Part of #627 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-03 15:34:49 -08:00
Andrew Seigner	d121071f87	Adjust proxy, Prometheus, and Grafana probes (#1899 ) * Adjust proxy, Prometheus, and Grafana probes High `readinessProbe.initialDelaySeconds` values delayed the controller's readiness by up to 30s, preventing cli commands from succeeding shortly after control plane deployment. Decrease `readinessProbe.initialDelaySeconds` in the proxy, Prometheus, and Grafana to the default 0s. Also change `linkerd check` controller pod ordering to: controller, prometheus, web, grafana. Detailed probe changes: - proxy - decrease `readinessProbe.initialDelaySeconds` from 10s to 0s - prometheus - decrease `readinessProbe.initialDelaySeconds` from 30s to 0s - decrease `readinessProbe.timeoutSeconds` from 30s to 1s - decrease `livenessProbe.timeoutSeconds` from 30s to 1s - grafana - decrease `readinessProbe.initialDelaySeconds` from 30s to 0s - decrease `readinessProbe.timeoutSeconds` from 30s to 1s - decrease `readinessProbe.failureThreshold` from 10 to 3 - increase `livenessProbe.initialDelaySeconds` from 0s to 30s Fixes #1804 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-12-03 10:41:11 -08:00
Alex Leong	f9d66cf4de	Add --open-api option to linkerd profiles command (#1867 ) The `--open-api` flag is an alternative to the `--template` flag for the `linkerd profile` command. It reads an OpenAPI specification file (also called a swagger file) and uses it to generate a corresponding service profile. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-30 09:25:19 -08:00
Alex Leong	835e34b500	Left-align the routes column and sort by route name (#1879 ) Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-28 09:32:59 -08:00
Ben Lambert	297cb570f2	Added a --ha flag to install CLI (#1852 ) This change allows some advised production config to be applied to the install of the control plane. Currently this runs 3x replicas of the controller and adds some pretty sane requests to each of the components + containers of the control plane. Fixes #1101 Signed-off-by: Ben Lambert <ben@blam.sh>	2018-11-20 23:03:59 -05:00
Alex Leong	7a7f6b6ecb	Add TopRoutes method the the public api and route CLI command to consume it (#1860 ) Add a routes command which displays per-route stats for services that have service profiles defined. This change has three parts: * A new public-api RPC called `TopRoutes` which serves per-route stat data about a service * An implementation of TopRoutes in the public-api service. This implementation reads per-route data from Prometheus. This is very similar to how the StatSummaries RPC and much of the code was able to be refactored and shared. * A new CLI command called `routes` which displays the per-route data in a tabular or json format. This is very similar to the `stat` command and much of the code was able to be refactored and shared. Note that as of the currently targeted proxy version, only outbound route stats are supported so the `--from` flag must be included in order to see data. This restriction will be lifted in an upcoming change once we add support for inbound route stats as well. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-19 12:20:30 -08:00
Alejandro Pedraza	bbcf5a8c9f	Allow stat summary to query for multiple resources (#1841 ) * Refactor util.BuildResource so it can deal with multiple resources First step to address #1487: Allow stat summary to query for multiple resources Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Update the stat cli help text to explain the new multi resource querying ability Propsal for #1487: Allow stat summary to query for multiple resources Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Allow stat summary to query for multiple resources Implement this ability by issuing parallel requests to requestStatsFromAPI() Proposal for #1487 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Update tests as part of multi-resource support in `linkerd stat` (#1487) - Refactor stat_test.go to reuse the same logic in multiple tests, and add cases and files for json output. - Add a couple of cases to api_utils_test.go to test multiple resources validation. Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * `linkerd stat` called with multiple resources should keep an ordering (#1487) Add SortedRes holding the order of resources to be followed when querying `linkerd stat` with multiple resources Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Extra validations for `linkerd stat` with multiple resources (#1487) Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * `linkerd stat` resource grouping, ordering and name prefixing (#1487) - Group together stats per resource type. - When more than one resource, prepend name with type. - Make sure tables always appear in the same order. Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Allow `linkerd stat` to be called with multiple resources A few final refactorings as per code review. Fixes #1487 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2018-11-14 10:44:04 -08:00
Alex Leong	32d556e732	Improve ergonomics of service profile spec (#1828 ) We make several changes to the service profile spec to make service profiles more ergonomic and to make them more consistent with the destination profile API. * Allow multiple fields to be simultaneously set on a RequestMatch or ResponseMatch condition. Doing so is equivalent to combining the fields with an "all" condition. * Rename "responses" to "response_classes" * Change "IsSuccess" to "is_failure" Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-31 12:00:22 -07:00
Alex Leong	d8b5ebaa6d	Remove the proxy-api container (#1813 ) A container called `proxy-api` runs in the Linkerd2 controller pod. This container listens on port 8086 and serves the proxy-api but does nothing other than forward gRPC requests to the destination container which listens on port 8089. We remove the proxy-api container altogether and change the destination container to listen on port 8086 instead of 8089. The result is that clients still use the proxy-api by connecting to `proxy-api.<ns>.svc.cluster.local:8086` but the controller has one fewer containers. This results in a simpler system that is easier to reason about. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-29 16:31:43 -07:00
Alex Leong	82ca821e62	Use fqdn for service profile name (#1808 ) Service profiles must be named in the form `"<service>.<namespace>"`. This is inconsistent with the fully normalized domain name that the proxy sends to the controller. It also does not permit creating service profiles for non-Kubernetes services. We switch to requiring that service profiles must be named with the FQDN of their service. For Kubernetes services, this is `"<service>.<namespace>.svc.cluster.local"`. This change alone is not sufficient for allowing service profile for non-Kubernetes services because the k8s resolver will ignore any DNS names which are not Kubernetes services. Further refactoring of the resolver will be required to allow looking up non-Kubernetes service profiles in Kuberenetes. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-29 14:35:42 -07:00
Alex Leong	6cffad277b	Make service profile validation a warning instead of an error (#1807 ) The existence of an invalid service profile causes `linkerd check` to fail. This means that it is not possible to open the Linkerd dashboard with the `linkerd dashboard` command. While service profile validation is useful, it should not lock users out. Add the ability to designate health checks as warnings. A failed warning health check will display a warning output in `linkerd check` but will not affect the overall success of the command. Switch the service profile validation to be a warning. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-26 13:28:10 -07:00
Alex Leong	652ca161ef	Add linkerd profile --template command (#1773 ) Add a new CLI command: `linkerd profile --template` which outputs a sample service profile yaml. Users can edit this sample and then `kubectl apply` it to add a service profile. The sample serves as "documentation by example" of what service profiles may contain. Example usage: ```bash linkerd profile -n emojivoto --template web-svc > web-svc-profile.yaml # edit web-svc-profile.yaml in your favorite editor kubectl apply -f web-svc-profile.yaml ``` Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-19 13:34:54 -07:00
Alena Varkockova	87b2773930	Fix the validation of docker registry, improve the error message (#1780 ) Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-10-18 15:36:28 -07:00
Alex Leong	43c22fe967	Implement getProfiles method in destination service (#1759 ) We implement the getProfiles method in the destination service. This method returns a stream of destination profiles for a given authority. It does this by looking up the ServiceProfile resource in the controller namespace named `<svc>.<ns>` where `<svc>` is the name of the service and `<ns>` is the namespace of the service. This PR includes: * Adding a ServiceProfile Custom Resource Definition to linkerd install * A watch based implementation of the getProfiles method in the destination service, similar to the implementation of get. * An update to the destination client script that allows querying the getProfiles method. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-10-16 15:39:12 -07:00
Kevin Lingerfelt	e9874b9c3e	Improve docker layer caching for web image (#1757 ) * Improve docker layer caching for web image * Move all web files to /linkerd dir Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-10-16 11:10:52 -07:00
Alejandro Pedraza	37bc8a69db	Added support for json output in `linkerd stat` (#1749 ) Added support for json output in `linkerd stat` through a new (-o\|--output)=json option. Fixes #1417 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2018-10-15 14:10:48 -07:00
Alejandro Pedraza	2d6fde274c	Make room for columns in `linkerd top` (#1750 ) * Make room for columns in `linkerd top` Make room for columns in `linkerd top`. Columns with data longer than some predetermined minimum length were stepping over each other. Proposal for #1728 * Removed unneeded truncations Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com>	2018-10-11 13:08:46 -07:00
Kevin Lingerfelt	46c887ca00	Add --single-namespace install flag for restricted permissions (#1721 ) * Add --single-namespace install flag for restricted permissions * Better formatting in install template * Mark --single-namespace and --proxy-auto-inject as experimental * Fix wording of --single-namespace check flag * Small healthcheck refactor Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-10-11 10:55:57 -07:00
Ivan Sim	4fba6aca0a	Proxy init and sidecar containers auto-injection (#1714 ) * Support auto sidecar-injection 1. Add proxy-injector deployment spec to cli/install/template.go 2. Inject the Linkerd CA bundle into the MutatingWebhookConfiguration during the webhook's start-up process. 3. Add a new handler to the CA controller to create a new secret for the webhook when a new MutatingWebhookConfiguration is created. 4. Declare a config map to store the proxy and proxy-init container specs used during the auto-inject process. 5. Ignore namespace and pods that are labeled with linkerd.io/auto-inject: disabled or linkerd.io/auto-inject: completed 6. Add new flag to `linkerd install` to enable/disable proxy auto-injection Proposed implementation for #561. * Resolve missing packages errors * Move the auto-inject label to the pod level * PR review items * Move proxy-injector to its own deployment * Ignore pods that already have proxy injected This ensures the webhook doesn't error out due to proxy that are injected using the command * PR review items on creating/updating the MWC on-start * Replace API calls to ConfigMap with file reads * Fixed post-rebase broken tests * Don't mutate the auto-inject label Since we started using healhcheck.HasExistingSidecars() to ensure pods with existing proxies aren't mutated, we don't need to use the auto-inject label as an indicator. This resolves a bug which happens with the kubectl run command where the deployment is also assigned the auto-inject label. The mutation causes the pod auto-inject label to not match the deployment label, causing kubectl run to fail. * Tidy up unit tests * Include proxy resource requests in sidecar config map * Fixes to broken YAML in CLI install config The ignore inbound and outbound ports are changed to string type to avoid broken YAML caused by the string conversion in the uint slice. Also, parameterized the proxy bind timeout option in template.go. Renamed the sidecar config map to 'linkerd-proxy-injector-webhook-config'. Signed-off-by: ihcsim <ihcsim@gmail.com>	2018-10-10 12:09:22 -07:00
Darko Radisic	6fee0f3c2b	Added --context flag to specify the context to use to talk to the Kubernetes apiserver (#1743 ) * Added --context flag to specify the context to use to talk to the Kubernetes apiserver * Fix tests that are failing * Updated context flag description Signed-off-by: Darko Radisic <ffd2subroutine@users.noreply.github.com>	2018-10-08 12:37:35 -07:00
Ben Lambert	69cebae1a2	Added ability to configure sidecar CPU + Memory requests (#1731 ) Horizontal Pod Autoscaling does not work when container definitions in pods do not all have resource requests, so here's the ability to add CPU + Memory requests to install + inject commands by proving proxy options --proxy-cpu + --proxy-memory Fixes #1480 Signed-off-by: Ben Lambert <ben@blam.sh>	2018-10-08 10:51:29 -07:00
Oliver Gould	eaec37c64f	cli: Use updated proxy config environment vars In linkerd/linkerd2-proxy#99, several proxy configuration variables were deprecated. This change updates the CLI to use the updated names to avoid deprecation warnings during startup.	2018-10-03 11:15:39 -07:00
Risha Mars	427078844d	Add HTTP method to Top display in the CLI and Web UI (#1709 ) * Add Method column to Top tables in Web UI * Add method to CLI top table	2018-09-25 11:15:11 -07:00
Rodrigo Chacon	783bb1c3a7	cli: add support for LINKERD_NAMESPACE environment variable (#1695 ) Signed-off-by: Rodrigo Chacon <rochacon@gmail.com>	2018-09-21 17:24:10 -07:00
Alena Varkockova	8ab9b4981b	Make wait flag configurable for check and dashboard (#1654 ) Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-09-19 10:42:29 -07:00
Kevin Lingerfelt	f1b3827194	Bump default check retry time to 5 minutes (#1645 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-14 10:58:03 -07:00
Alena Varkockova	169dcf4e70	Make wait=true a default option for check and dashboard (#1640 ) * Remove wait option and make it a default for check * Switch the wait default to true * Wait by default also for dashboard Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-09-14 09:59:04 -07:00
Andrew Seigner	5d85680ec1	Introduce inject check for known sidecars (#1619 ) `linkerd inject` was not checking its input for known sidecars and initContainers. Modify `linkerd inject` to check for existing sidecars and initContainers, specifically, Linkerd, Istio, and Contour. Part of #1516 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-11 15:09:19 -07:00
Andrew Seigner	bae05410fd	Bump Prometheus to v2.4.0, Grafana to 5.2.4 (#1625 ) Prometheus v2.3.1 -> v2.4.0 Grafana 5.1.3 -> 5.2.4 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-11 14:45:55 -07:00
Alex Leong	bd15482329	Add with-source flag to top (#1614 ) Fixes #1593 Add a `--hide-sources` flag to `linkerd top`. Setting this removes the source column from the output. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-09-11 14:21:36 -07:00
Andrew Seigner	7eec5f181d	Inject warns on UDP ports (#1617 ) linkerd only routes TCP data, but `linkerd inject` does not warn when it injects into pods with ports set to `protocol: UDP`. Modify `linkerd inject` to warn when injected into a pod with `protocol: UDP`. The Linkerd sidecar will still be injected, but the stderr output will include a warning. Also add stderr checking on all inject unit tests. Part of #1516. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-11 10:12:45 -07:00
Andrew Seigner	c5a719da47	Modify inject to warn when file is un-injectable (#1603 ) If an input file is un-injectable, existing inject behavior is to simply output a copy of the input. Introduce a report, printed to stderr, that communicates the end state of the inject command. Currently this includes checking for hostNetwork and unsupported resources. Malformed YAML documents will continue to cause no YAML output, and return error code 1. This change also modifies integration tests to handle stdout and stderr separately. example outputs... some pods injected, none with host networking: ``` hostNetwork: pods do not use host networking...............................[ok] supported: at least one resource injected..................................[ok] Summary: 4 of 8 YAML document(s) injected deploy/emoji deploy/voting deploy/web deploy/vote-bot ``` some pods injected, one host networking: ``` hostNetwork: pods do not use host networking...............................[warn] -- deploy/vote-bot uses "hostNetwork: true" supported: at least one resource injected..................................[ok] Summary: 3 of 8 YAML document(s) injected deploy/emoji deploy/voting deploy/web ``` no pods injected: ``` hostNetwork: pods do not use host networking...............................[warn] -- deploy/emoji, deploy/voting, deploy/web, deploy/vote-bot use "hostNetwork: true" supported: at least one resource injected..................................[warn] -- no supported objects found Summary: 0 of 8 YAML document(s) injected ``` TODO: check for UDP and other init containers Part of #1516 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-10 10:34:25 -07:00
Kevin Lingerfelt	b5ff29c8aa	Add data plane check to validate proxy version (#1574 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-04 15:22:38 -07:00
Kevin Lingerfelt	c7a79da89c	Add data plane check to validate proxies are ready (#1570 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-31 15:51:57 -07:00
Alex Leong	0f7d684ca9	Increase default max-rps for tap and top (#1531 ) The default value for the max-rps argument to the tap and top commands is an overly conservative 1rps. This causes the data to come in very slowly and much data to be discarded. Furthermore, because tap requests are windowed to 10 seconds, this causes long pauses between updates. We fix this in two ways. Firstly we reduce the window size to 1s so that updates will come in at least once per second, even when the actual RPS of the data path is extremely high. Secondly, we increase the default max-rps parameter from 1 to 100. This allows tap to paint an accurate picture of the data much more quickly and sidesteps some sampling bias that happens when the max-rps is low. In general, tap events tend to happen in bursts. For example, one request in may trigger one or more requests out. Likewise, a single upstream event may trigger several requests to the tapped pod in quick succession. Sampling bias will occur when the max-rps is less than the actual rps and when the tap event limit subdivides these event bursts (biasing towards the first few events in the burst). The greater the max-rps, the less the effects of this bias. Fixes #1525 Signed-off-by: Alex Leong <alex@buoyant.io>	2018-08-28 14:16:39 -07:00
Risha Mars	136b9cc7c1	Add linkerd check flag to run data plane checks (#1528 ) Adds a --proxy flag to the linkerd check CLI command which will run to-be-implemented data plane checks	2018-08-28 10:16:24 -07:00
Risha Mars	fff09c5d06	Only tap pods that are meshed (#1535 ) Previously, we would tap any resource's pods, regardless of whether the pods were meshed or not. We can't actually tap non-meshed pods, so I'm adding a check that will filter out non-meshed pods from the pods that tap watches. Previous behaviour: When attempting to hang a non meshed pod, it would establish a watch on the pods, but then never return any results. In the CLI you could just cancel it with Ctrl-C. In the web, clicking Stop would send a WebSocket.close(1000) but wouldn't actually close the connection... Behaviour after change : If no pods under the specified resource are meshed, it'll return an error of no pods being found to tap	2018-08-28 09:59:52 -07:00
Risha Mars	27e52a6cc0	Add ReadinessProbe and LivenessProbe to injected proxy containers (#1530 ) Adds basic probes to the linkerd-proxy containers injected by linkerd inject. - Currently the Readiness and Liveness probes are configured to be the same. - I haven't supplied a periodSeconds, but the default is 10. - I also set the initialDelaySeconds to 10, but that might be a bit high. https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-probes/	2018-08-27 11:55:17 -07:00
Kevin Lingerfelt	4450a7536d	Add --wait flag for CLI check and dashboard commands (#1503 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-22 12:56:42 -07:00
Kevin Lingerfelt	49f6c4c770	Refactor healthcheck init and observe setup (#1502 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-22 12:30:45 -07:00
Kevin Lingerfelt	53cd3b50d5	Add --pre flag for linkerd check command (#1497 ) * Add --pre flag for linkerd check command * Small adjustments to check help text Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-20 17:09:43 -07:00
Kevin Lingerfelt	e97be1f5da	Move all healthcheck-related code to pkg/healthcheck (#1492 ) * Move all healthcheck-related code to pkg/healthcheck * Fix failed check formatting * Better version check wording Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-20 16:50:22 -07:00
Eliza Weisman	b8434d60d4	Add resource metadata to Tap CLI output (#1437 ) Closes #1170. This branch adds a `-o wide` (or `--output wide`) flag to the Tap CLI. Passing this flag adds `src_res` and `dst_res` elements to the Tap output, as described in #1170. These use the metadata labels in the tap event to describe what Kubernetes resource the source and destination peers belong to, based on what resource type is being tapped, and fall back to pods if either peer is not a member of the specified resource type. In addition, when the resource type is not `namespace`, `src_ns` and `dst_ns` elements are added, which show what namespaces the the source and destination peers are in. For peers which are not in the Kubernetes cluster, none of these labels are displayed. The source metadata added in #1434 is used to populate the `src_res` and `src_ns` fields. Also, this branch includes some refactoring to how tap output is formatted. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-08-20 14:25:26 -07:00
Alex Leong	094a375015	[RFC] linkerd top (#1435 ) This an initial implementation of the `linkerd top` command. This command launches an ncurses style tabular view of current requests (using data from tap). Most of the command line arguments are the same as tap and allow selecting the resource to inspect and filtering which requests to view. Fixes #1283 Signed-off-by: Alex Leong <alex@buoyant.io>	2018-08-15 18:10:23 -07:00
Kevin Lingerfelt	00a0572098	Better CLI error messages when control plane is unavailable (#1428 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-09 15:40:41 -07:00
Eliza Weisman	9d8f58cb16	Add additional validation for stat command-line arguments (#1415 ) Closes #776. This branch adds the following validation to the `linkerd stat` command: * The `--to` and `--from` flags are now mutually exclusive * The `--to-namespace` and `--from-namespace` commands are also mutually exclusive. * The `namespace` resource type conflicts with the `--namespace`, `--to-namespace`, and `--from-namespace` flags. Examples: ``` $ bin/go-run cli/main.go stat deploy --to deploy/foo --from deploy/bar Error: --to and --from flags are mutually exclusive Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat deploy --to-namespace foo --from-namespace bar Error: --to-namespace and --from-namespace flags are mutually exclusive Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat namespace foo --namespace bar Error: --namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat ns --to-namespace bar Error: --to-namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat namespace --from-namespace bar Error: --from-namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` ``` $ bin/go-run cli/main.go stat ns/foo --from-namespace bar Error: --from-namespace flag is incompatible with namespace resource type Usage: linkerd stat [flags] (RESOURCE) ... ``` Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-08-08 15:35:47 -07:00
Kevin Lingerfelt	82940990e9	Rename mailing lists, remove all remaining conduit references (#1416 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 17:00:55 -07:00
Kevin Lingerfelt	4845b4ec04	Restore linkerd.io/control-plane* labels (#1411 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 13:53:29 -07:00
Kevin Lingerfelt	e0a01c5dd8	Remove node scrape target, kubernetes grafana dashboard (#1410 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 13:41:38 -07:00
Kevin Lingerfelt	bd19e8aaff	Update prometheus to only scrape proxies in the same mesh (#1402 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-06 12:05:55 -07:00
Eliza Weisman	01cc30d102	Increase outbound router capacity for Prometheus pod's proxy (#1358 ) Currently, when a cluster has over 100 pods injected with the Linkerd2 proxy, Prometheus metrics are not collected correctly. This is because Prometheus appears to be making more concurrent requests than its' proxy's outbound router cache can handle See issue #1322 for further details. This branch introduces a workaround for this issue, by increasing the outbound router cache capacity to 10000 routes for the Prometheus pod's proxy only. The router capacity limit of 100 active routes is primarily due to the limitation of the number of active Destination service lookups, so increasing the capacity for the Prometheus pod specifically is probably okay, as the scrape requests are made to IP addresses directly and therefore will not cause service discovery lookups. This change was originally implemented and tested in @siggy's PR #1228. I've rebased his branch onto the current `master`, and updated the code to reflect the project name change. Signed-off-by: Eliza Weisman <eliza@buoyant.io> Co-authored-by: Andrew Seigner <siggy@buoyant.io>	2018-08-02 16:44:11 -07:00
Ivan Sim	eb04217a12	Update inject cmd to read from folder (#1377 ) This change is a simplified implementation of the Builder.Path() and Visitor().ExpandPathsToFileVisitors() functions used by kubectl to parse files and directories. The filepath.Walk() function is used to recursively traverse directories. Every .yaml or .json resource file in the directory is read into its own io.Reader. All the readers are then passed to the YAMLDecoder in the InjectYAML() function. Fixes #1376 Signed-off-by: ihcsim <ihcsim@gmail.com>	2018-08-01 17:12:00 -07:00
Kevin Lingerfelt	8fe9e53f67	Remove remaining conduit references in codebase (#1381 ) * Remove remaining conduit references in codebase * Shorten emojivoto config url Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-31 11:19:34 -07:00
Kevin Lingerfelt	51848230a0	Send glog logs to stderr by default (#1367 ) * Send glog logs to stderr by default * Factor out more shared flag parsing code Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-25 12:59:24 -07:00
Risha Mars	ec3c861743	Enable Tap from the Web UI (#1356 ) Adds a tap endpoint in the web api that communicates with the dashboard via websockets. I've moved a bunch of code from the cli tap.go into utils so that the code can be shared between web and CLI. I think we should consider making the display more suited to web, but in the short term, reusing the CLI's rendering of tap events works. Adds a Tap page in the Web UI that you can use to make tap requests. The form currently only allows you to enter a resource and namespace, other filters coming in a follow-up branch.	2018-07-24 14:23:42 -04:00
Kevin Lingerfelt	4b9700933a	Update prometheus labels to match k8s resource names (#1355 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-23 15:45:05 -07:00
Brian Smith	a98bfb1ca7	Rename `ca-bundle-distributor` to `ca`. (#1340 ) `ca-bundle-distributor` described the original role of the program but `ca` ("Certificate Authority") better describes its current role. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-07-17 14:10:40 -10:00
Brian Smith	1b38310019	Remove executable bit from non-executable files. (#1335 ) These files were created with the executable bit set accidentally due to the way my network file system setup was configured. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-07-16 13:55:52 -10:00
Franziska von der Goltz	c7ac072acc	update grafana dashboards: conduit to linkerd (#1320 ) * update grafana dashboards to remove conduit reference and replace with linkerd instances * update test install fixtures to reflect changes Fixes: #1315 Signed-off-by: Franziska von der Goltz <franziska@vdgoltz.eu>	2018-07-16 13:05:01 -07:00
Kevin Lingerfelt	e5cce1abaf	Rename CLI from conduit to linkerd (#1312 ) * Rename CLI binary * Update integration tests for new binary name * Rename --conduit-namespace flag, change default ns * Rename occurrences of conduit in rest of CLI * Rename inject and install components * Remove conduit occurrences in docker files * Additional miscellaneous cleanup * Move protobuf definitions to linkerd2 package * Rename conduit.io labels to use linkerd.io * Rename conduit-managed segment to linkerd-managed * Fix conduit references in web project Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-12 17:14:07 -07:00
Andrew Seigner	e18fa48135	Name ClusterRole objects to be namespace-specific (#1295 ) The control-plane's `ClusterRole` and `ClusterRoleBinding` objects are global. Because their names did not vary across multiple control-plane deployments, it prevented multiple control-planes from coexisting (when RBAC is enabled). Modify the `ClusterRole` and `ClusterRoleBinding` objects to include the control-plane's namespace in their names. Also modify the integration test to first install two control-planes, and then perform its full suite of tests, to prevent regression. Fixes #1292. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-07-10 16:21:20 -07:00
Oliver Gould	941cad4a9c	Migrate build infrastructure to linkerd2 (#1298 ) This PR begins to migrate Conduit to Linkerd2: * The proxy has been completely removed from this repo, and is now located at github.com/linkerd/linkerd2-proxy. * A `Dockerfile-proxy` has been added to fetch the most-recently published proxy binary from build.l5d.io. * Proxy-specific protobuf bindings have been moved to github.com/linkerd/linkerd2-proxy-api. * All docker images now use the gcr.io/linkerd-io registry. * `inject` now uses `LINKERD2_PROXY_` environment variables * Go paths have been updated to reflect the new (future) repo location.	2018-07-09 15:38:38 -07:00
Kevin Lingerfelt	fd1aecfa63	Unhide --tls flag in conduit CLI (#1278 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-05 15:49:19 -07:00
Kevin Lingerfelt	f0ba8f3ee8	Fix owner types in TLS identity strings (#1257 ) * Fix owner types in TLS identity strings * Update documentation on TLSIdentity struct Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-03 14:20:24 -07:00
Risha Mars	83b982b25a	Change CLI and web TLS indicators from Secured to TLS (#1247 ) Previously, we had "Secured" columns in the web and CLI for the percentage of traffic that is TLSed. Change this to "TLS"	2018-07-03 10:51:38 -07:00
Brian Smith	252a8d39d3	Generate an ephemeral CA at startup that distributes TLS credentials (#1245 ) Create a ephemeral, in-memory TLS certificate authority and integrate it into the certificate distributor. Remove the re-creation of deleted ConfigMaps; this will be added back later in #1248. Signed-off-by: Brian Smith brian@briansmith.org Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-02 18:09:31 -10:00
Oliver Gould	20276b106e	tap: Support `tls` labeling (#1244 ) The proxy's metrics are instrumented with a `tls` label that describes the state of TLS for each connection and associated messges. This same level of detail is useful to get in `tap` output as well. This change updates Tap in the following ways: * `TapEvent` protobuf updated: * Added `source_meta` field including source labels * `proxy_direction` enum indicates which proxy server was used. * The proxy adds a `tls` label to both source and destination meta indicating the state of each peer's connection * The CLI uses the `proxy_direction` field to determine which `tls` label should be rendered.	2018-07-02 17:19:20 -07:00
Kevin Lingerfelt	a685dba873	Use parent name instead of pod name in identity string (#1236 ) * Use parent name instead of pod name in identity string * Update protobuf comment Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-29 14:28:13 -07:00
Brian Smith	f989c56127	Proxy: Skip TLS for control plane loopback connections. (#1229 ) If the controller address has a loopback host then don't use TLS to connect to it. TLS isn't needed for security in that case. In mormal configurations the proxy isn't terminating TLS for loopback connections anyway. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-06-28 17:24:09 -10:00
Risha Mars	5ed7fc563c	Add controller component pod uptimes to the ServiceMesh page (#1205 ) - Return pod uptimes from the GetPods endpoint - Adds filtering by namespace to api.GetPods - Adds a --namespace filter to conduit get pods - Adds pod uptimes to the controller component toolitps on the ServiceMesh page - Moves the ServiceMesh page back to using /api/pods	2018-06-28 15:42:00 -07:00
Risha Mars	68586fe697	Add the ability to query stats by authority (#1181 ) Adds the ability to query by a new non-kubernetes resource type, "authorities", in the StatSummary api. This includes an extensive refactor of stat_summary.go to deal with non-kubernetes resource types. - Add documentation to Resource in the public api so we can use it for authority - Handle non-k8s resource requests in the StatSummary endpoint - Rewrite stat summary fetching and parsing to handle non-k8s resources - keys stat summary metric handling by Resource instead of a generated string - Adds authority to the CLI - Adds /authorities to the Web UI - Adds some more stat integration and unit tests	2018-06-28 14:31:44 -07:00
Kevin Lingerfelt	ef9c890505	Fix issue with injected resource name, add test (#1226 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-28 10:23:38 -10:00
Oliver Gould	9f274526d6	cli: tap: Use safe accessors (#1224 ) The `tap` command is prone to panic due to use of `nil` values. This is because we don't use the safe `Get*()` field accessors provided by protobuf. This change fixes several unsafe field access paths. Fixes #47	2018-06-28 11:10:56 -07:00
Thomas Rampelberg	fafce1b8b3	Add important comment back (#1219 )	2018-06-28 08:18:52 -07:00
Brian Smith	cca8e7077d	Add TLS support to `conduit inject`. (#1220 ) * Add TLS support to `conduit inject`. Add the settings needed to enable TLs when `--tls=optional` is passed on the commend line. Later the requirement to add `--tls` will be removed. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-06-27 16:04:07 -10:00
Thomas Rampelberg	97868f654f	Add Pod to injectable types (#1213 ) * Add Pod to injectable types * Remove the pod label for pods	2018-06-27 14:37:05 -07:00
Kevin Lingerfelt	1f1968ad4d	Add --registry flag support for inject command (#1188 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-22 12:52:42 -07:00
Kevin Lingerfelt	5cf8ab00df	Switch to multi-value --tls flag, add to inject (#1182 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-21 15:52:14 -07:00
Kevin Lingerfelt	af85d1714f	Add probes and log termination policy for distributor (#1178 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-21 14:02:41 -07:00
Kevin Lingerfelt	12f869e7fc	Add CA certificate bundle distributor to conduit install (#675 ) * Add CA certificate bundle distributor to conduit install * Update ca-distributor to use shared informers * Only install CA distributor when --enable-tls flag is set * Only copy CA bundle into namespaces where inject pods have the same controller * Update API config to only watch pods and configmaps * Address review feedback Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-21 13:12:21 -07:00
Kevin Lingerfelt	e80356de34	Upgrade prometheus to v2.3.1 (#1174 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-21 11:02:21 -07:00
Kevin Lingerfelt	682b0274b5	Add controller admin servers and readiness probes (#1168 ) * Add controller admin servers and readiness probes * Tweak readiness probes to be more sane * Refactor based on review feedback Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-20 17:32:44 -07:00
Risha Mars	0ff1bb4ad8	Don't allow stat requests for named resources in --all-namespaces (#1163 ) Don't allow the CLI or Web UI to request named resources if --all-namespaces is used. This follows kubectl, which also does not allow requesting named resources over all namespaces. This PR also updates the Web API's behaviour to be in line with the CLI's. Both will now default to the default namespace if no namespace is specified.	2018-06-20 12:59:31 -07:00
Risha Mars	46c99febf2	Don't panic on stats that aren't included in StatAllResourceTypes (#1154 ) Problem `conduit stat` would cause a panic for any resource that wasn't in the list of StatAllResourceTypes This bug was introduced by https://github.com/runconduit/conduit/pull/1088/files Solution Fix writeStatsToBuffer to not depend on what resources are in StatAllResourceTypes Also adds a unit test and integration test for `conduit stat ns`	2018-06-19 17:00:16 -07:00
Andrew Seigner	0b9e7ff7df	Enable get for nodes/proxy for Prometheus RBAC (#1142 ) The `kubernetes-nodes-cadvisor` Prometheus queries node-level data via the Kubernetes API server. In some configurations of Kubernetes, namely minikube and at least one baremetal kubespray cluster, this API call requires the `get` verb on the `nodes/proxy` resource. Enable `get` for `nodes/proxy` for the `conduit-prometheus` service account. Fixes #912 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-06-18 17:49:23 +01:00
Risha Mars	e2c2f19d2c	Propagate errors in conduit containers to the api (#1117 ) - It would be nice to display container errors in the UI. This PR gets the pod's container statuses and returns them in the public api - Also add a terminationMessagePolicy to conduit's inject so that we can capture the proxy's error messages if it terminates	2018-06-14 16:22:31 -07:00
Thomas Rampelberg	516807bde6	Add readiness/liveness checks for third party components (#1121 ) * Add readiness/liveness checks for third party components Any possible issues with the third party control plane components can wedge the services. Take the best practices for prometheus/grafana and add them to our template. See #1116 * Update test fixtures for new output	2018-06-14 13:01:13 -07:00
Kevin Lingerfelt	9f1df963e9	Move controller/util and web/util packages to pkg (#1109 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-13 11:25:56 -07:00
Risha Mars	7d4c4aa290	CLI: print resources in the same order every time stat all is run (#1088 ) Previously, in conduit stat all we would just print the map of stat results, which resulted in the order in which stats were displayed varying between prints. Fix: Define an array, k8s.StatAllResourceTypes and use the order in this array to print the map; ensuring a consistent print order every time the command is run.	2018-06-08 15:02:17 -07:00
Kevin Lingerfelt	eebc612d52	Add install flag for sending tls identity info to proxies (#1055 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-04 16:55:06 -07:00
Kevin Lingerfelt	ec2433e9bd	Update controller to use 'tls' metric label (#1044 ) * Update controller to use 'tls' metric label * Fix meshed column formatter Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-01 16:44:33 -07:00
Sacha Froment	84781c9c74	conduit inject: Add flag to set proxy bind timeout (#865 ) * conduit inject: Add flag to set proxy bind timeout (#863) * fix test * fix flag to get it working with #909 * Add time parsing * Use the variable to set the default value Signed-off-by: Sacha Froment <sfroment42@gmail.com>	2018-05-29 11:14:29 -07:00
Risha Mars	d333a7d861	Add a secured label to the CLI tap responses (#996 ) Adds secured=yes/no to the conduit tap responses. This assumes a `meshed` label is returned by the proxy.	2018-05-25 11:21:38 -07:00
Risha Mars	ffabdefc6c	Add queries to prometheus to determine number of fully meshed requests (#983 ) - Update the `response_total` prometheus query of the StatSummary endpoint to also break queries out by a `meshed` label. - Add a 'Secured' column to the web UI/CLI stat displays, which indicate the percentage of traffic starting and ending in the mesh This meshed label is used in the CLI/Web UI to display a column of the percentage of traffic that starts/ends in the mesh. (Which is a proxy indicator for whether that traffic is 'secured' when we add TLS by default for intra mesh requests). The `meshed` label is not yet added anywhere, so until it is supplied by the proxy, all traffic will show up as 0% secured in the web/CLI.	2018-05-24 11:05:09 -07:00
Haiwei Liu	8c98cde82b	change init image to root options, for install and inject to use (#1001 ) Signed-off-by: Haiwei Liu <carllhw@gmail.com>	2018-05-24 10:21:16 -07:00
Andrew Seigner	8a1a3b31d4	Fix non-default proxy-api port (#979 ) Running `conduit install --api-port xxx` where xxx != 8086 would yield a broken install. Fix the install command to correctly propagate the `api-port` flag, setting it as the serve address in the proxy-api container. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-22 10:34:25 -07:00
Kevin Lingerfelt	2baeaacbc8	Remove package-scoped vars in cmd package (#975 ) * Remove package-scoped vars in cmd package * Run gofmt on all cmd package files * Re-add missing Args setting on check command Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-05-21 18:15:39 -07:00
Risha Mars	b8dc83f9d2	Modify the Stat API to handle requests for resource type "all" (#928 ) Allow the Stat endpoint in the public-api to accept requests for resourceType "all". Currently, this queries Pods, Deployments, RCs and Services, but can be modified to query other resources as well. Both the CLI and web endpoints now work if you set resourceType to all. e.g. `conduit stat all`	2018-05-11 14:35:37 -07:00
Kevin Lingerfelt	4e8e1eb84d	CLI: Fix validation for service stats (#935 ) * CLI: Fix validation for service stats * Address review feedback Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-05-11 10:28:49 -07:00
Thomas Rampelberg	25d8b22b5c	Adding statefulsets to inject. Fixes #907 (#910 )	2018-05-10 09:00:36 -05:00
Thomas Rampelberg	e96c2e1135	Made inject aware of the List type. (#886 )	2018-05-10 08:08:55 -05:00
Andrew Seigner	1275b1ae89	Introduce Grafana, K8s, and Prom dashboards (#904 ) Grafana provides default dashboards for Prometheus and Grafana health. The community also provides Kubernetes-specific dashboards. Conduit was not taking advantage of these. Introduce new Grafana dashboards focused on Grafana, Kubernetes, and Prometheus health. Tag all Conduit dashboards for easier UI navigation. Also fix layout in Conduit Health dashboard. Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-08 23:11:43 +02:00
Risha Mars	f94856e489	Modify the Stat endpoint to also return the number of failed conduit pods (#895 ) * Modify the Stat endpoint to also return the count of failed pods * Add comments explaining pod count stats * Rename total pod count to running pod count This is to support the service mesh overview page, as I'd like to include an indicator of failed pods there.	2018-05-08 10:35:21 -07:00
Andrew Seigner	97bf4fcdf2	Release Notes for 0.4.1 release. (#839 ) Also update Getting Started and Debugging docs to reflect changes in `Tap` and `Stat`. Fixes #838 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-26 13:32:41 -07:00
Andrew Seigner	dce31b888f	Deprecate Tap, rename TapByResource to Tap (#844 ) The `conduit tap` command is now deprecated. Replace `conduit tap` with `connduit tapByResource`. Rename tapByResource to tap. The underlying protobuf for tap remains, the tap gRPC endpoint now returns Unimplemented. Fixes #804 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-25 12:24:46 -07:00
Andrew Seigner	640570cd6b	Make TapByResource output destination pod (#837 ) The TapByResource command now has access to destination labels from the proxy, but was not outputting them on the cli. Modify the TapByResource output to print the destination pod label, rather than the ip, when available. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-24 18:58:40 -07:00
Andrew Seigner	a0a9a42e23	Implement Public API and Tap on top of Lister (#835 ) public-api and and tap were both using their own implementations of the Kubernetes Informer/Lister APIs. This change factors out all Informer/Lister usage into the Lister module. This also introduces a new `Lister.GetObjects` method. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-24 18:10:48 -07:00
Andrew Seigner	39eccb09e2	cli: standardize kubernetes resource parsing (#830 ) The Tap command leveraged new cli parsing code, enabling Kubernetes resources specified as `(TYPE [NAME] \| TYPE/NAME)`. The Stat command did not use this. Modify the Stat command to use the same cli flag parsing code as Tap. Remove the to/from-resource flags from Stat. Fixes #792 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-23 15:17:42 -07:00
Andrew Seigner	1f32f130de	Upgrade Prometheus from 2.1.0 to 2.2.1 (#816 ) There have been a number of performance improvements and bug fixes since v2.1.0. Bump our Prometheus container to v2.2.1. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-19 18:00:53 -07:00
Andrew Seigner	79bdc638b3	Service support in stat command (#809 ) The `stat` command did not support `service` as a resource type. This change adds `service` support to the `stat` command. Specifically: - as a destination resource on `--to` commands - as a target resource on `--from` commands Fixes #805 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-19 16:51:20 -07:00
Andrew Seigner	293e00bc3e	Introduce tapByResource cli command (#802 ) The existing `tap` command is being deprecated. Introduce a `tapByResource` cli command. It supports tapping a Kubernetes resource or collection of resources, optionally filtered by outbound resources. This command will eventually replace `tap`. Part of #778 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-19 14:44:23 -07:00
Kevin Lingerfelt	653dc6bfaa	Add replication controller stats in CLI (#794 ) * Add replication controller stats in CLI * Fix pod status in stat summary tests Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-18 18:12:14 -07:00
Oliver Gould	06dd8d90ee	Introduce the TapByResource API (#778 ) This changes the public api to have a new rpc type, `TapByResource`. This api supersedes the Tap api. `TapByResource` is richer, more closely reflecting the proxy's capabilities. The proxy's Tap api is extended to select over destination labels, corresponding with those returned by the Destination api. Now both `Tap` and `TapByResource`'s responses may include destination labels. This change avoids breaking backwards compatibility by: * introducing the new `TapByResource` rpc type, opting not to change Tap * extending the proxy's Match type with a new, optional, `destination_label` field. * `TapEvent` is extended with a new, optional, `destination_meta`.	2018-04-18 15:37:07 -07:00
Kevin Lingerfelt	71a51afb40	Expose pod stats in CLI, web UI, and Grafana (#788 ) * Expose pod stats in CLI, web UI, and Grafana * Fix js api helpers test * Add outbound traffic stats to pod dashboard Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-18 11:26:47 -07:00
Andrew Seigner	727521f914	Permit arbitrary time windows in public-api (#774 ) The public-api previously only permitted 4 hard-coded time windows: 10s, 1m, 10m, 1h. This was primarily a relic of the recently removed telemetry system. Modify the public-api to validate the time string, but allow for any window size, which is then passed through to Prometheus. Fixes #686 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-16 17:37:17 -07:00
Kevin Lingerfelt	11a4359e9a	Misc cleanup following the telemetry rewrite (#771 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-16 15:51:07 -07:00
Oliver Gould	800cefdb77	Skip the proxy on the metrics port (#770 ) When prometheus queries the proxy for data, these requests are reported as inbound traffic to the pod. This leads to misleading stats when a pod otherwise receives little/no traffic. In order to prevent these requests being proxied, the metrics port is now added to the default inbound skip-ports list (as is already case for the tap server). Fixes #769	2018-04-16 11:54:58 -07:00
Andrew Seigner	77fb6d3709	Add namespace as a resource type in public-api (#760 ) * Add namespace as a resource type in public-api The cli and public-api only supported deployments as a resource type. This change adds support for namespace as a resource type in the cli and public-api. This also change includes: - cli statsummary now prints `-`'s when objects are not in the mesh - cli statsummary prints `No resources found.` when applicable - removed `out-` from cli statsummary flags, and analagous proto changes - switched public-api to use native prometheus label types - misc error handling and logging fixes Part of #627 Signed-off-by: Andrew Seigner <siggy@buoyant.io> * Refactor filter and groupby label formulation Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Rename stat_summary.go to stat.go in cli Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Update rbac privileges for namespace stats Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-13 16:53:01 -07:00
Oliver Gould	cc44db054f	Remove NODE_NAME and POD_NAME env usage (#758 ) * proxy: Remove pod_name and node_name * cli: Do not inject POD_NAME and NODE_NAME env vars	2018-04-13 13:09:51 -07:00
Kevin Lingerfelt	fb15fe7c1a	Remove the telemetry service (#757 ) * Remove the telemetry service The telemetry service is no longer needed, now that prometheus scrapes metrics directly from proxies, and the public-api talks directly to prometheus. In this branch I'm removing the service itself as well as all of the telemetry protobuf, and updating the conduit install command to no longer install the service. I'm also removing the old version of the stat command, which required the telemetry service, and renaming the statsummary command to stat. * Fix time window tests * Remove deprecated controller scrape config Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-13 11:21:29 -07:00
Kevin Lingerfelt	47caf1ca07	Add --all-namespaces flag to CLI statsummary command (#745 ) * Add --all-namespaces flag to CLI statsummary command * Fix statsummary output formatting Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-11 16:40:25 -07:00
Andrew Seigner	259fdcd134	Add latency stats in new stat summary endpoint (#737 ) The new StatSummary endpoint was only providing request volume and successs rate information. Add support for retrieving latency stats via StatSummary. Also make all prometheus calls in parallel, and implement kubernetes test fixtures. Fixes #681 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-11 11:58:32 -07:00
Andrew Seigner	716b392231	Move StatSummary logic into grpc server (#717 ) The StatSummary logic was implemented as a method on http_server. Move the StatSummary logic into grpc_server, for consistency with the other endpoints. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-06 16:46:15 -07:00
Kevin Lingerfelt	baa4d10c2f	CLI: change conduit namespace shorthand flag to -c (#714 ) * CLI: change conduit namespace shorthand flag to -c All of the conduit CLI subcommands accept a --conduit-namespace flag, indicating the namespace where conduit is running. Some of the subcommands also provide a --namespace flag, indicating the kubernetes namespace where a user's application code is running. To prevent confusion, I'm changing the shorthand flag for the conduit namespace to -c, and using the -n shorthand when referring to user namespaces. As part of this change I've also standardized the capitalization of all of our command line flags, removed the -r shorthand for the install --registry flag, and made the global --kubeconfig and --api-addr flags apply to all subcommands. * Switch flag descriptions from lowercase to Capital Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-06 14:47:31 -07:00
Andrew Seigner	50c323c617	Use canonical k8s names, fix prom labels (#702 ) The new statsummary command accepted friendly k8s names, which worked for k8s queries, but Prometheus requires a specific key. Modify the statsummary query to map friendly k8s names to canonical k8s names when constructing the query. Then during the query, map the canonical k8s name to a specific Prometheus label. Fixes #695 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-06 12:34:54 -07:00
Risha Mars	2f5b5ea5f2	Start implementing conduit stat summary endpoint (#671 ) Start implementing new conduit stat summary endpoint. Changes the public-api to call prometheus directly instead of the telemetry service. Wired through to `api/stat` on the web server, as well as `conduit statsummary` on the CLI. Works for deployments only. Current implementation just retrieves requests and mesh/total pod count (so latency stats are always 0). Uses API defined in #663 Example queries the stat endpoint will eventually satisfy in #627 This branch includes commits from @klingerf * run ./bin/dep ensure * run ./bin/update-go-deps-shas	2018-04-05 17:05:06 -07:00
Andrew Seigner	28d5007cdf	Harmonize Prometheus label usage (#690 ) The Destination service used slightly different labels than the telemetry pipeline expected, specifically, prefixed with `k8s_`. Make all Prometheus labels consistent by dropping `k8s_`. Also rename `pod_name` to `pod` for consistency with `deployement`, etc. Also update and reorganize `proxy-metrics.md` to reflect new labelling. Fixes #655 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-05 15:09:06 -07:00
Andrew Seigner	9508e11b45	Build conduit-specific Grafana Docker image (#679 ) Using a vanilla Grafana Docker image as part of `conduit install` avoided maintaining a conduit-specific Grafana Docker image, but made packaging dashboard json files cumbersome. Roll our own Grafana Docker image, that includes conduit-specific dashboard json files. This significantly decreases the `conduit install` output size, and enables dashboard integration in the docker-compose environment. Fixes #567 Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-05 14:20:05 -07:00
Andrew Seigner	ee042e1943	Rename grafana viz to top-line (#666 ) The primary Grafana dashboard was named 'viz' from a prototype. Rename 'viz' to 'Top Line'. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-02 18:10:35 -07:00
Andrew Seigner	bf721466e3	Filter out conduit controller pods from Grafana (#657 ) The Grafana dashboards were displaying all proxy-enabled pods, including conduit controller pods. In the old telemetry pipeline filtering these out required knowledge of the controller's namespace, which the dashboards are agnostic to. This change leverages the new `conduit_io_control_plane_component` prometheus label to filter out proxy-enabled controller components. Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-02 17:56:12 -07:00
Andrew Seigner	8fe742e2de	Update Grafana dashboards to use new proxy metrics (#637 ) The Top-line and Deployment Grafana dashboards relied on the soon-to-be-removed telemetry pipeline metrics. Update the Grafana dashboards to query for the new, proxy-based metrics. Grafana dashboard layouts have not changed. Depends on #635 to render metrics. Part of #420. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-29 13:00:01 -07:00
Andrew Seigner	666c83e963	Add pod_name to Prometheus labels (#649 ) Previously we were using the instance label to uniquely identify a pod. This meant that getting stats by pod name would require extra queries to Kubernetes to map pod name to instance. This change adds a pod_name label to metrics at collection time. This should not affect cardinality as pod_name is invariant with respect to instance. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-29 11:07:35 -07:00
Deshi Xiao	732f3c1565	fix grafana CrashLoopBackoff on image 5.0.3 (#646 ) this is a known issue with grafana in k8s. grafana/grafana:5.0.4 was just released today. update the repo from 5.0.3 to 5.0.4 fixed issues #582 Signed-off-by: Deshi Xiao <xiaods@gmail.com>	2018-03-29 09:36:11 -07:00
Andrew Seigner	fe35509406	Clean up Prometheus labels scraped from proxy (#633 ) The Prometheus scrape config collects from Conduit proxies, and maps Kubernetes labels to Prometheus labels, appending "k8s_". This change keeps the resultant Prometheus labels consistent with their source Kubernetes labels. For example: "deployment" and "pod_template_hash". Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-27 15:01:08 -07:00
Andrew Seigner	291d8e97ab	Move injected data from env var to k8s labels (#605 ) The inject code detects the object it is being injected into, and writes self-identifying information into the CONDUIT_PROMETHEUS_LABELS environment variable, so that conduit-proxy may read this information and report it to Prometheus at collection time. This change puts the self-identifying information directly into Kubernetes labels, which Prometheus already collects, removing the need for conduit-proxy to be aware of this information. The resulting label in Prometheus is recorded in the form `k8s_deployment`. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-23 16:11:34 -07:00
Andrew Seigner	fb1d6a5c66	Introduce Conduit Health dashboard (#591 ) In addition to dashboards display service health, we need a dashboard to display health of the Conduit service mesh itself. This change introduces a conduit-health dashboard. It currently only displays health metrics for the control plane components. Proxy health will come later. Fixes #502 Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-22 15:16:03 -07:00
Alex Leong	d50550515e	Add the proxy pod owner as a Prometheus label (#448 ) Update the inject command to set a CONDUIT_PROMETHEUS_LABELS proxy environment variable with the name of the pod spec that the proxy is injected into. This will later be used as a label value when the proxy is exposing metrics. Fixes: #426 Signed-off-by: Alex Leong <alex@buoyant.io>	2018-03-22 15:10:51 -07:00
Andrew Seigner	c03508ba8c	Update Prometheus to scrape data and control plane (#583 ) The existing telemetry pipeline relies on Prometheus scraping the Telemetry service, which will soon be removed. This change configures Prometheus to scrape the conduit proxies directly for telemetry data, and the control plane components for control-plane health information. This affects the output of both conduit install and conduit inject. Fixes #428, #501 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-22 13:58:11 -07:00
Andrew Seigner	680bf6211a	Add Grafana support to conduit dashboard command (#590 ) The existing `conduit dashboard` command supported opening the conduit dashboard, or displaying the conduit dashboard URL, via a `url` boolean flag. Replace the `url` boolean flag with a `show` string flag, with three modes: `conduit dashboard --show conduit`: default, open conduit dashboard `conduit dashboard --show grafana`: open grafana dashboard `conduit dashboard --show url`: display dashboard URLs Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-20 18:07:30 -07:00
Andy Hume	e6286e1bdf	cli: ensure check command has 80-character output (#587 ) Successful `conduit check` commands now take into account `[ok]` and `\n` tokens when constraining line length. Fixes #554 Signed-off-by: Andy Hume <andyhume@gmail.com>	2018-03-20 13:55:19 -07:00
Andrew Seigner	3ca8e84eec	Add Top Line and Deployment Grafana dashboards (#562 ) Existing Grafana configuration contained no dashboards, just a skeleton for testing. Introduce two Grafana dashboards: 1) Top Line: Overall health of all Conduit-enabled services 2) Deployment: Health of a specific conduit-enabled deployment Fixes #500 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-20 10:22:30 -07:00
Alex Leong	9eb084c99d	Most controller listeners should only bind on localhost (#494 ) * Most controller listeners should only bind on localhost * Use default listening addresses in controller components * Review feedback * Revert test_helper change * Revert use of absolute domains Signed-off-by: Alex Leong <alex@buoyant.io>	2018-03-12 11:32:20 -07:00
Brian Smith	649e784d9c	Simplify cluster zone suffix handling in the proxy (#528 ) * Temporarily stop trying to support configurable zones in the proxy. None of the zone configuration is tested and lots of things assume the cluster zone is `cluster.local`. Further, how exactly the proxy will actually learn the cluster zone hasn't been decided yet. Just hard-code the zone as "cluster.local" in the proxy until configurable zones are fully implemented and tested to be working correctly. Signed-off-by: Brian Smith <brian@briansmith.org> * Remove the CONDUIT_PROXY_DESTINATIONS_AUTOCOMPLETE_FQDN setting The way that Kubernetes configures DNS search suffixes has some negative consequences as some names like "example.com" are ambiguous: depending on whether there is a service "example" in the "com" namespace, "example.com" may refer to an external service or an internal service, and this can fluctuate over time. In recognition of that we added the CONDUIT_PROXY_DESTINATIONS_AUTOCOMPLETE_FQDN setting, thinking this would be part of a solution for users to opt out of the unfortunate behavior if their applications didn't depend on the DNS search suffix feature. It turns out similar effects can be acheived using a custom dnsConfig, starting in Kubernetes 1.10 when dnsConfig reaches the beta stability level. Now any CONDUIT_PROXY_DESTINATIONS_AUTOCOMPLETE_FQDN-based seems duplicative. Further, attempting to support it optionally made the code complex and hard to read. Therefore, let's just remove it. If/when somebody actually requests this functionality then we can add it back, if dnsConfig isn't a valid alternative for them. Signed-off-by: Brian Smith <brian@briansmith.org> * Further hard-code "cluster.local" as the zone, temporarily. Addresses review feedback. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-07 14:30:13 -10:00
Brian Smith	0d4ab39ce7	Revert "Make absolute names truly absolute. (#525 )" (#533 ) This reverts commit `517616a166`. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-07 10:57:10 -10:00
Brian Smith	517616a166	Make absolute names truly absolute. (#525 ) Kubernetes will do multiple DNS lookups for a name like `proxy-api.conduit.svc.cluster.local` based on the default search settings in /etc/resolv.conf for each container: 1. proxy-api.conduit.svc.cluster.local.conduit.svc.cluster.local. IN A 2. proxy-api.conduit.svc.cluster.local.svc.cluster.local. IN A 3. proxy-api.conduit.svc.cluster.local.cluster.local. IN A 4. proxy-api.conduit.svc.cluster.local. IN A We do not need or want this search to be done, so avoid it by making each name absolute by appending a period so that the first three DNS queries are skipped for each name. The case for `localhost` is even worse because we expect that `localhost` will always resolve to 127.0.0.1 and/or ::1, but this is not guaranteed if the default search is done: 1. localhost.conduit.svc.cluster.local. IN A 2. localhost.svc.cluster.local. IN A 3. localhost.cluster.local. IN A 4. localhost. IN A Avoid these unnecessary DNS queries by making each name absolute, so that the first three DNS queries are skipped for each name. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-07 09:46:03 -10:00
Kevin Lingerfelt	47fc2eae20	Set -logtostderr flag on controller components (#524 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-03-07 10:18:15 -08:00
Andrew Seigner	a065174688	Disable Grafana update check (#521 ) Grafana by default calls out to grafana.com to check for updates. As user's of Conduit do not have direct control over updating Grafana directly, this update check is not needed. Disable Grafana's update check via grafana.ini. This is also a workaround for #155, root cause of #519. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-03-06 16:14:44 -08:00
Kevin Lingerfelt	d6bd17425a	Add --expected-version flag for conduit check command (#497 ) * Add --expected-version flag for conduit check command Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Update build instructions Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-03-06 11:32:14 -08:00
Dennis Adjei-Baah	5a4c5aa683	Exclude telemetry generated by the control plane when requesting depl… (#493 ) When the conduit proxy is injected into the controller pod, we observe controller pod proxy stats show up as an "outbound" deployment for an unrelated upstream deployment. This may cause confusion when monitoring deployments in the service mesh. This PR filters out this "misleading" stat in the public api whenever the dashboard requests metric information for a specific deployment. * exclude telemetry generated by the control plane when requesting deployment metrics fixes #370 Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2018-03-05 17:58:08 -08:00
Brian Smith	4c9b9c0f68	Install: Don't install buoyantio/kubectl into the prometheus pod. (#509 ) In the initial review for this code (preceding the creation of the runconduit/conduit repository), it was noted that this container is not actually used, so this is actually dead code. Further, this container actualy causes a minor problem, as it doesn't implement any retry logic, thus it will sometimes often cause errors to be logged. See https://github.com/runconduit/conduit/issues/496#issuecomment-370105328. Further, this is a "buoyantio/" branded container. IF we actually need such a container then it should be a Conduit-branded container. See https://github.com/runconduit/conduit/issues/478 for additional context. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-05 08:59:14 -10:00
Igor Zibarev	0f6db6efc0	cli: refactor k8s config to support $KUBECONFIG with multiple paths (#482 ) Kubernetes $KUBECONFIG environment variable is a list of paths to configuration files, but conduit assumes that it is a single path. Changes in this commit introduce a straightforward way to discover and load config file(s). Complete list of changes: - Use k8s.io/client-go/tools/clientcmd to deal with kubernetes configuration file - rename k8s API and k8s proxy constructors to get rid of redundancy - remove shell package as it is not needed anymore Signed-off-by: Igor Zibarev <zibarev.i@gmail.com>	2018-02-28 12:13:09 -08:00
Andrew Seigner	d50c8b4ac8	Add Grafana to conduit install (#444 ) `conduit install` deploys prometheus, but lacks a general-purpose way to visualize that data. This change adds a Grafana container to the `conduit install` command. It includes two sample dashboards, viz and health, in their own respective source files. Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-02-28 11:36:21 -08:00
Andy Hume	1e611c21c6	cli: add check for latest version of cli and control plane (#460 ) As part of `conduit check` command, warn the user if they are running an outdated version of the cli client or the control plane components. Fixes #314 Signed-off-by: Andy Hume <andyhume@gmail.com>	2018-02-27 16:11:38 -08:00
Dennis Adjei-Baah	893bacf8d6	Make prometheus URL in config fully qualified DNS name (#443 ) The telemetry service in the controller pod uses a non-fully qualified URL to connect to the prometheus pod in the control plane. This PR changes the URL the telemetry's prometheus URL to be fully qualified to be consistent with other URLs in the control plane. This change was tested in minikube. The logs report no errors and looking at the prometheus dashboard shows that stats are being recorded from all conduit proxies. fixes #414 Signed-off-by: Dennis Adjei-Baah dennis@buoyant.io	2018-02-26 09:40:31 -08:00
Brian Smith	34cf79a3e6	Add a test of the actual default output of `conduit install`. (#376 ) Refactor `conduit install` test into a data-driven test. Then add a test of the actual default output of `conduit install`. This test is useful to make it clear when we change the default settings of `conduit install`. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-23 13:27:36 -10:00
Brian Smith	86bb65a148	Remove potentially-conflicting `app` labels in control plane (#373 ) The `app` label should be reserved for end-user applications and we shouldn't use it ourselves. We already have a Conduit-specific label that is is prefixed with the `conduit.io/` prefix to avoid naming collisions with users' labels, so just use that one instead. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-23 12:43:55 -10:00
Andrew Seigner	83f9c391bb	Move install template to its own file (#423 ) The template used by `conduit install` was hard-coded in install.go. This change moves the template into its own file, in anticipation of increasing the template's size and complexity. Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-02-23 14:15:31 -08:00
Dennis Adjei-Baah	f66ec6414c	Inject the conduit proxy into controller pod during conduit install (#365 ) In order to take advantage of the benefits the conduit proxy gives to deployments, this PR injects the conduit proxy into the control plane pod. This helps us lay the groundwork for future work such as TLS, control plane observability etc. Fixes #311 Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2018-02-23 13:55:46 -08:00
Kevin Lingerfelt	c579a8fe8d	Improve get/stat/tap help text by way of examples (#401 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-02-20 19:27:42 -08:00
Kevin Lingerfelt	8db7115420	Update go-run to set version equal to root-tag (#393 ) * Update go-run to set version equal to root-tag * Fix inject tests for undefined version change * Pass inject version explitictly as arg Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-02-20 12:25:55 -08:00
Kevin Lingerfelt	f48555d3cc	Remove kubectl dependency, validate k8s server version via api (#396 ) * Remove kubectl dependency, validate k8s server version via api Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Remove unused MockKubectl Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Remame kubectl.go to version.go Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-02-20 12:14:11 -08:00
Dennis Adjei-Baah	9af3783555	Print error message only when invalid YAML file is used with inject command (#389 ) When the `inject` command is used on a YAML file that is invalid, it prints out an invalid YAML file with the injected proxy. This may give a false indication to the user that the inject was successful even though the inject command prints out an error message further down the terminal window. This PR fixes #303 and contains a test input and output file that indicates what should be shown. This PR also fixes #390. Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2018-02-20 11:59:41 -08:00
Kevin Lingerfelt	d1ae4c5bc7	Run conduit dashboard on ephemeral port by default (#394 ) * Run conduit dashboard on ephemeral port by default * Fix wording on dashboard --port flag * log.Debug error instead of discarding it Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-02-20 10:33:47 -08:00
Brian Smith	1489a84316	Refactor `conduit inject` code to eliminate duplicate logic (#383 ) * Refactor `conduit inject` code to eliminate duplicate logic Previously there was a lot of code repeated once for each type of object that has a pod spec. Refactor the code to reduce the amount of duplication there, to make future changes easier. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-19 11:18:44 -10:00
Brian Smith	80aba6c075	CLI: Remove now-unnecessary "enhanced" Kubernetes object types (#382 ) * CLI: Remove now-unnecessary "enhanced" Kubernetes object types The "enhanced" types aren't necessary because now the Kuberentes API implementation has the correct JSON annotations for the InitContainers field. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-19 09:37:25 -10:00
Risha Mars	8bc7c5acde	UI tweaks: sidebar collapse, latency formatting, table row spacing (#361 ) - reduce row spacing on tables to make them more compact - Rename TabbedMetricsTable to MetricsTable since it's not tabbed any more - Format latencies greater than 1000ms as seconds - Make sidebar collapsible - poll the /pods endpoint from the sidebar in order to refresh the list of deployments in the autocomplete - display the conduit namespace in the service mesh details table - Use floats rather than Col for more responsive layout (fixes #224)	2018-02-19 11:21:54 -08:00
Dennis Adjei-Baah	01e694ad71	add check and friendly error if conduit dashboard is not installed (#289 ) * add check and friendly error if conduit dashboard is not installed Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2018-02-19 09:28:56 -08:00
Brian Smith	d8f9c33183	Skip pods with hostNetwork=true in `conduit inject` (#380 ) The init container injected by conduit inject rewrites the iptables configuration for its network namespace. This causes havoc when the network namespace isn't restricted to the pod, i.e. when hostNetwork=true. Skip pods with hostNetwork=true to avoid this problem. Fixes #366. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-18 13:55:42 -10:00
Brian Smith	51873542e5	Refactor `conduit inject` code to make it unit-testable. (#379 ) Refactor `conduit inject` code to make it unit-testable. Refactor the conduit inject code to make it easier to add unit tests. This work was done by @deebo91 in #365. This is the same PR without the conduit install changes, so that it can land ahead of #365. In particular, this will be used for testing the fix for high-priority bug #366. Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io> Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-18 12:33:52 -10:00
Andrew Seigner	797bba6bc6	Upgrade to Prometheus 2.1.0 (#344 ) Conduit has been on Prometheus 1.8.1. Prometheus 2.x promises better performance. Upgrade Conduit to Prometheus 2.1.0 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-02-13 13:22:53 -08:00
Eliza Weisman	458e9d2ac5	Remove per-path metrics from telemetry pipeline (#317 ) Follow-up from #315. Now that the UIs don't report per-path metrics, we can remove the path label from Prometheus, the path aggregation and filtering options from the telemetry API, and the path field from the proxy report API. I've modified the tests to no longer expect the removed fields, and manually verified that Conduit still works after making these changes. Closes #265 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-09 14:20:28 -08:00
Eliza Weisman	6c2ac6125f	Remove per-path metrics from UIs (#315 ) I've removed per-path metrics from the web dashboard and from the `conduit stat` command. Manually validated that these metrics are no longer displayed. Closes #263 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-09 12:35:49 -08:00
Eliza Weisman	2015d992cc	Remove pod-level metrics from web and CLI (#304 ) This PR updates the web UI to remove the pod detail page, and to remove the links to that page from pod names in metrics tables. It also removes the `pods` option from `conduit stat`, and the `sourcePod` and `targetPod` fields from the controller API proto's `MetricMetadata` message. I've updated the `conduit stat` tests to reflect these changes, and manually verified the web UI changes. Closes #261 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-08 19:07:10 -08:00
Jeff Haynie	f721a0f800	Fixed mispelling in conduit inject args (#300 )	2018-02-08 12:48:40 -05:00
Phil Calçado	8041d07e4d	Add --url option to skip browser when opening dashboard (#281 ) Browser opening is a UX nicety, but it shouldn't be mandatory.	2018-02-06 14:07:55 -05:00
Phil Calçado	5628d3c8f4	Add --short and --client to CLI version command (#274 ) These are very useful when writing scripts, e.g. conduit_version=`conduit version --short --client` Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-02-05 17:02:45 -05:00
Alex Leong	b691c2e25b	Rename --version flag in conduit install to --conduit-version (#255 ) This makes the `conduit install` flag match the `conduit inject` flag. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-02-05 10:45:44 -08:00
Andrew Seigner	9a40d984ff	Replace shelling out with kubernetes proxy (#249 ) The conduit dashboard command asychronously shells out and runs "kubectl proxy". This change replaces the shelling out with calls to kubernetes proxy APIs. It also allows us to enable race detection in our go tests, as the shell out code tests did not pass race detection. Fixes #173 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-02-02 10:31:59 -08:00
Andrew Seigner	277c06cf1e	Simplify and refactor k8s labels and annnotations (#227 ) The conduit.io/* k8s labels and annotations we're redundant in some cases, and not flexible enough in others. This change modifies the labels in the following ways: `conduit.io/plane: control` => `conduit.io/controller-component: web` `conduit.io/controller: conduit` => `conduit.io/controller-ns: conduit` `conduit.io/plane: data` => (remove, redundant with `conduit.io/controller-ns`) It also centralizes all k8s labels and annotations into pkg/k8s/labels.go, and adds tests for the install command. Part of #201 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-02-01 14:12:06 -08:00
Kevin Lingerfelt	9ff439ef44	Add -log-level flag for install and inject commands (#239 ) * Add -log-level flag for install and inject commands Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Turn off all CLI logging by default, rename inject and install flags Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Re-enable color logging Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-02-01 12:38:07 -08:00
Kevin Lingerfelt	4a76c6448b	Update cli subcommands to print errors when encountered (#221 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-01-29 11:28:19 -08:00
Kevin Lingerfelt	7399df83f1	Set conduit version to match conduit docker tags (#208 ) * Set conduit version to match conduit docker tags Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Remove --skip-inbound-ports for emojivoto Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Rename git_sha => git_sha_head Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Switch to using the go linker for setting the version Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Log conduit version when go servers start Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Cleanup conduit script Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Add --short flag to head sha command Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Set CONDUIT_VERSION in docker-compose env Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-01-26 11:43:45 -08:00
Phil Calçado	9410da471a	Better error handling for Tap (#177 ) Previously, running `$conduit tap` would return a `Unexpected EOF` error when the server wasn't available. This was due to a few problems with the way we were handling errors all the way down the tap server. This change fixes that and cleans some of the protobuf-over-HTTP code. - first step towards #49 - closes #106	2018-01-25 11:49:38 -05:00
Andrew Seigner	d0a0bb22bd	Move EosCtx to common for Tap and Telemetery (#204 ) * Make Eos optional in TapEvent grpc_status not being set in protobuf is the same as being set to zero, which is also status OK Modify TapEvent to include an optional EOS struct Signed-off-by: Andrew Seigner <siggy@buoyant.io> Part of #198 * Add Eos to proto & proxy tap end-of-stream events The proxy now outputs `Eos` instead of `grpc_status` in all end-of-stream tap events. The EOS value is set to `grpc_status_code` when the response ended with a `grpc_status` trailer, `http_reset_code` when the response ended with a reset, and no `Eos` when the response ended gracefully without a `grpc_status` trailer. This PR updates the proxy. The proto and controller changes are in PR #204. Part of #198. Closes #202 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-24 15:48:00 -08:00
Andrew Seigner	2413086335	cli polish for 0.1.2 release (#176 ) rename conduit status -> conduit check remove 6h and 24 window options from conduit stat remove watch and watch-only from conduit stat Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-01-18 16:33:25 -08:00
Dennis Adjei-Baah	f7af375e73	Remove scheme requirement for api-addr flag in conduit CLI (#126 ) * Allow external controller public api clients that don't rely on a kubeconfig to interact with Conduit CLI Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2018-01-17 17:12:44 -08:00
Kevin Lingerfelt	fd3cfcb5d9	Move healthcheck proto to separate file, use throughout (#150 ) * Move healthcheck proto to separate file, use throughout Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Remove Check message from healthcheck.proto Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Standardize healthcheck protobuf import name Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-01-17 11:15:38 -08:00
Phil Calçado	612bd0f7a0	Add --verbose option to CLI (#154 ) * Use stdout as writer for tap command fixes #136 Signed-off-by: Phil Calcado <phil@buoyant.io> * Add --log-level to command line Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-01-17 12:06:43 -05:00
Phil Calçado	4daa007256	Use stdout as writer for tap command (#152 ) fixes #136 Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-01-17 11:19:22 -05:00
Zhu Guihua	1ba375b71c	fix typo error (#153 ) Signed-off-by: Guihua Zhu <z.zhuguihua@gmail.com>	2018-01-16 11:20:21 -05:00
Phil Calçado	e328db7e87	Adds conduit-api check for status command (#140 ) * Abstract Conduit API client from protobuf interface to add new features Signed-off-by: Phil Calcado <phil@buoyant.io> * Consolidate mock api clients Signed-off-by: Phil Calcado <phil@buoyant.io> * Add simple implementation of healthcheck for conduit api Signed-off-by: Phil Calcado <phil@buoyant.io> * Change NextSteps to FriendlyMessageToUser Signed-off-by: Phil Calcado <phil@buoyant.io> * Add grpc check for status on the client Signed-off-by: Phil Calcado <phil@buoyant.io> * Add simple server-side check for Conduit API Signed-off-by: Phil Calcado <phil@buoyant.io> * Fix feedback from PR Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-01-12 15:35:22 -05:00
Kevin Lingerfelt	1dc1c00a2a	Upgrade k8s.io/client-go to v6.0.0 (#122 ) * Sort imports Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Upgrade k8s.io/client-go to v6.0.0 Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Make k8s store initialization blocking with timeout Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-01-11 10:22:37 -08:00
Phil Calçado	120dbce49d	second iteration of status subcommand: check Kubernetes API #92 (#108 ) Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-01-05 13:32:41 -08:00
Phil Calçado	709de5a7b0	Moves k8s and conduit client code to /pkg (#103 ) * Rename constructor functions from MakeXyz to NewXyz As it is more commonly used in the codebase Signed-off-by: Phil Calcado <phil@buoyant.io> * Make Conduit client depend on KubernetesAPI Signed-off-by: Phil Calcado <phil@buoyant.io> * Move Conduit client and k8s logic to standard go package dir for internal libs Signed-off-by: Phil Calcado <phil@buoyant.io> * Move dependencies to /pkg Signed-off-by: Phil Calcado <phil@buoyant.io> * Make conduit client more testable Signed-off-by: Phil Calcado <phil@buoyant.io> * Remove unused config object Signed-off-by: Phil Calcado <phil@buoyant.io> * Add more test cases for marhsalling Signed-off-by: Phil Calcado <phil@buoyant.io> * Move client back to controller Signed-off-by: Phil Calcado <phil@buoyant.io> * Sort imports Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-01-04 10:10:10 -08:00
Andrew Seigner	7edea726e6	Add shell completion command to cli (#97 ) Cobra supports bash and zsh completion code generation, but our cli project was not leveraging it. Add a 'completion' command to the conduit cli, which outputs bash and zsh completion code. Signed-off-by: Andrew Seigner <andrew@sig.gy>	2017-12-28 14:24:06 -08:00
Phil Calçado	c76b705fce	first iteration of status subcommand: check Kubectl #92 (#96 ) * Add framework for healthcheck in CLI Signed-off-by: Phil Calcado <phil@buoyant.io> * Add self-checked for kubectl Signed-off-by: Phil Calcado <phil@buoyant.io> * Clear formatting code Signed-off-by: Phil Calcado <phil@buoyant.io> * Removed ununsed objects from status Signed-off-by: Phil Calcado <phil@buoyant.io> * Removed ununsed parameter Signed-off-by: Phil Calcado <phil@buoyant.io> * Ignore errored self checkers Signed-off-by: Phil Calcado <phil@buoyant.io> * Make the check error by default Signed-off-by: Phil Calcado <phil@buoyant.io> * Log error, format changes Signed-off-by: Phil Calcado <phil@buoyant.io>	2017-12-28 14:03:18 -05:00
Phil Calçado	31e9846f62	Make several CLI commands testable (#86 ) * Add func to rsolve kubectl-like names to canonical names Signed-off-by: Phil Calcado <phil@buoyant.io> * Refactor API instantiation Signed-off-by: Phil Calcado <phil@buoyant.io> * Make version command testable Signed-off-by: Phil Calcado <phil@buoyant.io> * Make get command testable Signed-off-by: Phil Calcado <phil@buoyant.io> * Add tests for api utils Signed-off-by: Phil Calcado <phil@buoyant.io> * Make stat command testable Signed-off-by: Phil Calcado <phil@buoyant.io> * Make tap command testablë Signed-off-by: Phil Calcado <phil@buoyant.io>	2017-12-27 14:10:41 -05:00
Christopher Schmidt	bcf61e1bfb	adding RBAC stuff for Prometheus (#83 ) * adding RBAC stuff for Prometheus, fixes runconduit/conduit#82 Signed-off-by: Christopher Schmidt fakod666@gmail.com * fixed ident Signed-off-by: Christopher Schmidt fakod666@gmail.com	2017-12-22 13:45:56 -08:00
Kevin Lingerfelt	a8e75115ab	Prepare the repo for the v0.1.1 release (#75 ) * Prepare the repo for the v0.1.1 release * Add changelog * Changelog updates, wrap at 100 characters	2017-12-20 10:51:53 -08:00
Kevin Lingerfelt	9fca7e8b36	Modify inject to stop passing malformed arguments (#74 )	2017-12-19 22:03:56 -08:00
Dennis Adjei-Baah	63e2fbb97b	Refactor each CLI command to silently exit on uncaught errors (#54 ) * add silent exits on all conduit commands * Revert "add silent exits on all conduit commands" This reverts commit 07488ca * adds back a change that was accidentally removed on revert * Stop printing errors to stderr	2017-12-19 17:24:38 -08:00
Phil Calçado	0a6a9edaee	Respect $KUBECONFIG env var (#68 ) * Move kubectl logis to k8s package * Made kubectl return url.URL, just like API Make k8s API code respect /Users/pcalcado/.kube/config (closes #17) * Fix style mistakes and typos	2017-12-20 11:50:25 +11:00
Kevin Lingerfelt	42e9a94e45	Make tap output line-oriented (#66 ) * Make tap output line-oriented * Use grpc response code constants, add tests	2017-12-19 15:58:46 -08:00
Phil Calçado	683b5c0dd6	Fix unit test runner and failing tests (#67 ) * Remove integration test clause from travis file * Use correct channel for asyn process error reporting * Fix test for sorting stat keys * Make integration tests only run if CONDUIT_INTEGRATION_TESTS_ENABLED is set * Fix timeouts in tests * Fix handler test check for version * Removes smoke test that required kubectl as not present on travis * Replace tail with sleep to avoid leaking subprocesses during tests * Fix typo & extract constant	2017-12-19 14:21:56 -08:00
Brian Smith	208723b44d	conduit inject: Enable auto name completion in proxy (#60 ) Previously `conduit inject` did not enable automatic name completion in the proxy. As a result services couldn't connect to services outside the "default" namespace without qualifying the service name with (at least) the namespace. This is arguably safer but it isn't compatible with the way things work in Kubernetes when the proxy isn't used. Enable name auto-completion in the proxy so that the proxy will add the current pod's namespace to any unqualified service name. This depends on the feature being added to the proxy (PR #59). Due to some issues with how zones are dealt with in the project, the zone component isn't provided; it turns out that it doesn't matter whether we provide the zone in the current implementation. Dealing with the zone better will be added later. Validated by deploying the emojivoto service with its configuration updated to use unqualified names (`sed "s/\\.emojivoto//g"`). Before this change this modified configuration would fail; now it succeeds. Fixes #9.	2017-12-19 12:07:54 -10:00
Phil Calçado	6f9e8be2ee	Check kubectl versions before attempting dashboard (#53 ) * Introduce objects to manage kubectl and shell * Make dashboard command use new shell & kubectl objects * Make compatible with version numbers like v1.9.0-beta.1 * Add version check for kubectl * Refactor error to use proper method from fmt pa ckage * Make channel and error handling more idiomatic and safe * Make version require 1.8.0	2017-12-19 11:41:15 +11:00
Alex Leong	772b43fefa	Add inject flag for skipping outbound ports (#38 ) * Add inject flag for skipping outbound ports * Fix usage of proxy-init ignore flags (closes #541)	2017-12-19 11:17:11 +11:00
Brian Smith	d025bf4c0f	conduit inject: Configure proxy to log at "info" level (#58 ) Previously `conduit inject` was configuring the proxy to log a lot of detail, most of which is probably shouldn't be relevant to Conduit users. Configure the proxy to log at the "info" level instead for the proxy itself, and the "warn" level for internal components of the proxy. Validated by manually doing a `conduit inject`, triggering some traffic, and inspecting the logs. Fixes #57	2017-12-18 08:37:27 -10:00
Alex Leong	8a7579ef4a	Add RBAC support (#40 ) Adds a ServiceAccount, ClusterRole, and ClusterRoleBinding to the conduit install output that allows the Conduit controller read access to the k8s API. I have tested this on RBAC-enabled minikube and minikube without RBAC.	2017-12-14 16:57:52 -08:00
Dennis Adjei-Baah	922b41c0fa	Remove namespace property from the Conduit web app (#10 ) * Remove namespace property from ServiceMesh.jsx and Deployments.jsx and related Go code	2017-12-08 15:38:04 -08:00
Kevin Lingerfelt	2f114e69fa	Add support for path stats in cli and web api (#13 ) * Add support for path stats in cli and web api The cli stat command supports grouping by pod and deployment. With this change, it will also support grouping by path, in order to facilitate a summary stats per individual endpoint. * Right-align numeric columns in stat output	2017-12-08 12:24:39 -08:00
Oliver Gould	bff3efea3f	Prepare for v0.1.0 (#1 ) Update versions in code. Use default docker tag of v0.1.0	2017-12-04 19:55:56 -08:00
Oliver Gould	b104bd0676	Introducing Conduit, the ultralight service mesh We’ve built Conduit from the ground up to be the fastest, lightest, simplest, and most secure service mesh in the world. It features an incredibly fast and safe data plane written in Rust, a simple yet powerful control plane written in Go, and a design that’s focused on performance, security, and usability. Most importantly, Conduit incorporates the many lessons we’ve learned from over 18 months of production service mesh experience with Linkerd. This repository contains a few tightly-related components: - `proxy` -- an HTTP/2 proxy written in Rust; - `controller` -- a control plane written in Go with gRPC; - `web` -- a UI written in React, served by Go.	2017-12-05 00:24:55 +00:00

... 5 6 7 8 9 ...

613 Commits