linkerd2

Commit Graph

Author	SHA1	Message	Date
Alena Varkockova	28f662c9c6	Introduce resource selector and deprecate namespace field for ListPods (#2025 ) * Introduce resource selector and deprecate namespace field for ListPods * Changes from code review * Properly deprecate the field * Do not check for nil * Fix the mockProm usage * Protoc changes revert * Changed from code review Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2019-01-23 10:35:55 -08:00
Alex Leong	a562f8b9fd	Improve routes command to list all routes (#2066 ) Fixes #1875 This change improves the `linkerd routes` command in a number of important ways: * The restriction on the type of the `--to` argument is lifted and any resource type can now be used. Try `--to ns/books`, `--to po/webapp-ABCDEF`, `--to au/linkerd.io`, or even `--to svc`. * All routes for the target will now be populated in the table, even if there are no Prometheus metrics for that route. * [UNKNOWN] has been renamed to [DEFAULT] * The `Service/Authority` column will now list `Service` in all cases except for when an authority target is explicitly requested. ``` $ linkerd routes deploy/traffic --to deploy/webapp ROUTE SERVICE SUCCESS RPS LATENCY_P50 LATENCY_P95 LATENCY_P99 GET / webapp 100.00% 0.5rps 50ms 180ms 196ms GET /authors/{id} webapp 100.00% 0.5rps 100ms 900ms 980ms GET /books/{id} webapp 100.00% 0.9rps 38ms 93ms 99ms POST /authors webapp 100.00% 0.5rps 35ms 48ms 50ms POST /authors/{id}/delete webapp 100.00% 0.5rps 83ms 180ms 196ms POST /authors/{id}/edit webapp 0.00% 0.0rps 0ms 0ms 0ms POST /books webapp 45.16% 2.1rps 75ms 425ms 485ms POST /books/{id}/delete webapp 100.00% 0.5rps 30ms 90ms 98ms POST /books/{id}/edit webapp 56.00% 0.8rps 92ms 875ms 975ms [DEFAULT] webapp 0.00% 0.0rps 0ms 0ms 0ms ``` This is all made possible by a shift in the way we handle the destination resource. When we get a request with a `ToResource`, we use the k8s API to find all Services which include at least one pod belonging to that resource. We then fetch all service profiles for those services and display the routes from those serivce profiles. This shift in thinking also precipitates a change in the TopRoutes API where we no longer need special cases for `ToAll` (which can be specified by `--to au`) or `ToAuthority` (which can be specified by `--to au/<authority>`) and instead can use a `ToResource` to handle all cases. Signed-off-by: Alex Leong <alex@buoyant.io>	2019-01-16 17:15:35 -08:00
Alex Leong	771542dde2	Add support for retries (#2038 )	2019-01-16 14:13:48 -08:00
Alejandro Pedraza	8c67bfbcc6	Add parameter to stats API to skip retrieving Prometheus stats (#1871 ) * Add parameter to stats API to skip retrieving Prometheus stats Used by the dashboard to populate list of resources. Fixes #1022 Signed-off-by: Alejandro Pedraza <alejandro.pedraza@gmail.com> * Prometheus queries check results were being ignored * Refactor verifyPromQueries() to also test when no prometheus queries should be generated * Add test for SkipStats=true Includes adding ability to public.GenStatSummaryResponse to not generate basicStats * Fix previous test	2018-12-10 16:48:12 -08:00
Alex Leong	380ec52a39	Rework routes command to accept any resource (#1921 ) We rework the routes command so that it can accept any Kubernetes resource, making it act much more similarly to the stat command. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-12-05 11:11:34 -08:00
Oliver Gould	926395f616	tap: Include route labels in tap events (#1902 ) This change alters the controller's Tap service to include route labels when translating tap events, modifies the public API to include route metadata in responses, and modifies the tap CLI command to include rt_ labels in tap output (when -o wide is used).	2018-12-03 13:52:47 -08:00
Risha Mars	f8583df4db	Add ListServices to controller public api (#1876 ) Add a barebones ListServices endpoint, in support of autocomplete for services. As we develop service profiles, this endpoint could probably be used to describe more aspects of services (like, if there were some way to check whether a service profile was enabled or not). Accessible from the web UI via http://localhost:8084/api/services	2018-11-27 11:34:47 -08:00
Alex Leong	7a7f6b6ecb	Add TopRoutes method the the public api and route CLI command to consume it (#1860 ) Add a routes command which displays per-route stats for services that have service profiles defined. This change has three parts: * A new public-api RPC called `TopRoutes` which serves per-route stat data about a service * An implementation of TopRoutes in the public-api service. This implementation reads per-route data from Prometheus. This is very similar to how the StatSummaries RPC and much of the code was able to be refactored and shared. * A new CLI command called `routes` which displays the per-route data in a tabular or json format. This is very similar to the `stat` command and much of the code was able to be refactored and shared. Note that as of the currently targeted proxy version, only outbound route stats are supported so the `--from` flag must be included in order to see data. This restriction will be lifted in an upcoming change once we add support for inbound route stats as well. Signed-off-by: Alex Leong <alex@buoyant.io>	2018-11-19 12:20:30 -08:00
Alena Varkockova	5a853e8990	Use ListPods always for data plane HC (#1701 ) * Use ListPods always for data plane HC * Missing changes in grpc_server.go * Address review comments * Read proxy version from spec Signed-off-by: Alena Varkockova <varkockova.a@gmail.com>	2018-10-02 11:45:01 -07:00
Kevin Lingerfelt	f884caf56d	Upgrade protobuf to v1.2.0 (#1591 ) * Upgrade protobuf to v1.2.0 * Fix Gopkg.lock * Switch linkerd2-proxy-api dep back to stable Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-06 11:36:29 -07:00
Kevin Lingerfelt	e5cce1abaf	Rename CLI from conduit to linkerd (#1312 ) * Rename CLI binary * Update integration tests for new binary name * Rename --conduit-namespace flag, change default ns * Rename occurrences of conduit in rest of CLI * Rename inject and install components * Remove conduit occurrences in docker files * Additional miscellaneous cleanup * Move protobuf definitions to linkerd2 package * Rename conduit.io labels to use linkerd.io * Rename conduit-managed segment to linkerd-managed * Fix conduit references in web project Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-12 17:14:07 -07:00
Oliver Gould	941cad4a9c	Migrate build infrastructure to linkerd2 (#1298 ) This PR begins to migrate Conduit to Linkerd2: * The proxy has been completely removed from this repo, and is now located at github.com/linkerd/linkerd2-proxy. * A `Dockerfile-proxy` has been added to fetch the most-recently published proxy binary from build.l5d.io. * Proxy-specific protobuf bindings have been moved to github.com/linkerd/linkerd2-proxy-api. * All docker images now use the gcr.io/linkerd-io registry. * `inject` now uses `LINKERD2_PROXY_` environment variables * Go paths have been updated to reflect the new (future) repo location.	2018-07-09 15:38:38 -07:00
Kevin Lingerfelt	693acdbf26	Update ListPods endpoint to return all pod owner types (#1275 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-05 15:14:16 -07:00
Risha Mars	ba2e13c731	Small tweaks to error modal, add Reason to api error response (#1246 ) - Add Reason to the error data passed from the api - Rewrite error logic in the UI to try to make it clearer - Show 0/0 pods meshed instead of 0/0 pods meshed (N/A) if 0 pods are meshed	2018-07-03 17:14:27 -07:00
Risha Mars	5ed7fc563c	Add controller component pod uptimes to the ServiceMesh page (#1205 ) - Return pod uptimes from the GetPods endpoint - Adds filtering by namespace to api.GetPods - Adds a --namespace filter to conduit get pods - Adds pod uptimes to the controller component toolitps on the ServiceMesh page - Moves the ServiceMesh page back to using /api/pods	2018-06-28 15:42:00 -07:00
Risha Mars	68586fe697	Add the ability to query stats by authority (#1181 ) Adds the ability to query by a new non-kubernetes resource type, "authorities", in the StatSummary api. This includes an extensive refactor of stat_summary.go to deal with non-kubernetes resource types. - Add documentation to Resource in the public api so we can use it for authority - Handle non-k8s resource requests in the StatSummary endpoint - Rewrite stat summary fetching and parsing to handle non-k8s resources - keys stat summary metric handling by Resource instead of a generated string - Adds authority to the CLI - Adds /authorities to the Web UI - Adds some more stat integration and unit tests	2018-06-28 14:31:44 -07:00
Risha Mars	e2c2f19d2c	Propagate errors in conduit containers to the api (#1117 ) - It would be nice to display container errors in the UI. This PR gets the pod's container statuses and returns them in the public api - Also add a terminationMessagePolicy to conduit's inject so that we can capture the proxy's error messages if it terminates	2018-06-14 16:22:31 -07:00
Oliver Gould	2a4f38b9e7	proto: Use explicit `go_package` option (#1120 ) protobuf has a `go_package` option that can be used to explicitly name Go packages such that they can be imported without additional rewrites. This allows us to store proto files without additional, redundant directories (which were used for packaging hints, previously). This change adds an explicit `go_package` to all .proto files and updates `bin/protoc-go.sh` to ensure these packages are output into $GOPATH (so that the go_package can be absolute). This removes the need to manually rewrite imports in bin/protoc-go.sh.	2018-06-14 14:03:00 -07:00
Kevin Lingerfelt	ec2433e9bd	Update controller to use 'tls' metric label (#1044 ) * Update controller to use 'tls' metric label * Fix meshed column formatter Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-01 16:44:33 -07:00
Risha Mars	ffabdefc6c	Add queries to prometheus to determine number of fully meshed requests (#983 ) - Update the `response_total` prometheus query of the StatSummary endpoint to also break queries out by a `meshed` label. - Add a 'Secured' column to the web UI/CLI stat displays, which indicate the percentage of traffic starting and ending in the mesh This meshed label is used in the CLI/Web UI to display a column of the percentage of traffic that starts/ends in the mesh. (Which is a proxy indicator for whether that traffic is 'secured' when we add TLS by default for intra mesh requests). The `meshed` label is not yet added anywhere, so until it is supplied by the proxy, all traffic will show up as 0% secured in the web/CLI.	2018-05-24 11:05:09 -07:00
Risha Mars	f94856e489	Modify the Stat endpoint to also return the number of failed conduit pods (#895 ) * Modify the Stat endpoint to also return the count of failed pods * Add comments explaining pod count stats * Rename total pod count to running pod count This is to support the service mesh overview page, as I'd like to include an indicator of failed pods there.	2018-05-08 10:35:21 -07:00
Andrew Seigner	dce31b888f	Deprecate Tap, rename TapByResource to Tap (#844 ) The `conduit tap` command is now deprecated. Replace `conduit tap` with `connduit tapByResource`. Rename tapByResource to tap. The underlying protobuf for tap remains, the tap gRPC endpoint now returns Unimplemented. Fixes #804 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-25 12:24:46 -07:00
Oliver Gould	06dd8d90ee	Introduce the TapByResource API (#778 ) This changes the public api to have a new rpc type, `TapByResource`. This api supersedes the Tap api. `TapByResource` is richer, more closely reflecting the proxy's capabilities. The proxy's Tap api is extended to select over destination labels, corresponding with those returned by the Destination api. Now both `Tap` and `TapByResource`'s responses may include destination labels. This change avoids breaking backwards compatibility by: * introducing the new `TapByResource` rpc type, opting not to change Tap * extending the proxy's Match type with a new, optional, `destination_label` field. * `TapEvent` is extended with a new, optional, `destination_meta`.	2018-04-18 15:37:07 -07:00
Andrew Seigner	727521f914	Permit arbitrary time windows in public-api (#774 ) The public-api previously only permitted 4 hard-coded time windows: 10s, 1m, 10m, 1h. This was primarily a relic of the recently removed telemetry system. Modify the public-api to validate the time string, but allow for any window size, which is then passed through to Prometheus. Fixes #686 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-16 17:37:17 -07:00
Andrew Seigner	77fb6d3709	Add namespace as a resource type in public-api (#760 ) * Add namespace as a resource type in public-api The cli and public-api only supported deployments as a resource type. This change adds support for namespace as a resource type in the cli and public-api. This also change includes: - cli statsummary now prints `-`'s when objects are not in the mesh - cli statsummary prints `No resources found.` when applicable - removed `out-` from cli statsummary flags, and analagous proto changes - switched public-api to use native prometheus label types - misc error handling and logging fixes Part of #627 Signed-off-by: Andrew Seigner <siggy@buoyant.io> * Refactor filter and groupby label formulation Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Rename stat_summary.go to stat.go in cli Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Update rbac privileges for namespace stats Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-13 16:53:01 -07:00
Kevin Lingerfelt	fb15fe7c1a	Remove the telemetry service (#757 ) * Remove the telemetry service The telemetry service is no longer needed, now that prometheus scrapes metrics directly from proxies, and the public-api talks directly to prometheus. In this branch I'm removing the service itself as well as all of the telemetry protobuf, and updating the conduit install command to no longer install the service. I'm also removing the old version of the stat command, which required the telemetry service, and renaming the statsummary command to stat. * Fix time window tests * Remove deprecated controller scrape config Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-13 11:21:29 -07:00
Andrew Seigner	259fdcd134	Add latency stats in new stat summary endpoint (#737 ) The new StatSummary endpoint was only providing request volume and successs rate information. Add support for retrieving latency stats via StatSummary. Also make all prometheus calls in parallel, and implement kubernetes test fixtures. Fixes #681 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-11 11:58:32 -07:00
Risha Mars	2f5b5ea5f2	Start implementing conduit stat summary endpoint (#671 ) Start implementing new conduit stat summary endpoint. Changes the public-api to call prometheus directly instead of the telemetry service. Wired through to `api/stat` on the web server, as well as `conduit statsummary` on the CLI. Works for deployments only. Current implementation just retrieves requests and mesh/total pod count (so latency stats are always 0). Uses API defined in #663 Example queries the stat endpoint will eventually satisfy in #627 This branch includes commits from @klingerf * run ./bin/dep ensure * run ./bin/update-go-deps-shas	2018-04-05 17:05:06 -07:00
Risha Mars	d1a39ea6bf	Define a new telemetry Stat API (#663 ) * Define a new telemetry Stat API Proposal definition for a new Stat API, for the purposes of satisfying the queries proposed in #627. StatSummary will replace Stat once implemented and the original Stat deleted.	2018-04-03 14:45:58 -07:00
Brian Smith	7dc21f9588	Add the NoEndpoints message to the Destination API (#564 ) Have the controller tell the client whether the service exists, not just what are available. This way we can implement fallback logic to alternate service discovery mechanisms for ambigious names. Signed-off-by: Brian Smith <brian@briansmith.org> Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-03-27 10:45:41 -10:00
Eliza Weisman	458e9d2ac5	Remove per-path metrics from telemetry pipeline (#317 ) Follow-up from #315. Now that the UIs don't report per-path metrics, we can remove the path label from Prometheus, the path aggregation and filtering options from the telemetry API, and the path field from the proxy report API. I've modified the tests to no longer expect the removed fields, and manually verified that Conduit still works after making these changes. Closes #265 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-09 14:20:28 -08:00
Eliza Weisman	2015d992cc	Remove pod-level metrics from web and CLI (#304 ) This PR updates the web UI to remove the pod detail page, and to remove the links to that page from pod names in metrics tables. It also removes the `pods` option from `conduit stat`, and the `sourcePod` and `targetPod` fields from the controller API proto's `MetricMetadata` message. I've updated the `conduit stat` tests to reflect these changes, and manually verified the web UI changes. Closes #261 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-08 19:07:10 -08:00
Kevin Lingerfelt	fd3cfcb5d9	Move healthcheck proto to separate file, use throughout (#150 ) * Move healthcheck proto to separate file, use throughout Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Remove Check message from healthcheck.proto Signed-off-by: Kevin Lingerfelt <kl@buoyant.io> * Standardize healthcheck protobuf import name Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-01-17 11:15:38 -08:00
Phil Calçado	e328db7e87	Adds conduit-api check for status command (#140 ) * Abstract Conduit API client from protobuf interface to add new features Signed-off-by: Phil Calcado <phil@buoyant.io> * Consolidate mock api clients Signed-off-by: Phil Calcado <phil@buoyant.io> * Add simple implementation of healthcheck for conduit api Signed-off-by: Phil Calcado <phil@buoyant.io> * Change NextSteps to FriendlyMessageToUser Signed-off-by: Phil Calcado <phil@buoyant.io> * Add grpc check for status on the client Signed-off-by: Phil Calcado <phil@buoyant.io> * Add simple server-side check for Conduit API Signed-off-by: Phil Calcado <phil@buoyant.io> * Fix feedback from PR Signed-off-by: Phil Calcado <phil@buoyant.io>	2018-01-12 15:35:22 -05:00
Eliza Weisman	63d1a5d70d	Add Protocol field to Transports telemetry (#138 ) See #132. This PR adds a protocol field to the ClientTransport and ServerTransport messages, and modifies the proxy to report a value for this field (currently, it's only ever HTTP). Currently, HTTP/1 and HTTP/2 are collapsed into one Protocol variant, see #132 (comment). I expect that we can treat H1 as a subset of H2 as far as metrics goes. Note that after discussing it with @klingerf, I learned that the control plane telemetry API currently does not do anything with the ClientTransport and ServerTransport messages, so beyond regenerating the protobuf-generated code, no controller changes were actually necessary. As we actually add metrics to TCP transports, we'll want to make some additions to the telemetry API to ingest these metrics. If any metrics are shared between HTTP and raw TCP transports (say, bytes sent), we'll want to differentiate between them in Prometheus. All the metrics that the control plane currently ingests from telemetry reports are likely to be HTTP-specific (requests, responses, response latencies), or at least, do not apply to raw TCP. Actually adding metrics to raw TCP transports will probably have to wait until there are raw TCP transports implemented in the proxy... Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-11 16:00:38 -08:00
Phil Calçado	0a6a9edaee	Respect $KUBECONFIG env var (#68 ) * Move kubectl logis to k8s package * Made kubectl return url.URL, just like API Make k8s API code respect /Users/pcalcado/.kube/config (closes #17) * Fix style mistakes and typos	2017-12-20 11:50:25 +11:00
Kevin Lingerfelt	2f114e69fa	Add support for path stats in cli and web api (#13 ) * Add support for path stats in cli and web api The cli stat command supports grouping by pod and deployment. With this change, it will also support grouping by path, in order to facilitate a summary stats per individual endpoint. * Right-align numeric columns in stat output	2017-12-08 12:24:39 -08:00
Kevin Lingerfelt	906d4e8b69	Fix public-api error marshaling and unmarshaling (#16 )	2017-12-08 11:03:55 -08:00
Oliver Gould	b104bd0676	Introducing Conduit, the ultralight service mesh We’ve built Conduit from the ground up to be the fastest, lightest, simplest, and most secure service mesh in the world. It features an incredibly fast and safe data plane written in Rust, a simple yet powerful control plane written in Go, and a design that’s focused on performance, security, and usability. Most importantly, Conduit incorporates the many lessons we’ve learned from over 18 months of production service mesh experience with Linkerd. This repository contains a few tightly-related components: - `proxy` -- an HTTP/2 proxy written in Rust; - `controller` -- a control plane written in Go with gRPC; - `web` -- a UI written in React, served by Go.	2017-12-05 00:24:55 +00:00

39 Commits