linkerd2

Commit Graph

Author	SHA1	Message	Date
Andrew Seigner	8384f1eb56	Ensure shared tooltips in Linkerd Health dashboard (#2324 ) All Grafana graphs use shared tooltips (display all series in the tooltip rather than the one currently moused-over), except for 3 graphs in the Linkerd Health dashboard. This change ensures all tooltips are shared. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-19 15:55:36 -08:00
Risha Mars	ee18a7fe31	Modify the grafana variable queries to use a tcp-based metric (#2272 ) Currently, we use request_total for the variable query to determine the names in the grafana dropdowns. We should use a non-http-based metric instead, so that if there is only TCP traffic, the dropdowns will still be populated. This branch uses process_start_time_seconds instead of the http-based request_total to query for grafana variables	2019-02-19 13:46:02 -08:00
Andrew Seigner	1df1683b6a	Instrument k8s clients (#2243 ) The control-plane's clients, specifically the Kubernetes clients, did not provide telemetry information. Introduce a `prometheus.ClientWithTelemetry` wrapper to instrument arbitrary clients. Apply this wrapper to Kubernetes clients. Fixes #2183 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-18 09:10:02 -08:00
Kevin Lingerfelt	a27bb2e0ce	Proxy grafana requests through web service (#2039 ) * Proxy grafana requests through web service * Fix -grafana-addr default, clarify -api-addr flag * Fix version check in grafana dashboards * Fix comment typo Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-04 16:07:57 -08:00
Kevin Lingerfelt	37ae423bb3	Add linkerd- prefix to all objects in linkerd install (#1920 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-04 15:41:47 -08:00
Oliver Gould	747fd328e9	grafana: Show TCP closes by errno (#1839 ) linkerd/linkerd2-proxy#116 removes the `classification` label for the `tcp_close_total` metric because TCP sockets that close with an error do not actually indicate any sort of failure -- many graceful shutdown situations can still cause a socket error. This change uses the `errno` label to enumerate tcp_close_total metrics.	2018-11-02 10:20:11 -07:00
Kevin Lingerfelt	12b10e27c1	Update version checks to support release channels (#1667 ) * Update version checks to support release channels * Update based on review feedback * Fix sidebar tests * Update CI config for edge and stable tags Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-17 17:13:50 -07:00
Andrew Seigner	b708378d07	Add version check to Grafana dashboard (#1638 ) * Add version check to Grafana dashboard The web dashboard checks the local Linkerd version against the latest release, and informs the user if an update is available. Grafana was not doing this. Modify the Grafana dashboard to perform a version check, and prompt the user to update if needed. Fixes #1607 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-13 15:28:44 -07:00
Kevin Lingerfelt	4845b4ec04	Restore linkerd.io/control-plane* labels (#1411 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-08-07 13:53:29 -07:00
Franziska von der Goltz	c7ac072acc	update grafana dashboards: conduit to linkerd (#1320 ) * update grafana dashboards to remove conduit reference and replace with linkerd instances * update test install fixtures to reflect changes Fixes: #1315 Signed-off-by: Franziska von der Goltz <franziska@vdgoltz.eu>	2018-07-16 13:05:01 -07:00
Kevin Lingerfelt	e5cce1abaf	Rename CLI from conduit to linkerd (#1312 ) * Rename CLI binary * Update integration tests for new binary name * Rename --conduit-namespace flag, change default ns * Rename occurrences of conduit in rest of CLI * Rename inject and install components * Remove conduit occurrences in docker files * Additional miscellaneous cleanup * Move protobuf definitions to linkerd2 package * Rename conduit.io labels to use linkerd.io * Rename conduit-managed segment to linkerd-managed * Fix conduit references in web project Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-07-12 17:14:07 -07:00
Andrew Seigner	e70d62dc9f	Introduce Proxy process telemetry in Grafana (#1199 ) PR #1128 introduced new proxy process stats. Introduce Grafana graphs that expose these new proxy process stats. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-06-27 00:58:28 +01:00
Risha Mars	fdb0b7f63f	Grafana: remove fill and stack from individual resource breakouts (#1092 ) Remove the filling and stacking in request rate graphs that combine resources, to make it easier to spot outliers. * Grafana: remove fill and stack from individual resource breakouts * Remove all the stacks and fills from request rates everywhere	2018-06-18 10:14:39 -07:00
Risha Mars	b930bc6b88	Fix conduit health grafana dashboard (#1086 ) * Scope health queries to controller namespace * Add a prometheus query variable to get the conduit namespace	2018-06-08 12:57:05 -07:00
Andrew Seigner	95f9f8dc35	Add meshed label support to Grafana (#1021 ) The Grafana dashboards currently show Request Volume by ns/deploy/pod. Add a `meshed` dimension to the Request Volume graphs, in anticipation of the `meshed`/`secured` label from the proxy. Also increase `irate` time window queries from `20s` to `30s`, per recommendation from Prometheus team. Relates to #388. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-25 14:10:57 -07:00
Andrew Seigner	1275b1ae89	Introduce Grafana, K8s, and Prom dashboards (#904 ) Grafana provides default dashboards for Prometheus and Grafana health. The community also provides Kubernetes-specific dashboards. Conduit was not taking advantage of these. Introduce new Grafana dashboards focused on Grafana, Kubernetes, and Prometheus health. Tag all Conduit dashboards for easier UI navigation. Also fix layout in Conduit Health dashboard. Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-08 23:11:43 +02:00
Eliza Weisman	d55e334a42	Add TCP stats to deployment dashboards (#824 ) This PR adds the TCP metrics added in #785 and #790 to the Grafana deployment dashboards. I've added three new charts in the "Inbound Traffic" and "Outbound Traffic" headings: + "TCP Connection Failures": plots the number of failed TCP connections over time + "TCP Connections Open": shows the number of accepted and opened connections currently open + "TCP Connection Duration": a heatmap of connection durations over time I'm planning on adding similar graphs to other dashboards as well in subsequent PRs.	2018-04-25 16:26:43 -07:00
Andrew Seigner	c9cdd838dc	Standardize and polish Grafana for 0.4.0 release (#766 ) The top-line, deployments, and health Grafana dashboards had inconsistent layouts and data. This change standardizes our Grafana dashboards. Every row is composed of Success Rate, Request Rate, and Latency. Part of #420. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-13 18:01:44 -07:00
Andrew Seigner	9508e11b45	Build conduit-specific Grafana Docker image (#679 ) Using a vanilla Grafana Docker image as part of `conduit install` avoided maintaining a conduit-specific Grafana Docker image, but made packaging dashboard json files cumbersome. Roll our own Grafana Docker image, that includes conduit-specific dashboard json files. This significantly decreases the `conduit install` output size, and enables dashboard integration in the docker-compose environment. Fixes #567 Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-05 14:20:05 -07:00

19 Commits