linkerd2

Commit Graph

Author	SHA1	Message	Date
mmiller1	c39b698525	Use a globing operator in top-line metrics dashboard for the "all" value (#4057 ) * use custom all values for top line dashboard * convert remaining allValue params to wildcard glob Signed-off-by: Matt Miller <mamiller@rosettastone.com>	2020-03-10 10:08:48 -07:00
Ivan Sim	5e51208b5d	Increase the Grafana dashboards refresh interval (#3464 ) Signed-off-by: Ivan Sim <ivan@buoyant.io>	2019-09-23 14:47:59 -07:00
Andrew Seigner	48a69cb88a	Bump Prometheus to 2.11.1, Grafana to 6.2.5 (#3123 ) - set `disable_sanitize_html` in `grafana.ini`. - make all text box dimensions whole integers to fix dropdown issue, reported in: https://github.com/linkerd/linkerd2/issues/2955#issuecomment-503085444 - rev all dashboards to `schemaVersion` 18 for Grafana 6.2.5 - `prometheus-benchmark.json` based on: https://grafana.com/grafana/dashboards/9761 - `prometheus.json` based on: `69c93e6401/public/app/plugins/datasource/prometheus/dashboards/prometheus_2_stats.json` - `grafana.json` based on: `85aed0276e/public/app/plugins/datasource/prometheus/dashboards/grafana_stats.json` Fixes #2955 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-07-23 13:37:56 -07:00
Risha Mars	ee18a7fe31	Modify the grafana variable queries to use a tcp-based metric (#2272 ) Currently, we use request_total for the variable query to determine the names in the grafana dropdowns. We should use a non-http-based metric instead, so that if there is only TCP traffic, the dropdowns will still be populated. This branch uses process_start_time_seconds instead of the http-based request_total to query for grafana variables	2019-02-19 13:46:02 -08:00
Andrew Seigner	1df1683b6a	Instrument k8s clients (#2243 ) The control-plane's clients, specifically the Kubernetes clients, did not provide telemetry information. Introduce a `prometheus.ClientWithTelemetry` wrapper to instrument arbitrary clients. Apply this wrapper to Kubernetes clients. Fixes #2183 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2019-02-18 09:10:02 -08:00
Kevin Lingerfelt	a27bb2e0ce	Proxy grafana requests through web service (#2039 ) * Proxy grafana requests through web service * Fix -grafana-addr default, clarify -api-addr flag * Fix version check in grafana dashboards * Fix comment typo Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2019-01-04 16:07:57 -08:00
Kevin Lingerfelt	37ae423bb3	Add linkerd- prefix to all objects in linkerd install (#1920 ) Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-12-04 15:41:47 -08:00
Kevin Lingerfelt	12b10e27c1	Update version checks to support release channels (#1667 ) * Update version checks to support release channels * Update based on review feedback * Fix sidebar tests * Update CI config for edge and stable tags Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-09-17 17:13:50 -07:00
Andrew Seigner	b708378d07	Add version check to Grafana dashboard (#1638 ) * Add version check to Grafana dashboard The web dashboard checks the local Linkerd version against the latest release, and informs the user if an update is available. Grafana was not doing this. Modify the Grafana dashboard to perform a version check, and prompt the user to update if needed. Fixes #1607 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-09-13 15:28:44 -07:00
Franziska von der Goltz	c7ac072acc	update grafana dashboards: conduit to linkerd (#1320 ) * update grafana dashboards to remove conduit reference and replace with linkerd instances * update test install fixtures to reflect changes Fixes: #1315 Signed-off-by: Franziska von der Goltz <franziska@vdgoltz.eu>	2018-07-16 13:05:01 -07:00
Risha Mars	fdb0b7f63f	Grafana: remove fill and stack from individual resource breakouts (#1092 ) Remove the filling and stacking in request rate graphs that combine resources, to make it easier to spot outliers. * Grafana: remove fill and stack from individual resource breakouts * Remove all the stacks and fills from request rates everywhere	2018-06-18 10:14:39 -07:00
Risha Mars	53b713b2a8	Remove the ⚠️ emoji from non-tlsed grafana stat labels (#1089 )	2018-06-08 15:00:56 -07:00
Kevin Lingerfelt	ec2433e9bd	Update controller to use 'tls' metric label (#1044 ) * Update controller to use 'tls' metric label * Fix meshed column formatter Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-06-01 16:44:33 -07:00
Andrew Seigner	95f9f8dc35	Add meshed label support to Grafana (#1021 ) The Grafana dashboards currently show Request Volume by ns/deploy/pod. Add a `meshed` dimension to the Request Volume graphs, in anticipation of the `meshed`/`secured` label from the proxy. Also increase `irate` time window queries from `20s` to `30s`, per recommendation from Prometheus team. Relates to #388. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-25 14:10:57 -07:00
Andrew Seigner	6fccdee58e	Stop special-casing conduit controller in Grafana (#984 ) The Grafana dashboards were explicitly filtering out Conduit control-plane data. Remove control-plane filtering from Grafana dashboards. This brings Grafana in-line with web, and also encourages better dog-fooding of our proxy metrics and dashboards. Also update Grafana to 5.1.3, update the BUILD.md architecture diagram to include Promethues and Grafana, and introduce a Prometheus Benchmark dashboard, courtesy of Robust Perception. Fixes #908 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-23 13:47:20 -07:00
Andrew Seigner	1275b1ae89	Introduce Grafana, K8s, and Prom dashboards (#904 ) Grafana provides default dashboards for Prometheus and Grafana health. The community also provides Kubernetes-specific dashboards. Conduit was not taking advantage of these. Introduce new Grafana dashboards focused on Grafana, Kubernetes, and Prometheus health. Tag all Conduit dashboards for easier UI navigation. Also fix layout in Conduit Health dashboard. Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-05-08 23:11:43 +02:00
Andrew Seigner	5a5c6d14ab	Update Grafana to 5.1.0, handle missing data (#876 ) Conduit 0.4.1 contained some rough edges in the Grafana deployment. This PR include the following: - bump Grafana to 5.1.0 - fix deployment and rc graphs when no data present - fix some text sections overlapping due to scrolling Fixes #705 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-29 22:24:22 +02:00
Andrew Seigner	326d9f493c	Fix top-line Grafana counts (#815 ) The top-line single stat numbers were not calculated properly, resulting in inflated counts. Modify the underlying Prometheus queries to ensure accurate counts of Deployments, Pods, and Namespaces. Fixes #801. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-19 18:01:06 -07:00
Andrew Seigner	c9cdd838dc	Standardize and polish Grafana for 0.4.0 release (#766 ) The top-line, deployments, and health Grafana dashboards had inconsistent layouts and data. This change standardizes our Grafana dashboards. Every row is composed of Success Rate, Request Rate, and Latency. Part of #420. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-13 18:01:44 -07:00
Andrew Seigner	b6bcdcc059	Namespace-aware Grafana dashboards (#716 ) The Grafana dashboards key off of deployment, but had no awareness of namespaces, causing incorrect metrics aggregation and display. This change makes the Grafana dashboards key off of namespaces, and also modifies the Grafana links in the Conduit dashboard to link to namespace+deployment. Fixes #704 Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-06 15:37:53 -07:00
Andrew Seigner	9508e11b45	Build conduit-specific Grafana Docker image (#679 ) Using a vanilla Grafana Docker image as part of `conduit install` avoided maintaining a conduit-specific Grafana Docker image, but made packaging dashboard json files cumbersome. Roll our own Grafana Docker image, that includes conduit-specific dashboard json files. This significantly decreases the `conduit install` output size, and enables dashboard integration in the docker-compose environment. Fixes #567 Part of #420 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-04-05 14:20:05 -07:00

21 Commits