linkerd2

Commit Graph

Author	SHA1	Message	Date
Eliza Weisman	8bc497a057	Remove unused metrics (#322 ) Removed the `method` label from Prometheus, and removed HTTP methods from reports. Removed `StreamSummary` from reports and replaced it with a `u32` count of streams. Closes #266 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-09 17:14:17 -08:00
Eliza Weisman	458e9d2ac5	Remove per-path metrics from telemetry pipeline (#317 ) Follow-up from #315. Now that the UIs don't report per-path metrics, we can remove the path label from Prometheus, the path aggregation and filtering options from the telemetry API, and the path field from the proxy report API. I've modified the tests to no longer expect the removed fields, and manually verified that Conduit still works after making these changes. Closes #265 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-02-09 14:20:28 -08:00
Eliza Weisman	915f08ac4c	Store proxy latencies in a structure that matches controller histogram (#11 ) The proxy currently stores latency values in an `OrderMap` and reports every observed latency value to the controller's telemetry API since the last report. The telemetry API then sends each individual value to Prometheus. This doesn't scale well when there are a large number of proxies making reports. I've modified the proxy to use a fixed-size histogram that matches the histogram buckets in Prometheus. Each report now includes an array indicating the histogram bounds, and each response scope contains a set of counts corresponding to each index in the bounds array, indicating the number of times a latency in that bucket was observed. The controller then reports the upper bound of each bucket to Prometheus, and can use the proxy's reported set of bucket bounds so that the observed values will be correct even if the bounds in the control plane are changed independently of those set in the proxy. I've also modified `simulate-proxy` to generate the new report structure, and added tests in the proxy's telemetry test suite validating the new behaviour.	2018-02-07 18:02:59 -08:00
Oliver Gould	a2d537f5c4	Use a load-aware balancer (#251 ) Currently, the conduit proxy uses a simplistic Round-Robin load balancing algorithm. This strategy degrades severely when individual endpoints exhibit abnormally high latency. This change improves this situation somewhat by making the load balancer aware of the number of outstanding requests to each endpoint. When nodes exhibit high latency, they should tend to have more pending requests than faster nodes; and the Power-of-Two-Choices node selector can be used to distribute requests to lesser-loaded instances. From the finagle guide: The algorithm randomly picks two nodes from the set of ready endpoints and selects the least loaded of the two. By repeatedly using this strategy, we can expect a manageable upper bound on the maximum load of any server. The maximum load variance between any two servers is bound by ln(ln(n))` where `n` is the number of servers in the cluster. Signed-off-by: Oliver Gould <ver@buoyant.io>	2018-02-07 09:39:31 -08:00
Oliver Gould	6a0936e699	Remove proxy/Dockerfile-deps (#279 ) The current proxy Dockerfile configuration does not cache dependencies well, which can increase build times substantially. By carefully splitting proxy/Dockerfile into several stages that mock parts of the project, dependencies may be built and cached in Docker such that changes to the proxy only require building the conduit-proxy crate. Furthermore, proxy/Dockerfile now runs the proxy's tests before producing an artifact, unless the ` PROXY_SKIP_TESTS` build-arg is set and not-empty. The `PROXY_UNOPTIMIZED` build-arg has been added to support quicker, debug-friendly builds.	2018-02-06 13:01:38 -08:00
Oliver Gould	e2093e37f8	Move the Rust gRPC bindings to a dedicated crate (#275 ) The proxy depends on `protoc`-generated gRPC bindings to communicate with the controller. In order to generate these bindings, build-time dependencies must be compiled. In order to support a more granular, cacheable build scheme, a new crate has been created to house these gRPC bindings, `conduit-proxy-controller-grpc`. Because `TryFrom` and `TryInto` conversions are implemented for protobuf-defined types, the `convert` module also had to be moved to into a dedicated crate. Furthermore, because the proxy's tests require that `quickcheck::Aribtrary` be implemented for protobuf types, the `conduit-proxy-controller-grpc` crate supports an _arbitrary_ feature fla protobuf types, the `conduit-proxy-controller-grpc` crate supports an _arbitrary_ feature flag. While we're moving these libraries around, the `tower-router` crate has been moved to `proxy/router` and renamed to `conduit-proxy-router.` `futures-mpsc-lossy` has been moved into the proxy directory but has not been renamed. Finally, the `proxy/Dockerfile-deps` image has been updated to avoid the wasteful building of dependency artifacts, as they are not actually used by `proxy/Dockerfile`.	2018-02-06 10:31:48 -08:00
Andrew Seigner	277c06cf1e	Simplify and refactor k8s labels and annnotations (#227 ) The conduit.io/* k8s labels and annotations we're redundant in some cases, and not flexible enough in others. This change modifies the labels in the following ways: `conduit.io/plane: control` => `conduit.io/controller-component: web` `conduit.io/controller: conduit` => `conduit.io/controller-ns: conduit` `conduit.io/plane: data` => (remove, redundant with `conduit.io/controller-ns`) It also centralizes all k8s labels and annotations into pkg/k8s/labels.go, and adds tests for the install command. Part of #201 Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-02-01 14:12:06 -08:00
Eliza Weisman	eddc37de28	Adopt external tower-grpc and tower-h2 deps #225 ) The conduit repo includes several library projects that have since been moved into external repos, including `tower-grpc` and `tower-h2`. This change removes these vendored libraries in favor of using the new external crates.	2018-02-01 11:57:02 -08:00
Dennis Adjei-Baah	01312f9ffe	Prepare for v0.2.0 release (#248 ) * prepare for v0.2.0 release Signed-off-by: Dennis Adjei-Baah <dennis@buoyant.io>	2018-01-31 15:39:48 -08:00
Sean McArthur	9720a32de7	proxy: fix tcp_with_no_orig_dst test (#229 ) Sometimes, the try_read will return a connection error, sometimes it will just return EOF. Handle both cases. Closes #226	2018-01-29 15:15:06 -08:00
Sean McArthur	b861318e86	proxy: fix h1 streams to trigger response end events Response End events were only triggered after polling the trailers of a response, but when the Response is given to a hyper h1 server, it doesn't know about trailers, so they were never polled! The fix is that the `BodyStream` glue will now poll the wrapped body for trailers after it sees the end of the data, before telling hyper the stream is over. This ensures a ResponseEnd event is emitted. Includes a proxy telemetry test over h1 connections.	2018-01-25 16:36:16 -08:00
Andrew Seigner	aa17e37ab5	Add docker deps validation to ci (#207 ) If docker image tags were out of date, ci would not fail until the docker-deploy stage (master merge). Modify ci to validate tags as part of the default ci run. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-01-25 12:41:02 -08:00
Andrew Seigner	4e2eb18f1d	Use cargo frozen flag in build scripts (#206 ) The cargo commands in our docker and ci scripts were at risk for modifying Cargo.lock and cache. Using cargo's --frozen flag (and --locked during fetch) ensures our build is consistent with what's defined across Cargo.toml, Cargo.lock, and cached build artifacts. Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-01-25 11:08:15 -08:00
Andrew Seigner	d0a0bb22bd	Move EosCtx to common for Tap and Telemetery (#204 ) * Make Eos optional in TapEvent grpc_status not being set in protobuf is the same as being set to zero, which is also status OK Modify TapEvent to include an optional EOS struct Signed-off-by: Andrew Seigner <siggy@buoyant.io> Part of #198 * Add Eos to proto & proxy tap end-of-stream events The proxy now outputs `Eos` instead of `grpc_status` in all end-of-stream tap events. The EOS value is set to `grpc_status_code` when the response ended with a `grpc_status` trailer, `http_reset_code` when the response ended with a reset, and no `Eos` when the response ended gracefully without a `grpc_status` trailer. This PR updates the proxy. The proto and controller changes are in PR #204. Part of #198. Closes #202 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-24 15:48:00 -08:00
Sean McArthur	54aef56e25	proxy: add transparent protocol detection and handling The proxy will now try to detect what protocol new connections are using, and route them accordingly. Specifically: - HTTP/2 stays the same. - HTTP/1 is now accepted, and will try to send an HTTP/1 request to the target. - If neither HTTP/1 nor 2, assume a TCP stream and simply forward between the source and destination. * tower-h2: fix Server Clone bounds * proxy: implement Async{Read,Write} extra methods for Connection Closes #130 Closes #131	2018-01-23 16:14:07 -08:00
Andrew Seigner	06c9894c31	Updates for v0.1.3 release (#185 ) Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-01-19 13:58:52 -08:00
Andrew Seigner	e6f17faf28	Updates for v0.1.2 release (#171 ) Signed-off-by: Andrew Seigner <siggy@buoyant.io>	2018-01-19 10:56:20 -08:00
Oliver Gould	008f53865b	Make proxy-deps multi-stage to remove the original source files (#161 ) Previously, proxy-deps and go-deps included the source tree for local projects. This can cause build conflicts when files are renamed. By adopting a multi-stage build for the proxy-deps image, we can be sure that we only preserve essential dependencies & manifests in the proxy-deps and go-deps images. Furthermore, `bin/update-go-deps-shas` and `bin/update-proxy-deps-shas` have been added to ease maintenance when files are changed. Fixes #159 Signed-off-by: Oliver Gould <ver@buoyant.io>	2018-01-17 12:26:22 -08:00
Eliza Weisman	2b20a8bb10	Use cargo:rerun-if-changed to avoid recompiling protos (#160 ) As @seanmonstar noticed, the build script will currently re-compile all the protobufs regardless of whether or not they have changed, making the build much slower. This PR modifies it to emit `cargo:rerun-if-changed=` for all the protobuf files, so they will only be regenerated if one of them changes. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-17 09:23:27 -08:00
Eliza Weisman	63d1a5d70d	Add Protocol field to Transports telemetry (#138 ) See #132. This PR adds a protocol field to the ClientTransport and ServerTransport messages, and modifies the proxy to report a value for this field (currently, it's only ever HTTP). Currently, HTTP/1 and HTTP/2 are collapsed into one Protocol variant, see #132 (comment). I expect that we can treat H1 as a subset of H2 as far as metrics goes. Note that after discussing it with @klingerf, I learned that the control plane telemetry API currently does not do anything with the ClientTransport and ServerTransport messages, so beyond regenerating the protobuf-generated code, no controller changes were actually necessary. As we actually add metrics to TCP transports, we'll want to make some additions to the telemetry API to ingest these metrics. If any metrics are shared between HTTP and raw TCP transports (say, bytes sent), we'll want to differentiate between them in Prometheus. All the metrics that the control plane currently ingests from telemetry reports are likely to be HTTP-specific (requests, responses, response latencies), or at least, do not apply to raw TCP. Actually adding metrics to raw TCP transports will probably have to wait until there are raw TCP transports implemented in the proxy... Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-11 16:00:38 -08:00
clemensw	b1831cd415	[proxy] Fix rendering for top-level rustdoc (#113 ) Signed-off-by: clemensw <clemensw@users.noreply.github.com>	2018-01-08 15:40:12 -08:00
Andrew Seigner	caeb83a526	Fix Go and Proxy dependency image SHAs (#117 ) The image tags for gcr.io/runconduit/go-deps and gcr.io/runconduit/proxy-deps were not updating to account for all changes in those images. Modify SHA generation to include all files that affect the base dependency images. Also add instructions to README.md for updating hard-coded SHAs in Dockerfile's. Fixes #115 Signed-off-by: Andrew Seigner <andrew@sig.gy>	2018-01-08 11:19:49 -08:00
Eliza Weisman	90b5ddfb00	Change Cargo.lock to trigger deps image rebuild (#116 ) Because whether or not to build a new deps image is based on the SHA of Cargo.lock, changes to the deps Dockerfile will not cause a new deps image to be built. Because of this, the current proxy deps Docker image is based on the wrong Rust version, breaking the build. See #115 for details on this issue. I've appended a newline to Cargo.lock to change the lockfile's SHA and trigger a rebuild of the deps Docker image on CI. I've also added a comment in the Dockerfile noting that it is necessary to do this when changing that file. Signed-off-by: Eliza Weisman eliza@buoyant.io	2018-01-08 10:29:51 -08:00
Eliza Weisman	2bd9f99f6b	Require Rust 1.23 in proxy Dockerfile-deps (#105 ) After merging #104, Conduit will not build against pre-1.23 Rust versions. This PR updates the Dockerfile to require this version. This should fix the build on master. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-05 13:54:38 -08:00
Eliza Weisman	67d4b56253	Remove `AsciiExt` import (#104 ) Since the methods on this trait were moved to direct implementations on the implementing types, this produces an unused import warning with the latest (1.23) Rust standard library. As we set `deny(warnings)`, this breaks the build. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-01-04 10:49:13 -08:00
Brian Smith	02176d8d16	Remove default controller URL from proxy. (#48 ) Previously there was a default controller URL in the proxy. This default was never used for any proxy injected by `conduit inject` and it was the wrong default when using the proxy outside of Kubernetes. Also more generally this is such an important setting in terms of correctness and security that it was dangerous to let it be implied in any context. Remove the default, requiring that it be set in order for the proxy to start.	2018-01-02 08:44:27 -10:00
Sky Ao	238c54414b	correct typo: Enviroment -> Environment (#100 ) Signed-off-by: Sky Ao <aoxiaojian@gmail.com>	2017-12-29 10:14:48 -08:00
Sean McArthur	8e2dd66bfa	disable push promises in proxy (#70 )	2017-12-21 14:41:17 -08:00
Kevin Lingerfelt	a8e75115ab	Prepare the repo for the v0.1.1 release (#75 ) * Prepare the repo for the v0.1.1 release * Add changelog * Changelog updates, wrap at 100 characters	2017-12-20 10:51:53 -08:00
Brian Smith	8385a7a8c1	Proxy: Map unqualified/partially-qualified names to FQDN (#59 ) * Proxy: Map unqualified/partially-qualified names to FQDN Previously we required the service to fully qualify all service names for outbound traffic. Many services are written assuming that Kubernetes will complete names using its DNS search path, and those services weren't working with Conduit. Now add an option, used by default, to fully-qualify the domain names. Currently only Kubernetes-like name completion for services is supported, but the configuration syntax is open-ended to allow for alternatives in the future. Also, the auto-completion can be disabled for applications that prefer to ensure they're always using unambiguous names. Once routing is implemented then it is likely that (default) routing rules will replace these hard-coded rules. Unit tests for the name completion logic are included. Part of the solution for #9. The changes to `conduit inject` to actually use this facility will be in another PR.	2017-12-19 11:59:26 -10:00
Brian Smith	4fa4891694	Use `connection::Connection` for outbound connections (#51 ) Previously `connection::Connection` was only being used for inbound connections, not outbound connections. This led to some duplicate logic and also made it difficult to adapt that code to enable TLS. Now outbound connections use `connection::Connection` too. This will allow the upcoming TLS logic to guarantee that `TCP_NODELAY` is enabled at the right time, and the TLS logic also control access to the underlying plaintext socket for security reasons.	2017-12-15 12:44:25 -10:00
Brian Smith	40d50f0f4a	Encapsulate listening port connection acceptance logic (#46 ) Previously every use of `BoundPort` repeated a bunch of logic. Move the repeated logic to `BoundPort` itself. Just remove the no-op handshaking logic; new handshaking logic will be added to `BoundPort` when TLS is added.	2017-12-14 13:19:05 -10:00
Brian Smith	81fb0fea5f	Move default private connect timeout to `Config` (#42 ) Previously the default value of this setting was in lib.rs instead of being automatically set in `Config` like all the other defaults, which was inconsistent and confusing. Fix this by moving the defaulting logic to `Config`. Validated by running the test suite.	2017-12-13 21:15:21 -06:00
Brian Smith	0c2aa0e185	Centralize and clarify TCP port binding (#43 ) Previously the logic related to listening for incoming TCP connections was duplicated in several places. Begin centralizing this logic. Future commits will centralize it further. No validation was done other than running the test suite.	2017-12-13 19:45:15 -06:00
Brian Smith	0185522821	Proxy: Parse environment variables in one place (#26 ) Previously `Process` did its own environment variable parsing and did not benefit from the improved error handling that `config` now has. Additionally, future changes will need access to these same environment variables in other parts of the proxy. Move `Process`'s environment variable parsing to `config` to address both of these issues. Now there are no uses of `env::var` outside of `config` except for logging, which is the final desired state. I validated this manually.	2017-12-13 19:33:37 -06:00
Brian Smith	559f4a76fb	Proxy: Use production config parsing in tests (#25 ) * Proxy: Use production config parsing in tests Previosuly the testing code for the proxy was sensitive to the values of environment variables unintentionally, because `Config` looked at the environment variables. Also, the tests were largely avoiding testing the production configuration parsing code since they were doing their own parsing. Now the tests avoid looking at environment variables other than `ENV_LOG`, which makes them more resilient. Also the tests now parse the settings using the same code as production use uses. I validated this manually.	2017-12-13 19:27:50 -06:00
Brian Smith	0ebc20c013	Proxy: Parse all environment variables before aborting (#24 ) Previously, as soon as we would encounter one environment variable with an invalid value we would exit. This is frustrating behavior when deploying to Kubernetes and there are multiple problems because the edit-compile-test cycle is so slow. Fix this by parsing all the environment variables and logging error messages before exiting. I validated this manually.	2017-12-13 18:56:14 -06:00
Eliza Weisman	2fdb859dff	Add timeout to in-flight telemetry reports (#12 ) This PR adds a configurable timeout duration after which in-flight telemetry reports are dropped, cancelling the corresponding RPC request to the control plane. I've also made the `Timeout` implementation used in `TimeoutConnect` generic, and reused it in multiple places, including the timeout for in-flight reports. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2017-12-13 15:07:36 -08:00
Brian Smith	e29a02d63b	Proxy: Improve error reporting for invalid environment variables (#23 ) * Proxy: Improve error reporting for invalid environment variables Previously when an environment variable had an invalid value the process would exit with an error that did not mention which environment variable is invalid. Start fixing this by routing environment variable parsing through functions that always know the name of the environment variable when they report errors. I validated this change manually. * Proxy: Improve configuration URL parsing Previously there was a bit of duplicated logic between parsing `Addr` and `HostAndPort` values. Factor out the common logic. In the process, improve the error reporting in the cases where parsing fails.	2017-12-08 12:32:43 -06:00
Oliver Gould	bff3efea3f	Prepare for v0.1.0 (#1 ) Update versions in code. Use default docker tag of v0.1.0	2017-12-04 19:55:56 -08:00
Oliver Gould	980f85963d	apply rustffmt on proxy, remove rustfmt.toml for now	2017-12-05 00:44:16 +00:00
Oliver Gould	b104bd0676	Introducing Conduit, the ultralight service mesh We’ve built Conduit from the ground up to be the fastest, lightest, simplest, and most secure service mesh in the world. It features an incredibly fast and safe data plane written in Rust, a simple yet powerful control plane written in Go, and a design that’s focused on performance, security, and usability. Most importantly, Conduit incorporates the many lessons we’ve learned from over 18 months of production service mesh experience with Linkerd. This repository contains a few tightly-related components: - `proxy` -- an HTTP/2 proxy written in Rust; - `controller` -- a control plane written in Go with gRPC; - `web` -- a UI written in React, served by Go.	2017-12-05 00:24:55 +00:00

42 Commits