linkerd2-proxy

Commit Graph

Author	SHA1	Message	Date
Oliver Gould	e8e163a2e9	proxy: Convert `convert` from crate to module (#1115 ) In e2093e3, we created a `convert` crate when refactoring the proxy's gRPC bindings into a dedicated crate. It's not really necessary to handle `convert` as a crate, given that it holds a single 39-line file that's mostly comments. It's possible to "vendor" this file in the proxy, and controller-grpc crate doesn't even need this trait (in fact, the proxy probably doesn't either).	2018-06-13 16:18:51 -07:00
Sean McArthur	6c7173075c	proxy: update to released hyper 0.12 (#1069 )	2018-06-05 17:05:10 -07:00
Eliza Weisman	be9486c239	proto: Add TLS identity to WeightedAddr message (#1041 ) Required for #1008. This PR adds the `TlsIdentity` message to the Destination service proto, to describe what strategy the proxy should use for verifying an endpoint's TLS certificates. It also adds a `TlsIdentity` field to the `WeightedAddr` message. Currently, there is one possible variant for `TlsIdentity`, `KubernetesPodName`, which consists of the Kubernetes pod name of the endpoint, the namespace of the endpoint, and the namespace of that pod's Conduit control plane. The proxy should attempt to connect over TLS if the control plane namespace matches its own control plane namespace. The pod name and namespace are used to verify the endpoint's TLS certificate. See https://github.com/runconduit/conduit/issues/386#issuecomment-392948046. This change was initially part of #1008, but I factored it out to make the diff smaller. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-05-31 11:48:25 -07:00
Brian Smith	79a38327d2	Abstract I/O interface into a trait. (#1020 ) * Rename so_original_dst.rs to addr_info.rs. Prepare for expanding the functionality of this module by renaming it. Signed-off-by: Brian Smith <brian@briansmith.org> * Abstract I/O interface into a trait. Instead of pattern matching over an `Io` variant, use a `Box<Io>` to abstract the I/O interface. This will make it easier to add a TLS transport. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-05-26 10:04:31 -10:00
Eliza Weisman	1b1623dd83	proxy: Upgrade Conduit to use the new version of Tokio (#944 ) Closes #888. Closes #867. This branch upgrades Conduit to use the new Tokio API. It was also necessary to upgrade some other dependencies (including `hyper`, and `trust-dns`) alongside this upgrade. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-05-17 16:38:15 -07:00
Sean McArthur	d9fd091411	proxy: rebind services on connect errors (#952 ) Instead of having connect errors destroy all buffered requests, this changes Bind to return a service that can rebind itself when there is a connect error. It won't try to establish the new connection itself, but waits for the buffer to poll again. Combing this with changes in tower-buffer to remove canceled requests from the buffer should mean that we won't loop on connect errors for forever. Signed-off-by: Sean McArthur <sean@seanmonstar.com>	2018-05-17 14:15:16 -07:00
Eliza Weisman	281281f5bc	proxy: Make `outbound_updates_newer_services` test forward-compatible (#939 ) This is in preparation for landing the Tokio upgrade. The test `discovery::outbound_updates_newer_services` currently contains an assertion that an HTTP/2 request to an HTTP/1 service will return a response with status code 500. This is because the current version of Hyper on which Conduit depends does not support protocol upgrades. However, commit hyperium/hyper@bc6af88a32, which adds support for this kind of protocol upgrade, was recently merged to Hyper's master branch. Therefore, this assertion will no longer be correct once we depend on the upcoming Hyper release. When we migrate to the new Tokio, it will be necessary to upgrade our Hyper dependency as well, and this test will fail. I've modified the test to no longer make assertions about the response's status code, so that it's compatible with both the current and future Hyper versions. If the response is not `Ok`, the test will still fail, since `tests::support::Client::request()` `expect`s that the response is successful, but the status code is ignored. I've added a comment in the test explaining this. Eventually, when the master version of Conduit depends on the latest Hyper, we may want to change this test to assert that the status code is 200 instead. We may also want to add more tests for Hyper's protocol upgrade functionality, but that seems out of scope for this PR. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-05-11 14:36:03 -07:00
Oliver Gould	2392b3df2d	proxy: Parse units with duration configurations (#909 ) Configuration values that take durations are currently specified as time values with no units. So `600` may mean 600ms in some contexts and 10 minutes in others. In order to avoid this problem, this change now requires that configurations provide explicit units for time values such as '600ms' or 10 minutes'. Fixes #27.	2018-05-08 13:54:12 -07:00
Oliver Gould	bdc19d926c	proxy: Upgrade tower dependencies (#892 ) In order to pick up https://github.com/tower-rs/tower-grpc/pull/60, upgrade tower dependencies. This will reduce the cost of updating for upcoming tower-h2 improvements.	2018-05-02 13:40:55 -07:00
Sean McArthur	2e296dbf69	proxy: wrap connections in Transport sensor before peeking (#851 ) In case there are any errors while peeking the connection to do protocol detection, the sensors will now be in place to detect them. Besides just errors, this will also allow reporting about connections that are accepted, but then immediately closed. Additionally: - add write_buf implementation for Transport sensor, can help performance for http1/http2 - add better logs for tcp connections errors - add printlns for when tests fail Signed-off-by: Sean McArthur <sean@seanmonstar.com>	2018-04-27 14:18:23 -07:00
Oliver Gould	219872bab8	Introduce the `peer` label to transport metrics (#848 ) Previously, the proxy exposed separate _accept_ and _connect_ metrics for some metric types, but not for all. This leads to confusing aggregations, particularly for read and write taotals. This change primarily introduces the `peer` prometheus label (with possible values _src_ or _dst_) to indicate which side of the proxy the metric reflects. Additionally, the `received_bytes` and `sent_bytes` metrics have been renamed as `tcp_read_bytes_total` and `tcp_write_bytes_total`, resectively. This more naturally fits into existing idioms. Stream classification is not applied to these metrics, as we plan to increment them throughout stream lifetime and not only on close. The `tcp_connections_open` metric has also been renamed to `tcp_open_connections` to reflect Prometheus idioms. Finally, `msg1` and `msg2` have been constified in telemetry test fixtures so that tests are somewhat easier to read.	2018-04-25 14:06:33 -07:00
Eliza Weisman	337eeb5a17	Fix assertions in metrics_compression test (#847 ) Fixes #846 The proxy `metrics_compression` test contained an assertion that a compressed scrape contained the `request_duration_ms_count` metric. This was chosen completely arbitrarily, and was only intended as an assertion that metrics were updated between compressed scrapes. Unfortunately, that metric was removed in d9112abc933035ba48eabc1e9e5a81b4da0e367f, so when #665 merged to master, this test broke. CI didn't catch this since we don't build merges for PRs --- we should probably (re)enable this in Travis? This PR fixes the test to assert on a metric that wasn't removed. Sorry for the ❌s!	2018-04-25 11:02:52 -07:00
Eliza Weisman	514074129e	Add optional GZIP compression to proxy /metrics endpoint (#665 ) Closes #598. According to the Prometheus documentation, metrics export endpoints should support serving metrics compressed using GZIP. I've modified the proxy's `/metrics` endpoint to serve metrics compressed with GZIP when an `Accept-Encoding: gzip` request header is sent. I've also added a new unit test that attempts to get the proxy's metrics endpoint as GZIP, and asserts that the metrics are decompressed successfully. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-04-24 17:42:50 -07:00
Eliza Weisman	ea72f774a8	proxy: Add tcp_connections_open gauge (#791 ) Depends on #785. This PR adds the `tcp_connections_open` gauge to the proxy's TCP metrics. It also adds some tests for that metric.	2018-04-24 10:17:48 -07:00
Sean McArthur	60823456b1	proxy tests: reduce some boilerplate, improve error information (#833 ) The `controller` part of the proxy will now use a default, removing the need to pass the exact same `controller::new().run()` in every test case. The TCP server and client will include their socket addresses in some panics. Signed-off-by: Sean McArthur <sean@seanmonstar.com>	2018-04-23 18:01:51 -07:00
Eliza Weisman	3511801b1c	proxy: remove unused metrics (#826 ) This PR removes the unused `request_duration_ms` and `response_duration_ms` histogram metrics from the proxy. It also removes them from the `simulate-proxy` script's output, and from `docs/proxy-metrics.md` Closes #821	2018-04-23 16:05:20 -07:00
Eliza Weisman	d27782b9d0	Ignore flaky metrics tests on CI (#832 ) Fixes #831. Proxy metrics tests `transport::inbound_tcp_accept` and `transport::inbound_tcp_duration` are known to be flaky and should be ignored on CI. Note that the outbound versions of these tests were already marked as flaky, so this was almost certainly either an oversight or the result of an incorrect merge. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-04-23 14:29:34 -07:00
Eliza Weisman	a379cc079c	proxy: Unbreak process_start_time_seconds metric (#825 ) The refactoring of how metrics are formatted in 674ce87588bfe27ee64b5601cfe5b8e3e548dd34 inadvertently introduced a bug that caused the `process_start_time_seconds` metric to be formatted as just a number without the metric name. This causes Prometheus to fail with a parse error rather than accepting the metrics. I've fixed this issue, and added a unit test to detect regressions in the future.	2018-04-20 15:59:08 -07:00
Eliza Weisman	f4fd8ce98e	proxy: Add classifications to TCP close stats (#790 ) This PR adds a `classification` label to transport level metrics collected on transport close. Like the `classification` label on HTTP response metrics, the value may be either `"success"` or `"failure"`. The label value is determined based on the `clean` field on the `TransportClose` event, which indicates whether a transport closed cleanly or due to an error. I've updated the tests for transport-level metrics to reflect the addition of the new label. I'd like to also modify the test support code to allow us to close transports with errors, in order to test that the errors are correctly classified as failures.	2018-04-19 19:01:48 -07:00
Eliza Weisman	8368bb711e	proxy: Add transport-level metrics (#785 ) This branch adds all the transport-level Prometheus metrics as described in #742, with the exception of the `tcp_connections_open` gauge (to be added in a subsequent branch). A brief description of the metrics added in this branch: * `tcp_accept_open_total`: counter of the number of connections accepted by the proxy * `tcp_accept_close_total`: counter of the number of accepted connections that have closed * `tcp_connect_open_total`: counter of the number of connections opened by the proxy * `tcp_connect_close_total`: counter of the number of connections opened by the proxy that have been closed. * `tcp_connection_duration_ms`: histogram of the total duration of each TCP connection (incremented on connection close) * `sent_bytes`: counter of the total number of bytes sent on TCP connections (incremented on connection close) * `received_bytes`: counter of the total number of bytes received on TCP connections (incremented on connection close) These metrics are labeled with the direction (inbound or outbound) and whether the connection was proxied as raw TCP or corresponds to an HTTP request. Additionally, I've added several proxy tests for these metrics. Note that there are some cases which are currently untested; in particular, while there are tests for the `tcp_accept_close_total` counter, it's more difficult to test the `tcp_connect_close_total` counter, due to connection pooling. I'd like to improve the tests for this code in additional branches.	2018-04-19 17:27:43 -07:00
Oliver Gould	25b1e48b0b	proxy: Rewrite mock controller to accept a stream of dst updates (#808 ) Currently, the mock controller, which is used in tests, takes all of its updates a priori, which makes it hard to control when an update occurs within a test. Now, the controller exposes a `DstSender`, which wraps an unbounded channel of destination updates. This allows tests to trigger updates at a specific point in the test. In order to accomplish this, the controller's hand-rolled gRPC server implementation has been discarded in favor of a real gRPC destination service. This requires that the `controller-grpc` project now builds both clients and servers for the destination service. Additionally, we now build a tap client as well (assuming that we'll want to write tests against our tap server).	2018-04-19 11:01:10 -07:00
Eliza Weisman	9198e7fd9b	Factor out reused test fixtures from telemetry tests (#782 ) This is a fairly minor refactor to the proxy telemetry tests. b07b554d2bdb4b92a1feeed22a79bd71e87856eb added a `Fixture` in the Destination service labeling tests added in #661 to reduce the repetition of copied and pasted code in those tests. I've refactored most of the other telemetry tests to also use the test fixture. Significantly less code is copied and pasted now. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-04-17 14:15:56 -07:00
Sean McArthur	3e2d782d19	proxy: clean up some logs and a few warnings in proxy tests (#780 ) Signed-off-by: Sean McArthur <sean@seanmonstar.com>	2018-04-17 12:53:20 -07:00
Oliver Gould	26750ce41f	Stop pushing telemetry reports from the proxy (#616 ) Now that the controller does not depend on pushed telemetry reports, the proxy need not depend on the telemetry API or maintain legacy sampling logic.	2018-04-12 17:39:29 -07:00
Eliza Weisman	f637c9cb9d	Ignore flaky telemetry tests on CI (#752 ) The tests for label metadata updates from the control plane are flaky on CI. This is likely due to the CI containers not having enough cores to execute the test proxy thread, the test proxy's controller client thread, the mock controller thread, and the test server thread simultaneously --- see #751 for more information. For now, I'm ignoring these on CI. Eventually, I'd like to change the mock controller code in test support so that we can trigger it to send a second metadata update only after the request has finished. I think this issue also makes merging #738 a higher priority, so that we can still have some tests running on CI that exercise some part of the label update behaviour.	2018-04-12 14:59:17 -07:00
Eliza Weisman	7e242ca07a	Add labels from service discovery to proxy metrics reports (#661 ) PR #654 adds pod-based metric labels to the Destination API responses for cluster-local services. This PR modifies the proxy to actually add these labels to reported Prometheus metrics for outbound requests to local services. It enhances the proxy's `control::discovery` module to track these labels and add a `LabelRequest` middleware to the service stack built in `Bind` for labeled services. Requests transiting `LabelRequest` are given an `Extension` which contains these labels, which are then added to events produced by the `Sensors` for these requests. When these events are aggregated to Prometheus metrics, the labels are added. I've also added some tests in `test/telemetry.rs` ensuring that these metrics are added correctly when the Destination service provides labels. Closes #660 Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-04-12 12:54:38 -07:00
Sean McArthur	2b9033cf16	proxy: fix flaky tcp graceful shutdown test (#735 )	2018-04-10 19:47:00 -07:00
Sean McArthur	20855519d2	proxy: improve graceful shutdown process (#684 ) - The listener is immediately closed on receipt of a shutdown signal. - All in-progress server connections are now counted, and the process will not shutdown until the connection count has dropped to zero. - In the case of HTTP1, idle connections are closed. In the case of HTTP2, the HTTP2 graceful shutdown steps are followed of sending various GOAWAYs.	2018-04-10 14:15:37 -07:00
Brian Smith	b8015bca4e	Proxy: Do L7 load balancing for all external HTTP services. (#726 ) Previously when the proxy could tell, by parsing, the request-target is not in the cluster, it would not override the destination. That is, load balancing would be disabled for such destinations. With this change, the proxy will do L7 load balancing for all HTTP services as long as the request-target has a DNS name. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-04-10 08:07:16 -10:00
Eliza Weisman	a701682e7f	Add pretty durations to panics from `assert_eventually!` (#677 ) This PR adds the pretty-printing for durations I added in #676 to the panic message from the `assert_eventually!` macro added in #669. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-04-06 10:49:17 -07:00
Eliza Weisman	5415480ec7	Add `assert_eventually!` macro to help de-flake telemetry tests (#669 ) Closes #615. Based on @olix0r's suggestion in https://github.com/runconduit/conduit/issues/613#issuecomment-376024744, this PR adds an `assert_eventually!` macro to retry an assertion a set number of times, waiting for 15 ms between retries. This is loosely based on ScalaTest's [eventually](http://doc.scalatest.org/1.8/org/scalatest/concurrent/Eventually.html). I've rewritten the flaky telemetry tests to use the `assert_eventually!` macro, to compensate for delays in the served metrics being updated between client requests and metrics scrapes.	2018-04-05 11:23:34 -07:00
Phil Calçado	b8f5e41e31	Add pod-based metric_labels to destinations response (#429 ) (#654 ) * Extracted logic from destination server * Make tests follow style used elsewhere in the code * Extract single interface for resolvers * Add tests for k8s and ipv4 resolvers * Fix small usability issues * Update dep * Act on feedback * Add pod-based metric_labels to destinations response * Add documentation on running control plane to BUILD.md Signed-off-by: Phil Calcado <phil@buoyant.io> * Fix mock controller in proxy tests (#656) Signed-off-by: Eliza Weisman <eliza@buoyant.io> * Address review feedback * Rename files in the destination package Signed-off-by: Kevin Lingerfelt <kl@buoyant.io>	2018-04-02 18:36:57 -07:00
Sean McArthur	7071aefafa	proxy: allow disable protocol detection on specific ports (#648 ) - Adds environment variables to configure a set of ports that, when an incoming connection has an SO_ORIGINAL_DST with a port matching, will disable protocol detection for that connection and immediately start a TCP proxy. - Adds a default list of well known ports: SMTP and MySQL. Closes #339	2018-04-02 14:24:36 -07:00
Brian Smith	7aa57ec830	Proxy: Completely replace current set of destinations on reconnect (#632 ) Previosuly, when the proxy was disconnected from the Destination service and then reconnects, the proxy would not forget old, outdated entries in its cache of endpoints. If those endpoints had been removed while the proxy was disconnected then the proxy would never become aware of that. Instead, on the first message after a reconnection, replace the entire set of cached entries with the new set, which may be empty. Prior to this change, the new test outbound_destinations_reset_on_reconnect_followed_by_no_endpoints_exists passed already but outbound_destinations_reset_on_reconnect_followed_by_add_none and outbound_destinations_reset_on_reconnect_followed_by_remove_none failed. Now all these tests pass. Fixes #573 Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-29 16:50:08 -10:00
Eliza Weisman	3011369d31	Add response classification to proxy metrics (#639 ) This PR adds a `classification` label to proxy response metrics, as @olix0r described in https://github.com/runconduit/conduit/issues/634#issuecomment-376964083. The label is either "success" or "failure", depending on the following rules: + if the response had a gRPC status code, then - gRPC status code 0 is considered a success - all others are considered failures + else if the response had an HTTP status code, then - status codes < 500 are considered success, - status codes >= 500 are considered failures + else if the response stream failed then - the response is a failure. I've also added end-to-end tests for the classification of HTTP responses (with some work towards classifying gRPC responses as well). Additionally, I've updated `doc/proxy_metrics.md` to reflect the added `classification` label. Signed-off-by: Eliza Weisman <eliza@buoyant.io>	2018-03-28 14:49:00 -07:00
Brian Smith	67b99fa989	Proxy: Clarify destination test support code queue handling (#617 ) Use `VecDeqeue` to make the queue structure clear. Follow good practice by minimizing the amount of time the lock is held. Clarify how defaulting logic works. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-26 10:45:05 -10:00
Oliver Gould	8b619b9762	Skip flaky tests for #613 (#614 ) The metrics endpoint tests are flaky because there are no guarantees that the metrics pipeline has processed events before the metrics endpoint is read. This can cause CI to fail spuriously. Disable these tests from running in CI until #613 is resolved.	2018-03-25 14:26:14 -07:00
Eliza Weisman	5eb14ee80a	Add request_duration_ms metric and increment request_total on request end (#589 ) This PR adds the `request_duration_ms` metric to the Prometheus metrics exported by the proxy. It also modifies the `request_total` metric so that it is incremented when a request stream finishes, rather than when it opens, for consistency with how the `response_total` metric is generated. Making this change required modifying `telemetry::sensors::http` to generate a `StreamRequestEnd` event similar to the `StreamResponseEnd` event. This is done similarly to how sensors are added to response bodies, by generalizing the `ResponseBody` type into a `MeasuredBody` type that can wrap a request or response body. Since this changed the type of request bodies, it necessitated changing request types pretty much everywhere else in the proxy codebase in order to fix the resulting type errors, which is why the diff for this PR is so large. Closes #570	2018-03-22 15:27:34 -07:00
Eliza Weisman	f5a4701d20	Fix double comma in outbound metrics (#601 ) Fixes #600 The proxy metrics endpoint has a bug where metrics recorded in the outbound direction can contain two commas in a row when no outbound label is present. This occurs because the code for formatting the outbound direction label mistakenly assumed that there would always be a destination pod owner label as well, but the proxy isn't currently aware of the destination's pod owner (waiting for #429). I've fixed this issue by moving the place where the comma is output from the `fmt::Display` impl for `RequestLabels` to the `fmt::Display` impl for `OutboudnLabels`. This way, the comma between the `direction` and `dst_` labels is only output when the `dst_` label is present. This bug made it to master since all of the proxy end-to-end tests for metrics only test the inbound router. I've rectified this issue by adding tests on the outbound router as well (which would fail against the current master due to the double comma bug). I've also added a test that asserts there are no double commas in exported metrics, to protect against regressions to this bug.	2018-03-22 14:17:10 -07:00
Eliza Weisman	5e50f88093	Add Prometheus /metrics endpoint to proxy (#569 ) This PR adds an endpoint to the proxy that serves metrics in Prometheus' text exposition format. The endpoint currently serves the `request_total`, `response_total`, `response_latency_ms`, and `response_duration_ms metrics`, as described in #536. The endpoint's port and address are configurable with the `CONDUIT_PROXY_METRICS_LISTENER` environment variable. Tests have been added in t`ests/telemetry.rs`	2018-03-21 16:19:32 -07:00
Sean McArthur	79b6285f8b	proxy: add SIGTERM and SIGINT handlers (#581 ) When the proxy is run in a Docker container, it runs as PID 1, with no default signal handlers setup. In order to react to signals from Kubernetes about shutting down, we need to set up explicit handlers. This adds handlers for SIGTERM and SIGINT. Closes #549	2018-03-16 18:53:20 -07:00
Eliza Weisman	16371b3201	Run all discovery tests for HTTP/1 as well as HTTP/2 (#556 ) In order to ensure we catch discovery and routing issues arising from different logic for HTTP/1 and HTTP/2 requests, I've modified tests/discovery.rs to run all applicable tests with both HTTP/1 and HTTP/2 requests. The tests themselves are largely unchanged, but now there are separate modules containing HTTP/1 and HTTP/2 versions of a majority of the tests.	2018-03-09 17:24:48 -08:00
Eliza Weisman	698e355537	Fix outbound HTTP/1 requests not using Destinations (#555 ) Commit 569d6939a799bb0df6bd4053de7d7e8ac6b49ab6 introduced a regression that caused the proxy to stop using the Destination service for outbound HTTP/1 requests with no authority in the request URI but a valid authority in the `Host:` header. The bug is due to some code in `Outbound::recognize` which assumed that a request had already been passed through `normalize_our_view_of_uri`. This was valid at one point while I was writing #492, as URIs were normalized prior to `recognize` and a request `Extension` was used to mark that they had been rewritten, and the host header and request URI could be assumed to be in agreement, but after merging #514 into the dev branch for #492, this behaviour changed and I forgot to update the logic in `recognize`. I've fixed the issue by adding the logic for routing on `Host:` headers back into `Outbound::recognize`. @seanmonstar added a test in `discovery.rs`, `outbound_http1_asks_controller_about_host`, which should exercise this case. I've added a couple more unit tests in that file to try and ensure we cover more of the different cases that can occur here. Fixes #552	2018-03-09 16:25:19 -08:00
Sean McArthur	b30448ff82	proxy: improve transparency of host headers and absolute-uris (#535 ) In some cases, we would adjust an existing Host header, or add one. And in all cases when an HTTP/1 request was received with an absolute-form target, it was not passed on. Now, the Host header is never changed. And if the Uri was in absolute-form, it is sent in the same format. Closes #518	2018-03-08 13:15:21 -08:00
Brian Smith	cb943fce3d	Simplify cluster zone suffix handling in the proxy (#528 ) * Temporarily stop trying to support configurable zones in the proxy. None of the zone configuration is tested and lots of things assume the cluster zone is `cluster.local`. Further, how exactly the proxy will actually learn the cluster zone hasn't been decided yet. Just hard-code the zone as "cluster.local" in the proxy until configurable zones are fully implemented and tested to be working correctly. Signed-off-by: Brian Smith <brian@briansmith.org> * Remove the CONDUIT_PROXY_DESTINATIONS_AUTOCOMPLETE_FQDN setting The way that Kubernetes configures DNS search suffixes has some negative consequences as some names like "example.com" are ambiguous: depending on whether there is a service "example" in the "com" namespace, "example.com" may refer to an external service or an internal service, and this can fluctuate over time. In recognition of that we added the CONDUIT_PROXY_DESTINATIONS_AUTOCOMPLETE_FQDN setting, thinking this would be part of a solution for users to opt out of the unfortunate behavior if their applications didn't depend on the DNS search suffix feature. It turns out similar effects can be acheived using a custom dnsConfig, starting in Kubernetes 1.10 when dnsConfig reaches the beta stability level. Now any CONDUIT_PROXY_DESTINATIONS_AUTOCOMPLETE_FQDN-based seems duplicative. Further, attempting to support it optionally made the code complex and hard to read. Therefore, let's just remove it. If/when somebody actually requests this functionality then we can add it back, if dnsConfig isn't a valid alternative for them. Signed-off-by: Brian Smith <brian@briansmith.org> * Further hard-code "cluster.local" as the zone, temporarily. Addresses review feedback. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-03-07 14:30:13 -10:00
Eliza Weisman	d2c8d588e6	Enforce that requests are mapped to connections for each Host: header values (#492 ) This PR ensures that the mapping of requests to outbound connections is segregated by `Host:` header values. In most cases, the desired behavior is provided by Hyper's connection pooling. However, Hyper does not handle the case where a request had no `Host:` header and the request URI had no authority part, and the request was routed based on the SO_ORIGINAL_DST in the desired manner. We would like these requests to each have their own outbound connection, but Hyper will reuse the same connection for such requests. Therefore, I have modified `conduit_proxy_router::Recognize` to allow implementations of `Recognize` to indicate whether the service for a given key can be cached, and to only cache the service when it is marked as cachable. I've also changed the `reconstruct_uri` function, which rewrites HTTP/1 requests, to mark when a request had no authority and no `Host:` header, and the authority was rewritten to be the request's ORIGINAL_DST. When this is the case, the `Recognize` implementations for `Inbound` and `Outbound` will mark these requests as non-cachable. I've also added unit tests ensuring that A, connections are created per `Host:` header, and B, that requests with no `Host:` header each create a new connection. The first test passes without any additional changes, but the second only passes on this branch. The tests were added in PR #489, but this branch supersedes that branch. Fixes #415. Closes #489.	2018-03-06 16:44:14 -08:00
Sean McArthur	e07700bbcc	proxy: preserve body headers in http1 (#457 ) As a goal of being a transparent proxy, we want to proxy requests and responses with as little modification as possible. Basically, servers and clients should see messages that look the same whether the proxy was injected or not. With that goal in mind, we want to make sure that body headers (things like `Content-Length`, `Transfer-Encoding`, etc) are left alone. Prior to this commit, we at times were changing behavior. Sometimes `Transfer-Encoding` was added to requests, or `Content-Length: 0` may have been removed. While RC 7230 defines that differences are semantically the same, implementations may not handle them correctly. Now, we've added some fixes to prevent any of these header changes from occurring, along with tests to make sure library updates don't regress. For requests: - With no message body, `Transfer-Encoding: chunked` should no longer be added. - With `Content-Length: 0`, the header is forwarded untouched. For responses: - Tests were added that responses not allowed to have bodies (to HEAD requests, 204, 304) did not have `Transfer-Encoding` added. - Tests that `Content-Length: 0` is preserved. - Tests that HTTP/1.0 responses with no body headers do not have `Transfer-Encoding` added. - Tests that `HEAD` responses forward `Content-Length` headers (but not an actual body). Closes #447 Signed-off-by: Sean McArthur <sean@seanmonstar.com>	2018-03-05 18:10:51 -08:00
Sean McArthur	0effefa5d7	proxy: detect TCP socket hang ups from client or server (#463 ) We previously `join`ed on piping data from both sides, meaning that the future didn't complete until both sides had disconnected. Even if the client disconnected, it was possible the server never knew, and we "leaked" this future. To fix this, the `join` is replaced with a `Duplex` future, which pipes from both ends into the other, while also detecting when one side shuts down. When a side does shutdown, a write shutdown is forwarded to the other side, to allow draining to occur for deployments that half-close sockets. Closes #434	2018-03-02 10:14:54 -08:00
Brian Smith	457e65e512	Fix intermittent outbound_times_out failure. (#471 ) This was caused by the fact that a new instance of `env_logger::init()` was added after the PR that rewrote them all to `env_logger::try_init()` was added. Fixes #469 Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-26 19:07:36 -10:00
Brian Smith	6d3a7c337d	Reduce memory allocations during logging. (#445 ) Stop initializing env_logger in every test. In env_logger 0.5, it may only be initialized once per process. Also, Prost will soon upgrade to env_logger 0.5 and this will (eventually) help reduce the number of versions of env_logger we have to build. Turning off the regex feature will (eventually) also reduce the number of dependencies we have to build. Unfortunately, as it is now, the number of dependencies has increased because env_logger increased its dependencies in 0.5. Signed-off-by: Brian Smith <brian@briansmith.org>	2018-02-26 18:32:47 -10:00

1 2

66 Commits