boulder

Commit Graph

Author	SHA1	Message	Date
Aaron Gable	dc14caf907	Add MPICFullResults feature flag to turn off VA early return (#8046 ) Add a new "MPICFullResults" feature flag. When this flag is enabled in the VA, it will wait for all Remote VAs to return their results for both Domain Control Validation and CAA checking, rather than short-circuiting as soon as it has seen enough results to know whether corroboration will or will not be achieved. We make this change because waiting for these to return honestly doesn't take that long, because we do validation (although not CAA rechecking) asynchronously, and because it improves the quality of our MPIC quorum summary logs (so we don't always say only 3/4 concurred because the fourth was cancelled). Fixes https://github.com/letsencrypt/boulder/issues/7809	2025-03-11 08:49:05 -07:00
Samantha Frank	e6c812a3db	va/ra: Deprecate EnforceMultiCAA and EnforceMPIC (#8025 ) Replace DCV and CAA checks (PerformValidation and IsCAAValid) in va/va.go and va/caa.go with their MPIC compliant counterparts (DoDCV and DoCAA) in va/vampic.go. Deprecate EnforceMultiCAA and EnforceMPIC and default code paths as though they are both true. Require that RIR and Perspective be set for primary and remote VAs. Fixes #7965 Fixes #7819	2025-03-03 16:33:27 -05:00
Samantha Frank	d64132eebc	VA: Use performValidation for IsCAAValid remote checks (#7850 ) - Remove undeployed feature flag MultiCAAFullResults - Perform local CAA checks prior to initiating remote checks, instead of starting remote checks and proceeding to perform local checks. - Remove VA.IsCAAValid specific remote validation logic, use VA.performRemoteOperation instead - Refactor va.logRemoteResults to be easier to test and omit the RVA problem - Drive-by fix: Calculate logEvent.Latency with va.clk.Since() instead of time.Since() like everything else in VA.performRemoteOperation	2024-11-28 15:24:47 -05:00
Samantha Frank	c3948314ff	va: Make the primary VA aware of the Perspective and RIR of each remote (#7839 ) - Make the primary VA aware of the expected Perspective and RIR of each remote VA. - All Perspectives should be unique, have the primary VA check for duplicate Perspectives at startup. - Update test setup functions to ensure that each remote VA client and corresponding inmem impl have a matching perspective and RIR. Part of #7819	2024-11-25 13:02:03 -05:00
Jacob Hoffman-Andrews	577a1e38eb	va: prepare to require minimum of 3 RVAs (#7815 ) To prepare for the MPIC requirement of having a minimum of 3 perspectives, I added code to `NewValidationAuthorityImpl` to error if there aren't enough remote VAs configured _and_ the current VA is the primary perspective. Then I fixed all the tests, which involved adding some backends in the unittests, and spinning up `remoteva-c` in the integration tests. As a reminder, the `boulder va` command always considers itself the primary perspective, while `boulder remoteva` gives itself a perspective based on its config. I wound up backing out the code in `NewValidationAuthorityImpl` because right now our remote VAs are actually running the `boulder va` command, so they would error out in prod, even though our actual primary perspective does have enough backends. So this wound up as a test-only change.	2024-11-19 10:23:32 -05:00
Jacob Hoffman-Andrews	a46c388f66	va: compute maxRemoteFailures based on MPIC (#7810 ) Previously this was a configuration field. Ports `maxAllowedFailures()` from `determineMaxAllowedFailures()` in #7794. Test updates: Remove the `maxRemoteFailures` param from `setup` in all VA tests. Some tests were depending on setting this param directly to provoke failures. For example, `TestMultiVAEarlyReturn` previously relied on "zero allowed failures". Since the number of allowed failures is now 1 for the number of remote VAs we were testing (2), the VA wasn't returning early with an error; it was succeeding! To fix that, make sure there are two failures. Since two failures from two RVAs wouldn't exercise the right situation, add a third RVA, so we get two failures from three RVAs. Similarly, TestMultiCAARechecking had several test cases that omitted this field, effectively setting it to zero allowed failures. I updated the "1 RVA failure" test case to expect overall success and added a "2 RVA failures" test case to expect overall failure (we previously expected overall failure from a single RVA failing). In TestMultiVA I had to change a test for `len(lines) != 1` to `len(lines) == 0`, because with more backends we were now logging more errors, and finding e.g. `len(lines)` to be 2.	2024-11-18 15:36:09 -08:00
Aaron Gable	3b62e81999	Clean up migration to separate remoteva executable (#7787 ) Fixes https://github.com/letsencrypt/boulder/issues/7733	2024-11-05 07:44:08 -08:00
Aaron Gable	c3c278a1a2	Deprecate EnforceMultiVA and MultiVAFullResults feature flags (#7520 ) These flags have been true and false, respectively, for years. We do not expect to change them at any time in the future, and their continued existence makes certain parts of the VA code significantly more complex. Remove all references to them, preserving behavior in the "enforce, but not full results" configuration. IN-10358 tracks the corresponding config changes	2024-06-04 11:57:03 -07:00
Aaron Gable	6ae6aa8e90	Dynamically generate grpc-creds at integration test startup (#7477 ) The summary here is: - Move test/cert-ceremonies to test/certs - Move .hierarchy (generated by the above) to test/certs/webpki - Remove our mapping of .hierarchy to /hierarchy inside docker - Move test/grpc-creds to test/certs/ipki - Unify the generation of both test/certs/webpki and test/certs/ipki into a single script at test/certs/generate.sh - Make that script the entrypoint of a new docker compose service - Have t.sh and tn.sh invoke that service to ensure keys and certs are created before tests run No production changes are necessary, the config changes here are just for testing purposes. Part of https://github.com/letsencrypt/boulder/issues/7476	2024-05-15 11:31:23 -04:00
Phil Porada	c1561b070b	Add a new remoteva binary (#7437 ) * Adds a new `remoteva` binary that takes a distinct configuration from the existing `boulder-va` * Removed the `boulder-remoteva` name registration from `boulder-va`. * Existing users of `boulder-remoteva` must either 1. laterally migrate to `boulder-va` which uses that same config, or 2. switch to using `remoteva` with a new config. Part of https://github.com/letsencrypt/boulder/issues/5294	2024-05-06 16:29:29 -04:00
Aaron Gable	78e4e82ffa	Feature cleanup (#7320 ) Remove three deprecated feature flags which have been removed from all production configs: - StoreLintingCertificateInsteadOfPrecertificate - LeaseCRLShards - AllowUnrecognizedFeatures Deprecate three flags which are set to true in all production configs: - CAAAfterValidation - AllowNoCommonName - SHA256SubjectKeyIdentifier IN-9879 tracked the removal of these flags.	2024-02-13 17:42:27 -08:00
Phil Porada	03152aadc6	RVA: Recheck CAA records (#7221 ) Previously, `va.IsCAAValid` would only check CAA records from the primary VA during initial domain control validation, completely ignoring any configured RVAs. The upcoming [MPIC](https://github.com/ryancdickson/staging/pull/8) ballot will require that it be done from multiple perspectives. With the currently deployed [Multi-Perspective Validation](https://letsencrypt.org/2020/02/19/multi-perspective-validation.html) in staging and production, this change brings us in line with the [proposed phase 3](https://github.com/ryancdickson/staging/pull/8/files#r1368708684). This change reuses the existing [MaxRemoteValidationFailures](`21fc191273/cmd/boulder-va/main.go (L35)`) variable for the required non-corroboration quorum. > Phase 3: June 15, 2025 - December 14, 2025 ("CAs MUST implement MPIC in blocking mode"): > > MUST implement MPIC? Yes > Required quorum?: Minimally, 2 remote perspectives must be used. If using less than 6 remote perspectives, 1 non-corroboration is allowed. If using 6 or more remote perspectives, 2 non-corroborations are allowed. > MUST block issuance if quorum is not met: Yes. > Geographic diversity requirements?: Perspectives must be 500km from 1) the primary perspective and 2) all other perspectives used in the quorum. > > Note: "Blocking Mode" is a nickname. As opposed to "monitoring mode" (described in the last milestone), CAs MUST NOT issue a certificate if quorum requirements are not met from this point forward. Adds new VA feature flags: * `EnforceMultiCAA` instructs a primary VA to command each of its configured RVAs to perform a CAA recheck. * `MultiCAAFullResults` causes the primary VA to block waiting for all RVA CAA recheck results to arrive. Renamed `va.logRemoteValidationDifferentials` to `va.logRemoteDifferentials` because it can handle initial domain control validations and CAA rechecking with minimal editing. Part of https://github.com/letsencrypt/boulder/issues/7061	2024-01-25 16:23:25 -05:00
Jacob Hoffman-Andrews	7b347dd6c3	Use different ports for instances of the same service (#7246 ) Part of #7245. This just provides a unique port for each instance, and breaks the service<->port mapping. A subsequent PR will move to listening on the same IP. Remove unused `-b` variants of crl-storer and akamai-purger. The new port scheme is that the first instance of a service is on `93xx` and the second instance of a service is on `94xx`. Part of a stacked change with #7243.	2024-01-10 14:32:33 -08:00
Jacob Hoffman-Andrews	c21b376623	Implement DoH for validation queries (#7178 ) Fixes: #7141	2023-12-11 10:49:00 -08:00
Matthew McPherrin	cb5384dcd7	Add --addr and/or --debug-addr flags to all commands (#7175 ) Many services already have --addr and/or --debug-addr flags. However, it wasn't universal, so this PR adds flags to commands where they're not currently present. This makes it easier to use a shared config file but listen on different ports, for running multiple instances on a single host. The config options are made optional as well, and removed from config-next/.	2023-12-07 17:41:01 -08:00
Aaron Gable	3b880e1ccf	Add CAAAfterValidation feature flag (#7082 ) Add a new feature flag "CAAAfterValidation" which, when set to true in the VA, causes the VA to only begin CAA checks after basic domain control validation has completed successfully. This will make successful validations take longer, since the DCV and CAA checks are performed serially instead of in parallel. However, it will also reduce the number of CAA checks we perform by up to 80%, since such a high percentage of validations also fail. IN-9575 tracks enabling this feature flag in staging and prod Fixes https://github.com/letsencrypt/boulder/issues/7058	2023-09-18 13:30:31 -07:00
Aaron Gable	e09c5faf5e	Deprecate CAA AccountURI and ValidationMethods feature flags (#7000 ) These flags are set to true in all environments.	2023-07-14 14:54:39 -04:00
Matthew McPherrin	b7d9f8c2e3	In config-next/, opentelemetry -> openTelemetry for consistency (#6888 ) In configs, opentelemetry -> openTelemetry As pointed out in review of #6867, these should match the case of their corresponding Go identifiers for consistency. JSON keys are case-insensitive in Go (part of why we've got a fork in go-jose), so this change should have no functional impact.	2023-05-15 17:07:29 -04:00
Samantha	310546a14e	VA: Support discovery of DNS resolvers via Consul (#6869 ) Deprecate `va.DNSResolver` in favor of backwards compatible `va.DNSProvider`. Fixes #6852	2023-05-12 12:54:31 -04:00
Matthew McPherrin	8427245675	OTel Integration test using jaeger (#6842 ) This adds Jaeger's all-in-one dev container (with no persistent storage) to boulder's dev docker-compose. It configures config-next/ to send all traces there. A new integration test creates an account and issues a cert, then verifies the trace contains some set of expected spans. This test found that async finalize broke spans, so I fixed that and a few related spots where we make a new context.	2023-05-05 10:41:29 -04:00
Matthew McPherrin	05c9106eba	lints: Consistently format JSON configuration files (#6755 ) - Consistently format existing test JSON config files - Add a small Python script which loads and dumps JSON files - Add CI JSON lint test to CI --------- Co-authored-by: Aaron Gable <aaron@aarongable.com>	2023-03-20 18:11:19 -04:00
Matthew McPherrin	e1ed1a2ac2	Remove beeline tracing (#6733 ) Remove tracing using Beeline from Boulder. The only remnant left behind is the deprecated configuration, to ensure deployability. We had previously planned to swap in OpenTelemetry in a single PR, but that adds significant churn in a single change, so we're doing this as multiple steps that will each be significantly easier to reason about and review. Part of #6361	2023-03-14 15:14:27 -07:00
Phil Porada	3866e4f60d	VA: Use default PortConfig during testing (#6609 ) Part of #3940	2023-01-25 16:16:08 -05:00
Aaron Gable	46c8d66c31	bgrpc.NewServer: support multiple services (#6487 ) Turn bgrpc.NewServer into a builder-pattern, with a config-based initialization, multiple calls to Add to add new gRPC services, and a final call to Build to produce the start() and stop() functions which control server behavior. All calls are chainable to produce compact code in each component's main() function. This improves the process of creating a new gRPC server in three ways: 1) It avoids the need for generics/templating, which was slightly verbose. 2) It allows the set of services to be registered on this server to be known ahead of time. 3) It greatly streamlines adding multiple services to the same server, which we use today in the VA and will be using soon in the SA and CA. While we're here, add a new per-service config stanza to the GRPCServerConfig, so that individual services on the same server can have their own configuration. For now, only provide a "ClientNames" key, which will be used in a follow-up PR. Part of #6454	2022-11-04 13:26:42 -07:00
Samantha	9c12e58c7b	grpc: Allow static host override in client config (#6423 ) - Add a new gRPC client config field which overrides the dNSName checked in the certificate presented by the gRPC server. - Revert all test gRPC credentials to `<service>.boulder` - Revert all ClientNames in gRPC server configs to `<service>.boulder` - Set all gRPC clients in `test/config` to use `serverAddress` + `hostOverride` - Set all gRPC clients in `test/config-next` to use `srvLookup` + `hostOverride` - Rename incorrect SRV record for `ca` with port `9096` to `ca-ocsp` - Rename incorrect SRV record for `ca` with port `9106` to `ca-crl` Resolves #6424	2022-10-03 15:23:55 -07:00
Samantha	90eb90bdbe	test: Replace sd-test-srv with consul (#6389 ) - Add a dedicated Consul container - Replace `sd-test-srv` with Consul - Add documentation for configuring Consul - Re-issue all gRPC credentials for `<service-name>.service.consul` Part of #6111	2022-09-19 16:13:53 -07:00
Jacob Hoffman-Andrews	db044a8822	log: fix spurious honeycomb warnings; improve stdout logger (#6364 ) Honeycomb was emitting logs directly to stderr like this: ``` WARN: Missing API Key. WARN: Dataset is ignored in favor of service name. Data will be sent to service name: boulder ``` Fix this by providing a fake API key and replacing "dataset" with "serviceName" in configs. Also add missing Honeycomb configs for crl-updater. For stdout-only logger, include checksums and escape newlines.	2022-09-14 11:25:02 -07:00
Aaron Gable	9abb39d4d6	Honeycomb integration proof-of-concept (#5408 ) Add Honeycomb tracing to all Boulder components which act as HTTP servers, gRPC servers, or gRPC clients. Add many values which we currently emit to logs to the trace spans. Add a way to configure the Honeycomb integration to our config files, and by default configure all of our tests to "mute" (send nothing). Followup changes will refine the configuration, attempt to reduce the new dependency load, and introduce better sampling. Part of https://github.com/letsencrypt/dev-misc-tickets/issues/218	2021-05-24 16:13:08 -07:00
Aaron Gable	a19ebfa0e9	VA: Query SRV to preload/cache DNS resolver addrs (#5360 ) Abstract out the way that the bdns library keeps track of the resolvers it uses to do DNS lookups. Create one implementation, the `StaticProvider`, which behaves exactly the same as the old mechanism (providing whatever names or addresses were given in the config). Create another implementation, `DynamicProvider`, which re-resolves the provided name on a regular basis. The dynamic provider consumes a single name, does a lookup on that name for any SRV records suggesting that it is running a DNS service, and then looks up A records to get the address of all the names returned by the SRV query. It exports its successes and failures as a prometheus metric. Finally, update the tests and config-next configs to work with this new mechanism. Give sd-test-srv the capability to respond to SRV queries, and put the names it provides into docker's default DNS resolver. Fixes #5306	2021-04-20 10:11:53 -07:00
Aaron Gable	547dbfc93a	Remove Common.DNSResolver from VA config (#5355 ) This field is not used by any production configs, so we can safely remove it. Also, add config fields for DNSTimeout and DNSAllowLoopbackAddress outside of the Common sub-struct, to allow for its removal later. Part of #5242	2021-03-19 10:02:04 -07:00
Jacob Hoffman-Andrews	b4e483d38b	Add gRPC MaxConnectionAge config. (#5311 ) This allows servers to tell clients to go away after some period of time, which triggers the clients to re-resolve DNS. Per grpc/grpc#12295, this is the preferred way to do this. Related: #5307.	2021-03-01 18:37:47 -08:00
Aaron Gable	3666322817	Add health-checker tool and use it from startservers.py (#5095 ) This adds a new tool, `health-checker`, which is a client of the new Health Checker Service that has been integrated into all of our boulder components. This tool takes an address, a timeout, and a config file. It then attempts to connect to a gRPC Health Service at the given address, retrying until it hits its timeout, using credentials specified by the config file. This is then wrapped by a new function `waithealth` in our Python helpers, which serves much the same function as `waitport`, but specifically for services which surface a gRPC Health Service This in turn requires slight modifications to `startservers`, namely specifying the address and port on which each service starts its gRPC listener. Finally, this change also introduces new credentials for this health-checker, and adds those credentials as a valid client to all services' json configs. A similar change would have to be made to our production configs if we were to establish a long-lived health checker/prober in prod. Fixes #5074	2020-10-06 15:01:35 -07:00
Aaron Gable	440c5f96d9	Remove unreferenced values from test configs (#4959 )	2020-07-15 13:50:00 -07:00
orangepizza	dee757c057	Remove multiva exception list code (#4933 ) Fixes #4931	2020-07-08 10:57:17 -07:00
Jacob Hoffman-Andrews	5527716410	Port v1 integration tests to v2. (#4807 ) As of this change, each test case in v1_integration.py has an equivalent in v2_integration.py. This mostly involved copying the test cases and tweaking them to use chisel2.py. I had to add support for updating email addresses in chisel2.py (copied from chisel.py) in order to support one of the test cases. The VA was not yet configured to recognize account paths that start with the ACMEv2 path, so I added that configuration. The most useful way to see what's changed in porting the test cases is to check out this branch and then do a diff between v1_integration.py and v2_integration.py.	2020-05-13 11:59:04 -07:00
Jacob Hoffman-Andrews	87fb6028c1	Add log validator to integration tests (#4782 ) For now this mainly provides an example config and confirms that log-validator can start up and shut down cleanly, as well as provide a stat indicating how many log lines it has handled. This introduces a syslog config to the boulder-tools image that will write logs to /var/log/program.log. It also tweaks the various .json config files so they have non-default syslogLevel, to ensure they actually write something for log-validator to verify.	2020-04-20 13:33:42 -07:00
Daniel McCarney	ddfc620c44	va: exempt multi-va enforcement by domain/acct ID. (#4458 ) In order to move multi perspective validation forward we need to support policy in Boulder configuration that can relax multi-va requirements temporarily. A similar mechanism was used in support of the gradual deprecation of the TLS-SNI-01 challenge type and with the introduction of CAA enforcement and has shown to be a helpful tool to have available when introducing changes that are expected to break sites. When the VA "multiVAPolicyFile" is specified it is assumed to be a YAML file containing two lists: 1. disabledNames - a list of domain names that are exempt from multi VA enforcement. 2. disabledAccounts - a list of account IDs that are exempt from multi VA enforcement. When a hostname or account ID is added to the policy we'll begin communication with the related ACME account contact to establish that this is a temporary measure and the root problem will need to be addressed before an eventual cut-off date. Resolves https://github.com/letsencrypt/boulder/issues/4455	2019-10-07 16:43:11 -04:00
Jacob Hoffman-Andrews	5e7fee0c4a	test: update test/config with deployed configs. (#4396 )	2019-08-09 12:08:56 -04:00
Roland Bracewell Shoemaker	3532dce246	Excise grpc maxConcurrentStreams configuration (#4257 )	2019-06-12 09:35:24 -04:00
Jacob Hoffman-Andrews	0c700143bb	Clean up README and test configs (#4185 ) - docker-rebuild isn't needed now that boulder and bhsm containers run directly off the boulder-tools image. - Remove DNS options from RA config. - Remove GSB options from VA config.	2019-04-30 13:26:19 -07:00
Daniel McCarney	063a98f02a	VA: additional feature flag control for multiVA. (#4122 ) * `EnforceMultiVA` to allow configuring multiple VAs but not changing the primary VA's result based on what the remote VAs return. * `MultiVAFullResults` to allow collecting all of the remote VA results. When all results are collected a JSON log line with the differential between the primary/remote VAs is logged. Resolves https://github.com/letsencrypt/boulder/issues/4066	2019-03-25 12:23:53 -04:00
Roland Bracewell Shoemaker	842739bccd	Remove deprecated features that have been purged from prod and staging configs (#4001 )	2019-01-15 16:16:35 -08:00
Daniel McCarney	d9d2f4e9b0	VA: Simplified HTTP-01 w/ IP address URLs (#3939 ) Continued bugs from the custom dialer approach used by the VA for HTTP-01 (most recently https://github.com/letsencrypt/boulder/issues/3889) motivated a rewrite. Instead of using a custom dialer to be able to control DNS resolution for HTTP validation requests we can construct URLs for the IP addresses we resolve and overload the Host header. This avoids having to do address resolution within the dialer and eliminates the complexity of the dialer `addrInfoChan`. The only thing left for our custom dialer now is to shave some time off of the provided context to help us discern timeouts before/after connect. The existing IP preference & fallback behaviour is preserved: e.g. if a host has both IPv6 and IPv4 addresses we connect to the first IPv6 address. If there is a network error connecting to that address (e.g. an error during "dial"), we try once more with the first IPv4 address. No other retries are done. Matching existing behaviour no fallback is done for HTTP level failures on an IPv6 address (e.g. mismatched webroots, redirect loops, etc). A new Prometheus counter "http01_fallbacks" is used to keep track of the number of fallbacks performed. As a result of moving the layer at which the retry happens a fallback like described above will now produce two validation records: one for the initial IPv6 connection, and one for the IPv4 connection. Neither will have the "addressesTried" field populated, just "addressesResolved" and "addressUsed". Previously with the dialer doing the retry we would have created just one validation record with an IPv4 "addressUsed" field and both an IPv6 and IPv4 address in the "addressesTried" field. Because this is a big diff for a key part of the VA the new code is gated by the `SimplifiedVAHTTP` feature flag. Resolves #3889	2018-11-19 14:15:39 -05:00
Roland Bracewell Shoemaker	876c727b6f	Update gRPC (#3817 ) Fixes #3474.	2018-08-20 10:55:42 -04:00
Joel Sing	9c2859c87b	Add support for CAA account-uri validation. (#3736 ) This adds support for the account-uri CAA parameter as specified by section 3 of https://tools.ietf.org/html/draft-ietf-acme-caa-04, allowing issuance to be restricted to one or more ACME accounts as specified by CAA records.	2018-06-08 12:08:03 -07:00
Joel Sing	2540d59296	Implement CAA validation-methods checking. (#3716 ) When performing CAA checking respect the validation-methods parameter (if present) and restrict the allowed authorization methods to those specified. This allows a domain to restrict authorization methods that can be used with Let's Encrypt. This is largely based on PR #3003 (by @lukaslihotzki), which was landed and then later reverted due to issue #3143. The bug the resulted in the previous code being reverted has been addressed (likely inadvertently) by `76973d0f`. This implementation also includes integration tests for CAA validation-methods. Fixes issue #3143.	2018-05-23 14:32:31 -07:00
Jacob Hoffman-Andrews	a4421ae75b	Run gRPC backends on multiple IPs instead of multiple ports (#3679 ) We're currently stuck on gRPC v1.1 because of a breaking change to certificate validation in gRPC 1.8. Our gRPC balancer uses a static list of multiple hostnames, and expects to validate against those hostnames. However gRPC expects that a service is one hostname, with multiple IP addresses, and validates all those IP addresses against the same hostname. See grpc/grpc-go#2012. If we follow gRPC's assumptions, we can rip out our custom Balancer and custom TransportCredentials, and will probably have a lower-friction time in general. This PR is the first step in doing so. In order to satisfy the "multiple IPs, one port" property of gRPC backends in our Docker container infrastructure, we switch to Docker's user-defined networking. This allows us to give the Boulder container multiple IP addresses on different local networks, and gives it different DNS aliases in each network. In startservers.py, each shard of a service listens on a different DNS alias for that service, and therefore a different IP address. The listening port for each shard of a service is now identical. This change also updates the gRPC service certificates. Now, each certificate that is used in a gRPC service (as opposed to something that is "only" a client) has three names. For instance, sa1.boulder, sa2.boulder, and sa.boulder (the generic service name). For now, we are validating against the specific hostnames. When we update our gRPC dependency, we will begin validating against the generic service name. Incidentally, the DNS aliases feature of Docker allows us to get rid of some hackery in entrypoint.sh that inserted entries into /etc/hosts. Note: Boulder now has a dependency on the DNS aliases feature in Docker. By default, docker-compose run creates a temporary container and doesn't assign any aliases to it. We now need to specify docker-compose run --use-aliases to get the correct behavior. Without --use-aliases, Boulder won't be able to resolve the hostnames it wants to bind to.	2018-05-07 10:38:31 -07:00
Jacob Hoffman-Andrews	2a1cd4981a	Allow configuring gRPC's MaxConcurrentStreams (#3642 ) During periods of peak load, some RPCs are significantly delayed (on the order of seconds) by client-side blocking. HTTP/2 clients have to obey a "max concurrent streams" setting sent by the server. In Go's HTTP/2 implementation, this value [defaults to 250](https://github.com/golang/net/blob/master/http2/server.go#L56), so the gRPC default is also 250. So whenever there are more than 250 requests in progress at a time, additional requests will be delayed until there is a slot available. During this peak load, we aren't hitting limits on CPU or memory, so we should increase the max concurrent streams limit to take better advantage of our available resources. This PR adds a config field to do that. Fixes #3641.	2018-04-12 17:17:17 -04:00
Jacob Hoffman-Andrews	a4f9de9e35	Improve nesting of RPC deadlines (#3619 ) gRPC passes deadline information through the RPC boundary, but client and server have the same deadline. Ideally we'd like the server to have a slightly tighter deadline than the client, so if one of the server's onward RPCs or other network calls times out, the server can pass back more detailed information to the client, rather than the client timing out the server and losing the opportunity to log more detailed information about which component caused the timeout. In this change, I subtract 100ms from the deadline on the server side of our interceptors, using our existing serverInterceptor. I also check that there is at least 100ms remaining in which to do useful work, so the server doesn't begin a potentially expensive task only to abort it. Fixes #3608.	2018-04-06 15:40:18 +01:00
Roland Bracewell Shoemaker	cc5ec34539	Allow configuration of multiple DNS resolvers (#3612 ) * Allow configuration of multiple DNS resolvers * Use multiple DNS resolvers in integration tests Fixes #3611.	2018-04-05 11:51:22 -04:00

1 2

75 Commits