boulder

Commit Graph

Author	SHA1	Message	Date
Aaron Gable	a00821ada6	Scale ARI suggested window to cert lifetime (#8024 ) Compute the width of the ARI suggested renewal window as 2% of the validity period. This means that 90-day certificates have their suggested window shrink slightly from 48 hours to 43.2 hours, and gives six-day (160h) certs a suggested window 3.2 hours wide. Also move the center of that window to the midpoint of the certificate validity period for certs which are valid for less than 10 days, so that operators have (proportionally) a little more time to respond to renewal issues. Fixes https://github.com/letsencrypt/boulder/issues/7996	2025-03-05 15:32:25 -08:00
James Renken	f6c748c1c3	WFE/nonce: Remove deprecated NoncePrefixKey field (#7825 ) Remove the deprecated WFE & nonce config field `NoncePrefixKey`, which has been replaced by `NonceHMACKey`. <del>DO NOT MERGE until:</del> - <del>#7793 (in `release-2024-11-18`) has been deployed, AND:</del> - <del>`NoncePrefixKey` has been removed from all running configs.</del> Fixes #7632	2025-02-06 15:32:49 -08:00
Aaron Gable	c5a28cd26d	WFE: Refuse to finalize orders with unrecognized profiles (#7988 ) The current profiles draft (https://datatracker.ietf.org/doc/draft-aaron-acme-profiles/00/) says: > If a server receives a request to finalize an Order whose profile the > CA is no longer willing to issue under, it MUST respond with a > problem document of type "invalidProfile". The server SHOULD attempt > to avoid this situation, e.g. by ensuring that all Orders for a > profile have expired before it stops issuing under that profile. Add types and helper functions representing this new error type to the berrors, probs, and web packages. Update the WFE code which rejects new-order requests with unrecognized profiles to use these new types, and add similar code to the WFE's finalize path. Update the unit and integration tests to reflect the fact that we now configure at least one profile in both Staging and Prod (tracked in IN-10574).	2025-01-30 14:10:02 -08:00
Jacob Hoffman-Andrews	55b8cbef6c	tests: increase wfe log level (#7982 ) We've been seeing some flaky integration tests where issuance fails. The integration test only has access to the generic user-facing error. The real error is available as `InternalError` in the WFE logs, but we need a higher log level to see it.	2025-01-27 11:24:08 -08:00
James Renken	e4668b4ca7	Deprecate DisableLegacyLimitWrites & UseKvLimitsForNewOrder flags; remove code using certificatesPerName & newOrdersRL tables (#7858 ) Remove code using `certificatesPerName` & `newOrdersRL` tables. Deprecate `DisableLegacyLimitWrites` & `UseKvLimitsForNewOrder` flags. Remove legacy `ratelimit` package. Delete these RA test cases: - `TestAuthzFailedRateLimitingNewOrder` (rl: `FailedAuthorizationsPerDomainPerAccount`) - `TestCheckCertificatesPerNameLimit` (rl: `CertificatesPerDomain`) - `TestCheckExactCertificateLimit` (rl: `CertificatesPerFQDNSet`) - `TestExactPublicSuffixCertLimit` (rl: `CertificatesPerDomain`) Rate limits in NewOrder are now enforced by the WFE, starting here: `5a9b4c4b18/wfe2/wfe.go (L781)` We collect a batch of transactions to check limits, check them all at once, go through and find which one(s) failed, and serve the failure with the Retry-After that's furthest in the future. All this code doesn't really need to be tested again; what needs to be tested is that we're returning the correct failure. That code is `NewOrderLimitTransactions`, and the `ratelimits` package's tests cover this. The public suffix handling behavior is tested by `TestFQDNsToETLDsPlusOne`: `5a9b4c4b18/ratelimits/utilities_test.go (L9)` Some other RA rate limit tests were deleted earlier, in #7869. Part of #7671.	2025-01-10 12:50:57 -08:00
Samantha Frank	1ddd4633f5	DB: Promote pausing schema from config-next to config (#7878 )	2024-12-11 14:38:55 -05:00
Aaron Gable	bac5602c6d	Always use INCRBY for redis rate limits (#7856 ) Deprecate the IncrementRateLimits feature flag, and always use the redis INCRBY instruction to update rate limit TATs. Fixes https://github.com/letsencrypt/boulder/issues/7855	2024-12-02 15:25:33 -08:00
Jacob Hoffman-Andrews	5be3e99a4d	features: remove deprecated features (#7805 ) Fixes #7802	2024-11-13 10:22:32 -05:00
Aaron Gable	2603aa45a8	Remove weakKeyFile and blockedKeyFile support (#7783 ) Goodkey has two ways to detect a key as weak: it runs a variety of algorithmic checks (such as Fermat factorization and rocacheck), or the key can be listed in a "weak key file". Similarly, it has two ways to detect a key as blocked: it can call a generic function (which we use to query our database), or the key can be listed in a "blocked key file". This is two methods too many. Reliance on files of weak or blocked keys introduces unnecessary complexity to both the implementation and configuration of the goodkey package. Remove both "key file" options and delete all code which supported them. Also remove //test/block-a-key, as it was only used to generate these test files. IN-10762 tracked the removal of these files in prod. Fixes https://github.com/letsencrypt/boulder/issues/7748	2024-11-06 10:48:39 -08:00
Samantha Frank	6c85b8d019	wfe/sa/features: Deprecate TrackReplacementCertificatesARI (#7766 )	2024-10-24 13:38:33 -04:00
Samantha Frank	e5edb7077f	wfe/features: Deprecate UseKvLimitsForNewOrder (#7765 ) Default code paths that depended on this flag to be true. Part of #5545	2024-10-23 18:13:24 -04:00
Samantha Frank	c034221f59	config: Default to checking renewal exemption at WFE (#7706 ) Part of https://github.com/letsencrypt/boulder/issues/7511	2024-09-27 16:42:54 -04:00
Aaron Gable	146b78a0f7	Remove all static minica keys (#7489 ) Remove the redis-tls, wfe-tls, and mail-test-srv keys which were generated by minica and then checked in to the repo. All three are replaced by the dynamically-generated ipki directory. Part of https://github.com/letsencrypt/boulder/issues/7476	2024-05-17 11:45:40 -07:00
Aaron Gable	6ae6aa8e90	Dynamically generate grpc-creds at integration test startup (#7477 ) The summary here is: - Move test/cert-ceremonies to test/certs - Move .hierarchy (generated by the above) to test/certs/webpki - Remove our mapping of .hierarchy to /hierarchy inside docker - Move test/grpc-creds to test/certs/ipki - Unify the generation of both test/certs/webpki and test/certs/ipki into a single script at test/certs/generate.sh - Make that script the entrypoint of a new docker compose service - Have t.sh and tn.sh invoke that service to ensure keys and certs are created before tests run No production changes are necessary, the config changes here are just for testing purposes. Part of https://github.com/letsencrypt/boulder/issues/7476	2024-05-15 11:31:23 -04:00
Aaron Gable	327f96d281	Update integration test hierarchy for the modern era (#7411 ) Update the hierarchy which the integration tests auto-generate inside the ./hierarchy folder to include three intermediates of each key type, two to be actively loaded and one to be held in reserve. To facilitate this: - Update the generation script to loop, rather than hard-coding each intermediate we want - Improve the filenames of the generated hierarchy to be more readable - Replace the WFE's AIA endpoint with a thin aia-test-srv so that we don't have to have NameIDs hardcoded in our ca.json configs Having this new hierarchy will make it easier for our integration tests to validate that new features like "unpredictable issuance" are working correctly. Part of https://github.com/letsencrypt/boulder/issues/729	2024-04-08 14:06:00 -07:00
Jacob Hoffman-Andrews	ce5632b480	Remove `service1` / `service2` names in consul (#7266 ) These names corresponded to single instances of a service, and were primarily used for (a) specifying which interface to bind a gRPC port on and (b) allowing `health-checker` to check individual instances rather than a service as a whole. For (a), change the `--grpc-addr` flags to bind to "all interfaces." For (b), provide a specific IP address and port for health checking. This required adding a `--hostOverride` flag for `health-checker` because the service certificates contain hostname SANs, not IP address SANs. Clarify the situation with nonce services a little bit. Previously we had one nonce "service" in Consul and got nonces from that (i.e. randomly between the two nonce-service instances). Now we have two nonce services in consul, representing multiple datacenters, and one of them is explicitly configured as the "get" service, while both are configured as the "redeem" service. Part of #7245. Note this change does not yet get rid of the rednet/bluenet distinction, nor does it get rid of all use of 10.88.88.88. That will be a followup change.	2024-01-22 09:34:20 -08:00
Jacob Hoffman-Andrews	cd3bbf91ad	test: move SRV stanzas from config-next to config (#7243 ) Service discovery via SRV records is now deployed in prod.	2024-01-10 10:31:23 -08:00
Phil Porada	72e01b337a	ceremony: Distinguish between intermediate and cross-sign ceremonies (#7005 ) In `//cmd/ceremony`: * Added `CertificateToCrossSignPath` to the `cross-certificate` ceremony type. This new input field takes an existing certificate that will be cross-signed and performs checks against the manually configured data in each ceremony file. * Added byte-for-byte subject/issuer comparison checks to root, intermediate, and cross-certificate ceremonies to detect that signing is happening as expected. * Added Fermat factorization check from the `//goodkey` package to all functions that generate new key material. In `//linter`: * The Check function now exports linting certificate bytes. The idea is that a linting certificate's `tbsCertificate` bytes can be compared against the final certificate's `tbsCertificate` bytes as a verification that `x509.CreateCertificate` was deterministic and produced identical DER bytes after each signing operation. Other notable changes: * Re-orders the issuers list in each CA config to match staging and production. There is an ordering issue mentioned by @aarongable two years ago on IN-5913 that didn't make it's way back to this repository. > Order here matters – the default chain we serve for each intermediate should be the first listed chain containing that intermediate. * Enables `ECDSAForAll` in `config-next` CA configs to match Staging. * Generates 2x new ECDSA subordinate CAs cross-signed by an RSA root and adds these chains to the WFE for clients to download. * Increased the test.sh startup timeout to account for the extra ceremony run time. Fixes https://github.com/letsencrypt/boulder/issues/7003 --------- Co-authored-by: Aaron Gable <aaron@letsencrypt.org>	2023-08-23 14:01:19 -04:00
Aaron Gable	22fd579cf2	ARI: write Retry-After header before body (#6787 ) When sending an ARI response, write the Retry-After header before writing the JSON response body. This is necessary because http.ResponseWriter implicitly calls WriteHeader whenever Write is called, flushing all headers to the network and preventing any additional headers from being written. Unfortunately, the unittests use httptest.ResponseRecorder, which doesn't seem to enforce this invariant (it's happy to report headers which were written after the body). Add a header check to the integration tests, to make up for this deficiency.	2023-03-31 10:48:45 -07:00
Matthew McPherrin	49851d7afd	Remove Beeline configuration (#6765 ) In a previous PR, #6733, this configuration was marked deprecated pending removal. Here is that removal.	2023-03-23 16:58:36 -04:00
Matthew McPherrin	05c9106eba	lints: Consistently format JSON configuration files (#6755 ) - Consistently format existing test JSON config files - Add a small Python script which loads and dumps JSON files - Add CI JSON lint test to CI --------- Co-authored-by: Aaron Gable <aaron@aarongable.com>	2023-03-20 18:11:19 -04:00
Phil Porada	aae4175186	Remove deprecated feature flags (#6566 ) Remove deprecated feature flags. Fixes #6559	2023-01-23 20:56:15 -05:00
Samantha	9c12e58c7b	grpc: Allow static host override in client config (#6423 ) - Add a new gRPC client config field which overrides the dNSName checked in the certificate presented by the gRPC server. - Revert all test gRPC credentials to `<service>.boulder` - Revert all ClientNames in gRPC server configs to `<service>.boulder` - Set all gRPC clients in `test/config` to use `serverAddress` + `hostOverride` - Set all gRPC clients in `test/config-next` to use `srvLookup` + `hostOverride` - Rename incorrect SRV record for `ca` with port `9096` to `ca-ocsp` - Rename incorrect SRV record for `ca` with port `9106` to `ca-crl` Resolves #6424	2022-10-03 15:23:55 -07:00
Samantha	90eb90bdbe	test: Replace sd-test-srv with consul (#6389 ) - Add a dedicated Consul container - Replace `sd-test-srv` with Consul - Add documentation for configuring Consul - Re-issue all gRPC credentials for `<service-name>.service.consul` Part of #6111	2022-09-19 16:13:53 -07:00
Jacob Hoffman-Andrews	db044a8822	log: fix spurious honeycomb warnings; improve stdout logger (#6364 ) Honeycomb was emitting logs directly to stderr like this: ``` WARN: Missing API Key. WARN: Dataset is ignored in favor of service name. Data will be sent to service name: boulder ``` Fix this by providing a fake API key and replacing "dataset" with "serviceName" in configs. Also add missing Honeycomb configs for crl-updater. For stdout-only logger, include checksums and escape newlines.	2022-09-14 11:25:02 -07:00
Jacob Hoffman-Andrews	4467cf27db	Update config from config-next (#6051 ) This copies over settings from config-next that are now deployed in prod. Also, I updated a comment in sd-test-srv to more accurately describe how SRV records work.	2022-04-19 12:10:26 -07:00
Aaron Gable	910dde95f6	Clean up goodkey configs (#5993 ) Fixes https://github.com/letsencrypt/boulder/issues/5851	2022-03-15 15:26:19 -07:00
Aaron Gable	5c02deabfb	Remove wfe1 integration tests (#5840 ) These tests are testing functionality that is no longer in use in production deployments of Boulder. As we go about removing wfe1 functionality, these tests will break, so let's just remove them wholesale right now. I have verified that all of the tests removed in this PR are duplicated against wfe2. One of the changes in this PR is to cease starting up the wfe1 process in the integration tests at all. However, that component was serving requests for the AIA Issuer URL, which gets queried by various OCSP and revocation tests. In order to keep those tests working, this change also adds an integration-test-only handler to wfe2, and updates the CA configuration to point at the new handler. Part of #5681	2021-12-10 12:40:22 -08:00
Jacob Hoffman-Andrews	ba0ea090b2	integration: save hierarchy across runs (#5729 ) This allows repeated runs using the same hiearchy, and avoids spurious errors from ocsp-updater saying "This CA doesn't have an issuer cert with ID XXX" Fixes #5721	2021-10-20 17:06:33 -07:00
Aaron Gable	9abb39d4d6	Honeycomb integration proof-of-concept (#5408 ) Add Honeycomb tracing to all Boulder components which act as HTTP servers, gRPC servers, or gRPC clients. Add many values which we currently emit to logs to the trace spans. Add a way to configure the Honeycomb integration to our config files, and by default configure all of our tests to "mute" (send nothing). Followup changes will refine the configuration, attempt to reduce the new dependency load, and introduce better sampling. Part of https://github.com/letsencrypt/dev-misc-tickets/issues/218	2021-05-24 16:13:08 -07:00
Aaron Gable	379826d4b5	WFE2: Improve support for multiple issuers & chains (#5247 ) This change simplifies and hardens the wfe2's support for having multiple issuers, and multiple chains for each issuer, configured and loaded in memory. The only config-visible change is replacing the old two separate config values (`certificateChains` and `alternateCertificateChains`) with a single value (`chains`). This new value does not require the user to know and hand-code the AIA URLs at which the certificates are available; instead the chains are simply presented as lists of files. If this new config value is present, the old config values will be ignored; if it is not, the old config values will be respected. Behind the scenes, the chain loading code has been completely changed. Instead of loading PEM bytes directly from the file, and then asserting various things (line endings, no trailing bits, etc) about those bytes, we now parse a certificate from the file, and in-memory recreate the PEM from that certificate. This approach allows the file loading to be much more forgiving, while also being stricter: we now check that each certificate in the chain is correctly signed by the next cert, and that the last cert in the chain is a self-signed root. Within the WFE itself, most of the internal structure has been retained. However, both the internal `issuerCertificates` (used for checking that certs we are asked to revoke were in fact issued by us) and the `certificateChains` (used to append chains to end-entity certs when served to clients) have been updated to be maps keyed by IssuerNameID. This allows revocation checking to not have to iterate through the whole list of issuers, and also makes it easy to double-check that the signatures on end-entity certs are valid before serving them. Actual checking of the validity will come in a follow-up change, due to the invasive nature of the necessary test changes. Fixes #5164	2021-01-27 15:07:58 -08:00
Jacob Hoffman-Andrews	56d581613c	Update test/config. (#4923 ) This copies over a number of features flags and other settings from test/config-next that have been applied in prod. Also, remove the config-next gate on various tests.	2020-07-01 17:59:14 -07:00
Roland Bracewell Shoemaker	7673f02803	Use cmd/ceremony in integration tests (#4832 ) This ended up taking a lot more work than I expected. In order to make the implementation more robust a bunch of stuff we previously relied on has been ripped out in order to reduce unnecessary complexity (I think I insisted on a bunch of this in the first place, so glad I can kill it now). In particular this change: * Removes bhsm and pkcs11-proxy: softhsm and pkcs11-proxy don't play well together, and any softhsm manipulation would need to happen on bhsm, then require a restart of pkcs11-proxy to pull in the on-disk changes. This makes manipulating softhsm from the boulder container extremely difficult, and because of the need to initialize new on each run (described below) we need direct access to the softhsm2 tools since pkcs11-tool cannot do slot initialization operations over the wire. I originally argued for bhsm as a way to mimic a network attached HSM, mainly so that we could do network level fault testing. In reality we've never actually done this, and the extra complexity is not really realistic for a handful of reasons. It seems better to just rip it out and operate directly on a local softhsm instance (the other option would be to use pkcs11-proxy locally, but this still would require manually restarting the proxy whenever softhsm2-util was used, and wouldn't really offer any realistic benefit). * Initializes the softhsm slots on each integration test run, rather than when creating the docker image (this is necessary to prevent churn in test/cert-ceremonies/generate.go, which would need to be updated to reflect the new slot IDs each time a new boulder-tools image was created since slot IDs are randomly generated) * Installs softhsm from source so that we can use a more up to date version (2.5.0 vs. 2.2.0 which is in the debian repo) * Generates the root and intermediate private keys in softhsm and writes out the root and intermediate public keys to /tmp for use in integration tests (the existing test-{ca,root} certs are kept in test/ because they are used in a whole bunch of unit tests. At some point these should probably be renamed/moved to be more representative of what they are used for, but that is left for a follow-up in order to keep the churn in this PR as related to the ceremony work as possible) Another follow-up item here is that we should really be zeroing out the database at the start of each integration test run, since certain things like certificates and ocsp responses will be signed by a key/issuer that is no longer is use/doesn't match the current key/issuer. Fixes #4832.	2020-06-03 15:20:23 -07:00
Jacob Hoffman-Andrews	87fb6028c1	Add log validator to integration tests (#4782 ) For now this mainly provides an example config and confirms that log-validator can start up and shut down cleanly, as well as provide a stat indicating how many log lines it has handled. This introduces a syslog config to the boulder-tools image that will write logs to /var/log/program.log. It also tweaks the various .json config files so they have non-default syslogLevel, to ensure they actually write something for log-validator to verify.	2020-04-20 13:33:42 -07:00
Jacob Hoffman-Andrews	36c1f1ab2d	Deprecate some feature flags (#4771 ) Deprecate some feature flags. These are all enabled in production.	2020-04-13 15:49:55 -07:00
Daniel McCarney	bff9eb0534	CI/Dev: restore allowOrigins wfe/wfe2 config. (#4650 ) In `67ec373a96` we removed "unused" WFE and WFE2 config elements. Unfortunately I missed that one of these elements, `allowOrigins`, is used and without this config in place CORS is broken. We have unit tests for the CORS headers but we did not have any end-to-end integration tests that would catch a problem with the WFE/WFE2 missing the `allowOrigins` config element. This commit restores the `allowOrigins` config value across the WFE/WFE2 configs and also adds a very small integration test. That test only checks one CORS header and only for the HTTP ACMEv2 endpoint but I think it's sufficient for the moment (and definitely better than nothing). Prior to fixing the config elements the integration test fails as expected: ``` --- FAIL: TestWFECORS (0.00s) wfe_test.go:28: "" != "*" FAIL FAIL github.com/letsencrypt/boulder/test/integration 0.014s FAIL ```	2020-01-17 14:41:20 -05:00
Daniel McCarney	67ec373a96	CI/Dev: Delete old/unused WFE config elements. (#4641 ) The config elements deleted from the four WFE config files are not used anywhere.	2020-01-10 12:37:22 -05:00
Roland Bracewell Shoemaker	ea231adc36	features: remove deprecated feature flags (#4607 ) Confirmed none of these features are currently present in any staging or production configs.	2019-12-09 15:59:27 -05:00
Jacob Hoffman-Andrews	bdd29a1e27	Promote authzv2 to test/config now that it's live (#4421 ) This also removes some awkward dancing we did in integration_test.py to run setup_twenty_days_ago under the opposite config of whatever we were about to run tests under. Reverts most of #4288 and #4290.	2019-09-05 12:33:56 -07:00
Jacob Hoffman-Andrews	5e7fee0c4a	test: update test/config with deployed configs. (#4396 )	2019-08-09 12:08:56 -04:00
Daniel McCarney	d1daeee831	Config: serverAddresses -> serverAddress. (#4035 ) The plural `serverAddresses` field in gRPC config has been deprecated for a bit now. We've removed the last usages of it in our staging/prod environments and can clear out the related code. Moving forward we only support a singular `serverAddress` and rely on DNS to direct to multiple instances of a given server.	2019-01-25 10:50:53 -08:00
Roland Bracewell Shoemaker	842739bccd	Remove deprecated features that have been purged from prod and staging configs (#4001 )	2019-01-15 16:16:35 -08:00
Jacob Hoffman-Andrews	4670be1210	Reduce log level for WFE in tests. (#3918 ) Our Travis output is quite verbose with the WFE output, and it's very rare that we have to reference it. I'd like to remove the INFO-level logs (i.e. the logs of every request) so that it's easier to see real errors, and faster to scroll to the bottom of logs of failed runs.	2018-11-01 09:50:41 -04:00
Jacob Hoffman-Andrews	b2f5cf39b9	Bring test/config up to date with test/config-next (#3743 ) Notably, enable the precertificate flow, RPCHeadroom, and multi-IP hostnames. Lots of other changes and feature flags too.	2018-06-01 12:00:52 -07:00
Jacob Hoffman-Andrews	f7dd91534d	Backport config-next/wfe2.json changes to config/ (#3721 )	2018-05-17 08:24:34 -04:00
Jacob Hoffman-Andrews	a4421ae75b	Run gRPC backends on multiple IPs instead of multiple ports (#3679 ) We're currently stuck on gRPC v1.1 because of a breaking change to certificate validation in gRPC 1.8. Our gRPC balancer uses a static list of multiple hostnames, and expects to validate against those hostnames. However gRPC expects that a service is one hostname, with multiple IP addresses, and validates all those IP addresses against the same hostname. See grpc/grpc-go#2012. If we follow gRPC's assumptions, we can rip out our custom Balancer and custom TransportCredentials, and will probably have a lower-friction time in general. This PR is the first step in doing so. In order to satisfy the "multiple IPs, one port" property of gRPC backends in our Docker container infrastructure, we switch to Docker's user-defined networking. This allows us to give the Boulder container multiple IP addresses on different local networks, and gives it different DNS aliases in each network. In startservers.py, each shard of a service listens on a different DNS alias for that service, and therefore a different IP address. The listening port for each shard of a service is now identical. This change also updates the gRPC service certificates. Now, each certificate that is used in a gRPC service (as opposed to something that is "only" a client) has three names. For instance, sa1.boulder, sa2.boulder, and sa.boulder (the generic service name). For now, we are validating against the specific hostnames. When we update our gRPC dependency, we will begin validating against the generic service name. Incidentally, the DNS aliases feature of Docker allows us to get rid of some hackery in entrypoint.sh that inserted entries into /etc/hosts. Note: Boulder now has a dependency on the DNS aliases feature in Docker. By default, docker-compose run creates a temporary container and doesn't assign any aliases to it. We now need to specify docker-compose run --use-aliases to get the correct behavior. Without --use-aliases, Boulder won't be able to resolve the hostnames it wants to bind to.	2018-05-07 10:38:31 -07:00
Daniel McCarney	054f181458	load-generator: send correct ACMEv2 Content-Type on POST (#3667 ) load generator: send correct ACMEv2 Content-Type on POST. This PR updates the Boulder load-generator to send the correct ACMEv2 Content-Type header when POSTing the ACME server. This is required for ACMEv2 and without it all POST requests to the WFE2 running a test/config-next configuration result in malformed 400 errors. While only required by ACMEv2 this commit sends it for ACMEv1 requests as well. No harm no foul. integration tests: allow running just the load generator. Prior to this PR an omission in an if statement in integration-test.py meant that you couldn't invoke test/integration-test.py with just the --load argument to only run the load generator. This commit updates the if to allow this use case.	2018-05-01 12:22:43 -07:00
Roland Bracewell Shoemaker	0a86573a73	Update integration tests	2018-04-20 13:18:40 -07:00
Jacob Hoffman-Andrews	c556a1a20d	Reduce spurious errors in integration test (#3436 ) Boulder is fairly noisy about gRPC connection errors. This is a mixed blessing: Our gRPC configuration will try to reconnect until it hits an RPC deadline, and most likely eventually succeed. In that case, we don't consider those to really be errors. However, in cases where a connection is repeatedly failing, we'd like to see errors in the logs about connection failure, rather than "deadline exceeded." So we want to keep logging of gRPC errors. However, right now we get a lot of these errors logged during integration tests. They make the output hard to read, and may disguise more serious errors. So we'd like to avoid causing such errors in normal integration test operation. This change reorders the startup of Boulder components by their gRPC dependencies, so everything's backend is likely to be up and running before it starts. It also reverses that order for clean shutdowns, and waits for each process to exit before signalling the next one. With these changes, I still got connection errors. Taking listenbuddy out of the gRPC path fixed them. I believe the issue is that listenbuddy is not a truly transparent proxy. In particular, it accepts an inbound TCP connection before opening an outbound TCP connection. If opening that outbound connection results in "connection refused," it closes the inbound connection. That means gRPC sees a "connection closed" (or "connection reset"?) rather than "connection refused". I'm guessing it handles those cases differently, explaining the different error results. We've been using listenbuddy to trigger disconnects while Boulder is running, to ensure that gRPC's reconnect code works. I think we can probably rely on gRPC's reconnect to work. The initial problem that led us to start testing this was a configuration problem; now that we have the configuration we want, we should be fine and don't need to keep testing reconnects on every integration test run.	2018-02-12 18:17:50 -08:00
Daniel McCarney	ba264a5091	Remove unused WFE2 feature flags. (#3375 ) The WFE2 doesn't check any of the feature flags that are configured in the `test/config/wfe2.json` and `test/config-next/wfe2.json` config files - we default to acting as if all new features are enabled for the V2 work. This commit removes the flags from the config to avoid confusion or expectations that changing the config will disable the features.	2018-01-17 12:28:19 -08:00

1 2

54 Commits