boulder

Commit Graph

Author	SHA1	Message	Date
Jacob Hoffman-Andrews	cd3bbf91ad	test: move SRV stanzas from config-next to config (#7243 ) Service discovery via SRV records is now deployed in prod.	2024-01-10 10:31:23 -08:00
Phil Porada	72e01b337a	ceremony: Distinguish between intermediate and cross-sign ceremonies (#7005 ) In `//cmd/ceremony`: * Added `CertificateToCrossSignPath` to the `cross-certificate` ceremony type. This new input field takes an existing certificate that will be cross-signed and performs checks against the manually configured data in each ceremony file. * Added byte-for-byte subject/issuer comparison checks to root, intermediate, and cross-certificate ceremonies to detect that signing is happening as expected. * Added Fermat factorization check from the `//goodkey` package to all functions that generate new key material. In `//linter`: * The Check function now exports linting certificate bytes. The idea is that a linting certificate's `tbsCertificate` bytes can be compared against the final certificate's `tbsCertificate` bytes as a verification that `x509.CreateCertificate` was deterministic and produced identical DER bytes after each signing operation. Other notable changes: * Re-orders the issuers list in each CA config to match staging and production. There is an ordering issue mentioned by @aarongable two years ago on IN-5913 that didn't make it's way back to this repository. > Order here matters – the default chain we serve for each intermediate should be the first listed chain containing that intermediate. * Enables `ECDSAForAll` in `config-next` CA configs to match Staging. * Generates 2x new ECDSA subordinate CAs cross-signed by an RSA root and adds these chains to the WFE for clients to download. * Increased the test.sh startup timeout to account for the extra ceremony run time. Fixes https://github.com/letsencrypt/boulder/issues/7003 --------- Co-authored-by: Aaron Gable <aaron@letsencrypt.org>	2023-08-23 14:01:19 -04:00
Aaron Gable	22fd579cf2	ARI: write Retry-After header before body (#6787 ) When sending an ARI response, write the Retry-After header before writing the JSON response body. This is necessary because http.ResponseWriter implicitly calls WriteHeader whenever Write is called, flushing all headers to the network and preventing any additional headers from being written. Unfortunately, the unittests use httptest.ResponseRecorder, which doesn't seem to enforce this invariant (it's happy to report headers which were written after the body). Add a header check to the integration tests, to make up for this deficiency.	2023-03-31 10:48:45 -07:00
Matthew McPherrin	49851d7afd	Remove Beeline configuration (#6765 ) In a previous PR, #6733, this configuration was marked deprecated pending removal. Here is that removal.	2023-03-23 16:58:36 -04:00
Matthew McPherrin	05c9106eba	lints: Consistently format JSON configuration files (#6755 ) - Consistently format existing test JSON config files - Add a small Python script which loads and dumps JSON files - Add CI JSON lint test to CI --------- Co-authored-by: Aaron Gable <aaron@aarongable.com>	2023-03-20 18:11:19 -04:00
Phil Porada	aae4175186	Remove deprecated feature flags (#6566 ) Remove deprecated feature flags. Fixes #6559	2023-01-23 20:56:15 -05:00
Samantha	9c12e58c7b	grpc: Allow static host override in client config (#6423 ) - Add a new gRPC client config field which overrides the dNSName checked in the certificate presented by the gRPC server. - Revert all test gRPC credentials to `<service>.boulder` - Revert all ClientNames in gRPC server configs to `<service>.boulder` - Set all gRPC clients in `test/config` to use `serverAddress` + `hostOverride` - Set all gRPC clients in `test/config-next` to use `srvLookup` + `hostOverride` - Rename incorrect SRV record for `ca` with port `9096` to `ca-ocsp` - Rename incorrect SRV record for `ca` with port `9106` to `ca-crl` Resolves #6424	2022-10-03 15:23:55 -07:00
Samantha	90eb90bdbe	test: Replace sd-test-srv with consul (#6389 ) - Add a dedicated Consul container - Replace `sd-test-srv` with Consul - Add documentation for configuring Consul - Re-issue all gRPC credentials for `<service-name>.service.consul` Part of #6111	2022-09-19 16:13:53 -07:00
Jacob Hoffman-Andrews	db044a8822	log: fix spurious honeycomb warnings; improve stdout logger (#6364 ) Honeycomb was emitting logs directly to stderr like this: ``` WARN: Missing API Key. WARN: Dataset is ignored in favor of service name. Data will be sent to service name: boulder ``` Fix this by providing a fake API key and replacing "dataset" with "serviceName" in configs. Also add missing Honeycomb configs for crl-updater. For stdout-only logger, include checksums and escape newlines.	2022-09-14 11:25:02 -07:00
Jacob Hoffman-Andrews	4467cf27db	Update config from config-next (#6051 ) This copies over settings from config-next that are now deployed in prod. Also, I updated a comment in sd-test-srv to more accurately describe how SRV records work.	2022-04-19 12:10:26 -07:00
Aaron Gable	910dde95f6	Clean up goodkey configs (#5993 ) Fixes https://github.com/letsencrypt/boulder/issues/5851	2022-03-15 15:26:19 -07:00
Aaron Gable	5c02deabfb	Remove wfe1 integration tests (#5840 ) These tests are testing functionality that is no longer in use in production deployments of Boulder. As we go about removing wfe1 functionality, these tests will break, so let's just remove them wholesale right now. I have verified that all of the tests removed in this PR are duplicated against wfe2. One of the changes in this PR is to cease starting up the wfe1 process in the integration tests at all. However, that component was serving requests for the AIA Issuer URL, which gets queried by various OCSP and revocation tests. In order to keep those tests working, this change also adds an integration-test-only handler to wfe2, and updates the CA configuration to point at the new handler. Part of #5681	2021-12-10 12:40:22 -08:00
Jacob Hoffman-Andrews	ba0ea090b2	integration: save hierarchy across runs (#5729 ) This allows repeated runs using the same hiearchy, and avoids spurious errors from ocsp-updater saying "This CA doesn't have an issuer cert with ID XXX" Fixes #5721	2021-10-20 17:06:33 -07:00
Aaron Gable	9abb39d4d6	Honeycomb integration proof-of-concept (#5408 ) Add Honeycomb tracing to all Boulder components which act as HTTP servers, gRPC servers, or gRPC clients. Add many values which we currently emit to logs to the trace spans. Add a way to configure the Honeycomb integration to our config files, and by default configure all of our tests to "mute" (send nothing). Followup changes will refine the configuration, attempt to reduce the new dependency load, and introduce better sampling. Part of https://github.com/letsencrypt/dev-misc-tickets/issues/218	2021-05-24 16:13:08 -07:00
Aaron Gable	379826d4b5	WFE2: Improve support for multiple issuers & chains (#5247 ) This change simplifies and hardens the wfe2's support for having multiple issuers, and multiple chains for each issuer, configured and loaded in memory. The only config-visible change is replacing the old two separate config values (`certificateChains` and `alternateCertificateChains`) with a single value (`chains`). This new value does not require the user to know and hand-code the AIA URLs at which the certificates are available; instead the chains are simply presented as lists of files. If this new config value is present, the old config values will be ignored; if it is not, the old config values will be respected. Behind the scenes, the chain loading code has been completely changed. Instead of loading PEM bytes directly from the file, and then asserting various things (line endings, no trailing bits, etc) about those bytes, we now parse a certificate from the file, and in-memory recreate the PEM from that certificate. This approach allows the file loading to be much more forgiving, while also being stricter: we now check that each certificate in the chain is correctly signed by the next cert, and that the last cert in the chain is a self-signed root. Within the WFE itself, most of the internal structure has been retained. However, both the internal `issuerCertificates` (used for checking that certs we are asked to revoke were in fact issued by us) and the `certificateChains` (used to append chains to end-entity certs when served to clients) have been updated to be maps keyed by IssuerNameID. This allows revocation checking to not have to iterate through the whole list of issuers, and also makes it easy to double-check that the signatures on end-entity certs are valid before serving them. Actual checking of the validity will come in a follow-up change, due to the invasive nature of the necessary test changes. Fixes #5164	2021-01-27 15:07:58 -08:00
Jacob Hoffman-Andrews	56d581613c	Update test/config. (#4923 ) This copies over a number of features flags and other settings from test/config-next that have been applied in prod. Also, remove the config-next gate on various tests.	2020-07-01 17:59:14 -07:00
Roland Bracewell Shoemaker	7673f02803	Use cmd/ceremony in integration tests (#4832 ) This ended up taking a lot more work than I expected. In order to make the implementation more robust a bunch of stuff we previously relied on has been ripped out in order to reduce unnecessary complexity (I think I insisted on a bunch of this in the first place, so glad I can kill it now). In particular this change: * Removes bhsm and pkcs11-proxy: softhsm and pkcs11-proxy don't play well together, and any softhsm manipulation would need to happen on bhsm, then require a restart of pkcs11-proxy to pull in the on-disk changes. This makes manipulating softhsm from the boulder container extremely difficult, and because of the need to initialize new on each run (described below) we need direct access to the softhsm2 tools since pkcs11-tool cannot do slot initialization operations over the wire. I originally argued for bhsm as a way to mimic a network attached HSM, mainly so that we could do network level fault testing. In reality we've never actually done this, and the extra complexity is not really realistic for a handful of reasons. It seems better to just rip it out and operate directly on a local softhsm instance (the other option would be to use pkcs11-proxy locally, but this still would require manually restarting the proxy whenever softhsm2-util was used, and wouldn't really offer any realistic benefit). * Initializes the softhsm slots on each integration test run, rather than when creating the docker image (this is necessary to prevent churn in test/cert-ceremonies/generate.go, which would need to be updated to reflect the new slot IDs each time a new boulder-tools image was created since slot IDs are randomly generated) * Installs softhsm from source so that we can use a more up to date version (2.5.0 vs. 2.2.0 which is in the debian repo) * Generates the root and intermediate private keys in softhsm and writes out the root and intermediate public keys to /tmp for use in integration tests (the existing test-{ca,root} certs are kept in test/ because they are used in a whole bunch of unit tests. At some point these should probably be renamed/moved to be more representative of what they are used for, but that is left for a follow-up in order to keep the churn in this PR as related to the ceremony work as possible) Another follow-up item here is that we should really be zeroing out the database at the start of each integration test run, since certain things like certificates and ocsp responses will be signed by a key/issuer that is no longer is use/doesn't match the current key/issuer. Fixes #4832.	2020-06-03 15:20:23 -07:00
Jacob Hoffman-Andrews	87fb6028c1	Add log validator to integration tests (#4782 ) For now this mainly provides an example config and confirms that log-validator can start up and shut down cleanly, as well as provide a stat indicating how many log lines it has handled. This introduces a syslog config to the boulder-tools image that will write logs to /var/log/program.log. It also tweaks the various .json config files so they have non-default syslogLevel, to ensure they actually write something for log-validator to verify.	2020-04-20 13:33:42 -07:00
Jacob Hoffman-Andrews	36c1f1ab2d	Deprecate some feature flags (#4771 ) Deprecate some feature flags. These are all enabled in production.	2020-04-13 15:49:55 -07:00
Daniel McCarney	bff9eb0534	CI/Dev: restore allowOrigins wfe/wfe2 config. (#4650 ) In `67ec373a96` we removed "unused" WFE and WFE2 config elements. Unfortunately I missed that one of these elements, `allowOrigins`, is used and without this config in place CORS is broken. We have unit tests for the CORS headers but we did not have any end-to-end integration tests that would catch a problem with the WFE/WFE2 missing the `allowOrigins` config element. This commit restores the `allowOrigins` config value across the WFE/WFE2 configs and also adds a very small integration test. That test only checks one CORS header and only for the HTTP ACMEv2 endpoint but I think it's sufficient for the moment (and definitely better than nothing). Prior to fixing the config elements the integration test fails as expected: ``` --- FAIL: TestWFECORS (0.00s) wfe_test.go:28: "" != "*" FAIL FAIL github.com/letsencrypt/boulder/test/integration 0.014s FAIL ```	2020-01-17 14:41:20 -05:00
Daniel McCarney	67ec373a96	CI/Dev: Delete old/unused WFE config elements. (#4641 ) The config elements deleted from the four WFE config files are not used anywhere.	2020-01-10 12:37:22 -05:00
Roland Bracewell Shoemaker	ea231adc36	features: remove deprecated feature flags (#4607 ) Confirmed none of these features are currently present in any staging or production configs.	2019-12-09 15:59:27 -05:00
Jacob Hoffman-Andrews	bdd29a1e27	Promote authzv2 to test/config now that it's live (#4421 ) This also removes some awkward dancing we did in integration_test.py to run setup_twenty_days_ago under the opposite config of whatever we were about to run tests under. Reverts most of #4288 and #4290.	2019-09-05 12:33:56 -07:00
Jacob Hoffman-Andrews	5e7fee0c4a	test: update test/config with deployed configs. (#4396 )	2019-08-09 12:08:56 -04:00
Daniel McCarney	d1daeee831	Config: serverAddresses -> serverAddress. (#4035 ) The plural `serverAddresses` field in gRPC config has been deprecated for a bit now. We've removed the last usages of it in our staging/prod environments and can clear out the related code. Moving forward we only support a singular `serverAddress` and rely on DNS to direct to multiple instances of a given server.	2019-01-25 10:50:53 -08:00
Roland Bracewell Shoemaker	842739bccd	Remove deprecated features that have been purged from prod and staging configs (#4001 )	2019-01-15 16:16:35 -08:00
Jacob Hoffman-Andrews	4670be1210	Reduce log level for WFE in tests. (#3918 ) Our Travis output is quite verbose with the WFE output, and it's very rare that we have to reference it. I'd like to remove the INFO-level logs (i.e. the logs of every request) so that it's easier to see real errors, and faster to scroll to the bottom of logs of failed runs.	2018-11-01 09:50:41 -04:00
Jacob Hoffman-Andrews	b2f5cf39b9	Bring test/config up to date with test/config-next (#3743 ) Notably, enable the precertificate flow, RPCHeadroom, and multi-IP hostnames. Lots of other changes and feature flags too.	2018-06-01 12:00:52 -07:00
Jacob Hoffman-Andrews	f7dd91534d	Backport config-next/wfe2.json changes to config/ (#3721 )	2018-05-17 08:24:34 -04:00
Jacob Hoffman-Andrews	a4421ae75b	Run gRPC backends on multiple IPs instead of multiple ports (#3679 ) We're currently stuck on gRPC v1.1 because of a breaking change to certificate validation in gRPC 1.8. Our gRPC balancer uses a static list of multiple hostnames, and expects to validate against those hostnames. However gRPC expects that a service is one hostname, with multiple IP addresses, and validates all those IP addresses against the same hostname. See grpc/grpc-go#2012. If we follow gRPC's assumptions, we can rip out our custom Balancer and custom TransportCredentials, and will probably have a lower-friction time in general. This PR is the first step in doing so. In order to satisfy the "multiple IPs, one port" property of gRPC backends in our Docker container infrastructure, we switch to Docker's user-defined networking. This allows us to give the Boulder container multiple IP addresses on different local networks, and gives it different DNS aliases in each network. In startservers.py, each shard of a service listens on a different DNS alias for that service, and therefore a different IP address. The listening port for each shard of a service is now identical. This change also updates the gRPC service certificates. Now, each certificate that is used in a gRPC service (as opposed to something that is "only" a client) has three names. For instance, sa1.boulder, sa2.boulder, and sa.boulder (the generic service name). For now, we are validating against the specific hostnames. When we update our gRPC dependency, we will begin validating against the generic service name. Incidentally, the DNS aliases feature of Docker allows us to get rid of some hackery in entrypoint.sh that inserted entries into /etc/hosts. Note: Boulder now has a dependency on the DNS aliases feature in Docker. By default, docker-compose run creates a temporary container and doesn't assign any aliases to it. We now need to specify docker-compose run --use-aliases to get the correct behavior. Without --use-aliases, Boulder won't be able to resolve the hostnames it wants to bind to.	2018-05-07 10:38:31 -07:00
Daniel McCarney	054f181458	load-generator: send correct ACMEv2 Content-Type on POST (#3667 ) load generator: send correct ACMEv2 Content-Type on POST. This PR updates the Boulder load-generator to send the correct ACMEv2 Content-Type header when POSTing the ACME server. This is required for ACMEv2 and without it all POST requests to the WFE2 running a test/config-next configuration result in malformed 400 errors. While only required by ACMEv2 this commit sends it for ACMEv1 requests as well. No harm no foul. integration tests: allow running just the load generator. Prior to this PR an omission in an if statement in integration-test.py meant that you couldn't invoke test/integration-test.py with just the --load argument to only run the load generator. This commit updates the if to allow this use case.	2018-05-01 12:22:43 -07:00
Roland Bracewell Shoemaker	0a86573a73	Update integration tests	2018-04-20 13:18:40 -07:00
Jacob Hoffman-Andrews	c556a1a20d	Reduce spurious errors in integration test (#3436 ) Boulder is fairly noisy about gRPC connection errors. This is a mixed blessing: Our gRPC configuration will try to reconnect until it hits an RPC deadline, and most likely eventually succeed. In that case, we don't consider those to really be errors. However, in cases where a connection is repeatedly failing, we'd like to see errors in the logs about connection failure, rather than "deadline exceeded." So we want to keep logging of gRPC errors. However, right now we get a lot of these errors logged during integration tests. They make the output hard to read, and may disguise more serious errors. So we'd like to avoid causing such errors in normal integration test operation. This change reorders the startup of Boulder components by their gRPC dependencies, so everything's backend is likely to be up and running before it starts. It also reverses that order for clean shutdowns, and waits for each process to exit before signalling the next one. With these changes, I still got connection errors. Taking listenbuddy out of the gRPC path fixed them. I believe the issue is that listenbuddy is not a truly transparent proxy. In particular, it accepts an inbound TCP connection before opening an outbound TCP connection. If opening that outbound connection results in "connection refused," it closes the inbound connection. That means gRPC sees a "connection closed" (or "connection reset"?) rather than "connection refused". I'm guessing it handles those cases differently, explaining the different error results. We've been using listenbuddy to trigger disconnects while Boulder is running, to ensure that gRPC's reconnect code works. I think we can probably rely on gRPC's reconnect to work. The initial problem that led us to start testing this was a configuration problem; now that we have the configuration we want, we should be fine and don't need to keep testing reconnects on every integration test run.	2018-02-12 18:17:50 -08:00
Daniel McCarney	ba264a5091	Remove unused WFE2 feature flags. (#3375 ) The WFE2 doesn't check any of the feature flags that are configured in the `test/config/wfe2.json` and `test/config-next/wfe2.json` config files - we default to acting as if all new features are enabled for the V2 work. This commit removes the flags from the config to avoid confusion or expectations that changing the config will disable the features.	2018-01-17 12:28:19 -08:00
Jacob Hoffman-Andrews	827f7859f2	Fix issuerCert in test configs. (#3310 ) Previously, there was a disagreement between WFE and CA as to what the correct issuer certificate was. Consolidate on test-ca2.pem (h2ppy h2cker fake CA). Also, the CA configs contained an outdated entry for "IssuerCert", which was not being used: The CA configs now use an "Issuers" array to allow signing by multiple issuer certificates at once (for instance when rolling intermediates). Removed this outdated entry, and the config code for CA to load it. I've confirmed these changes match what is currently in production. Added an integration test to check for this problem in the future. Fixes #3309, thanks to @icing for bringing the issue to our attention! This also includes changes from #3321 to clarify certificates for WFE.	2018-01-09 07:56:39 -05:00
Jacob Hoffman-Andrews	fdd854a7e5	Fix various WFE2 bugs. (#3292 ) - Encode certificate as PEM. - Use lowercase for field names. - Use termsOfServiceAgreed instead of Agreement - Use a different ToS URL for v2 that points at the v2 HTTPS port. Resolves #3280	2017-12-19 13:13:29 -08:00
Jacob Hoffman-Andrews	0a64fd4066	Bring test/config up-to-date. (#3056 ) Methodology: Copy test/config-next/* into test/config/, then manually review the diffs, removing any diffs that are not yet in production.	2017-09-11 16:55:58 -04:00
Daniel McCarney	bd3e2747ba	Duplicate WFE to WFE2. (#2839 ) This PR is the initial duplication of the WFE to create a WFE2 package. The rationale is briefly explained in `wfe2/README.md`. Per #2822 this PR only lays the groundwork for further customization and deduplication. Presently both the WFE and WFE2 are identical except for the following configuration differences: * The WFE offers HTTP and HTTPS on 4000 and 4430 respectively, the WFE2 offers HTTP on 4001 and 4431. * The WFE has a debug port on 8000, the WFE2 uses the next free "8000 range port" and puts its debug service on 8013 Resolves https://github.com/letsencrypt/boulder/issues/2822	2017-07-05 13:32:45 -07:00

38 Commits