* integration: move to Python3
- Add parentheses to all print and raise calls.
- Python3 distinguishes bytes from strings. Add encode() and
decode() calls as needed to provide the correct type.
- Use requests library consistently (urllib2 is not in Python3).
- Remove shebang from Python files without a main, and update
shebang for integration-test.py.
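A minimal sketch of the bytes/str boundary pattern (values illustrative, not taken from the diff):
```
# Python 3 separates bytes (wire data) from str (text). The port adds
# conversions at the boundary rather than changing internal types.
body = b'{"status": "valid"}'  # e.g. a raw HTTP response body
text = body.decode()           # bytes -> str before json parsing/printing
wire = text.encode()           # str -> bytes before writing to a socket
```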
The three new cases separately test:
- Rechecking CAA during authz reuse.
- Successful issuance for a positive CAA record.
- Rejected issuance for a negative CAA record.
- The various CAA extensions from https://tools.ietf.org/html/draft-ietf-acme-caa-06
Importantly, this also switches `recheck.good-caa-reserved.com` to use a
dynamically generated random name. This should fix the problem where
running integration tests locally several times in a row hit an
exact-match rate limit error, which required clearing the fqdnSets table.
This also moves the creation of the client for test_recheck_caa into its
own early-setup function, so there is less test-case-specific setup in
integration-test.py.
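A sketch of the dynamic-name approach (helpers.py carries a random_domain helper; the exact format here is an assumption):
```
import random

def random_domain():
    # A fresh name per run means repeated local runs never create an
    # exact FQDN-set match, so the rate limit isn't hit.
    return "rand.%x.xyz" % random.randrange(2 ** 32)

# e.g. the recheck test would use "recheck.%s" % random_domain()
# instead of the fixed recheck.good-caa-reserved.com.
```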
When NewAuthorizationSchema is enabled, we still want v1 authzs to be reusable in
new orders. This tests that the reuse code path is implemented correctly.
Updates #4241
These two setup phases were only used by `test_expired_authz_404`,
which is adequately covered by unittests. Since each setup and teardown
is rather time consuming, this speeds up and simplifies integration
tests.
Before: 5m10s
After: 4m46s
Move from using `requests` to `urllib2` in `helpers.py`. Verified
this works with `docker-compose up`. In the future we really should
be installing our own python dependencies in the boulder-tools image
rather than relying on getting them by using the certbot virtualenv.
* Switch to instant OCSP verification in integration tests
* Move waitport to helpers and use it to determine if ocsp-responder is
alive in test_single_ocsp
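A sketch of what such a waitport helper can look like (the real signature in helpers.py may differ):
```
import socket
import time

def waitport(port, prog, timeout=10):
    """Poll until prog accepts TCP connections on localhost:port."""
    deadline = time.time() + timeout
    while time.time() < deadline:
        try:
            s = socket.create_connection(("localhost", port), timeout=1)
            s.close()
            return
        except socket.error:
            time.sleep(0.1)
    raise Exception("timed out waiting for %s on port %d" % (prog, port))
```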
As part of #4241, I need to introduce some twenty-days-ago setup. So I refactored the
only current instance (test_caa) to use a style where setup functions can be registered right
next to the test cases they affect. The @register_twenty_days_ago decorator is Python for
"call register_twenty_days_ago with the function on the next line as an argument."
I also cleaned up a bunch of related stuff:
* Removed the ACCOUNT_URI environment variable and associated function params.
This was introduced in #3736 to pass a URI to challtestsrv before we refactored for
more dynamic updates. It's not used any more.
* Removed a try / except from startChallSrv that needlessly hid errors.
* Moved setting of DNS fixtures for caa_test into the test case itself.
Enables integration tests for authz2 and fixes a few bugs that were flagged during the process. Disables the expired-authorization-purger integration tests when config-next is in use, since expired-authz-purger expects to purge some authorizations but doesn't know about authz2 authorizations; a new test will be added with #4188.
Fixes #4079.
Without this change, running a single integration test with
`INT_SKIP_SETUP` like so:
```
docker-compose run --use-aliases -e INT_FILTER="test_http_multiva_threshold_pass" -e INT_SKIP_SETUP=true -e RUN="integration" boulder ./test.sh;
```
Produces an error like:
```
+ python2 test/integration-test.py --chisel --load --filter test_http_multiva_threshold_pass --skip-setup
Traceback (most recent call last):
File "test/integration-test.py", line 309, in <module>
main()
File "test/integration-test.py", line 217, in main
caa_account_uri = caa_client.account.uri if caa_client is not None else None
UnboundLocalError: local variable 'caa_client' referenced before assignment
```
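The shape of the fix is to initialize the variable before the conditional setup; a sketch (the setup helper name is hypothetical):
```
caa_client = None
if not args.skip_setup:
    caa_client = setup_caa()  # hypothetical name for the CAA setup step
caa_account_uri = caa_client.account.uri if caa_client is not None else None
```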
This makes it a little clearer which bits are test setup helpers, and which
bits are actual test cases. It may also make it a little easier to see which cases
from the v1 tests also need a v2 test case.
Fixes#4126
## CI: restore load-generator run.
This restores running the `load-generator` during CI to make sure it doesn't bitrot. It was previously removed while we debugged the VA getting jammed up and not cleanly shutting down.
Since the global `pebble-challtestsrv` and the `load-generator`'s internal chall test srv would conflict, this requires moving the `load-generator` run to the end of integration tests and updating `startservers.py` so the load-gen integration test code can stop the `pebble-challtestsrv` before starting the `load-generator`.
The `load-generator` and associated config are updated to allow specifying bind addresses for the DNS interface of the internal challtestsrv. Multiple addresses are supported so that the `load-generator`'s chall test srv can listen on the DNS ports Boulder is configured to use. The `load-generator` config now accepts a `fakeDNS` parameter that can be used to specify the default IPv4 address returned by the `load-generator`'s DNS server for A queries.
## load-generator: support different challenges/strategies.
Updates the load-generator to support HTTP-01, DNS-01, and TLS-ALPN-01 challenge response servers. A new challenge selection configuration parameter (`ChallengeStrategy`) can be set to `"http-01"`, `"dns-01"`, or `"tls-alpn-01"` to solve only challenges of that type. Using `"random"` will let the load-generator choose a challenge type randomly.
Resolves https://github.com/letsencrypt/boulder/issues/3900
- Move fakeclock, get_future_output, and random_domain to helpers.py.
- Remove tempdir handling from integration-test.py since it's already
done in helpers.py.
- Consolidate handling of config dir into helpers.py, and add
CONFIG_NEXT boolean.
- Move RevokeAtRA config gating into verify_revocation to reduce
redundancy.
- Skip load-balancing test when filter is enabled.
- Ungate test_sct_embedding.
- Rework test_ct_submissions, which was out of date. In particular, have a couple of
logs where submitFinalCert: false, and make ct-test-srv store submission counts
by hostnames for better test case isolation.
Adds two feature flags:
* `EnforceMultiVA` to allow configuring multiple VAs but not changing the primary VA's result based on what the remote VAs return.
* `MultiVAFullResults` to allow collecting all of the remote VA results. When all results are collected a JSON log line with the differential between the primary/remote VAs is logged.
Resolves https://github.com/letsencrypt/boulder/issues/4066
We don't intend to load test the legacy WFE implementation in the future
and if we need to we can always revive this code from git. Removing it
will make refactoring the ACME v2 code to be closer to RFC 8555 easier.
Previously the v2_integration tests were imported to the global
namespace in integration-test.py. As a result, some were shadowed and
didn't run, or called methods that were in the main namespace rather
than their own.
This PR imports and runs them under their own namespace. It also fixes
some tests that were broken. Notably:
- Fixes chisel2.expect_problem.
- Fixes incorrect namespacing on some expect_problem calls.
- Removes unused ValidationError from v2_integration.
- Replaces client.key with client.net.key.
I tried dropping the RA->VA timeout to make the
`test_http_challenge_timeout` integration test faster. It seems to flake
in CI so I'm restoring the original 20s timeout. This makes
`test_http_challenge_timeout` slower but c'est la vie.
Implements a feature that enables immediate revocation instead of marking a certificate revoked and waiting for the OCSP-Updater to generate the OCSP response. This means that as soon as the request returns from the WFE the revoked OCSP response should be available to the user. This feature requires that the RA be configured to use the standalone Akamai purger service.
Fixes #4031.
The URL construction approach we were previously using for the refactored VA HTTP-01 validation code was nice but broke SNI for HTTP->HTTPS redirects. In order to preserve this functionality we need to use a custom `DialContext` handler on the HTTP Transport that overrides the target host to use a pre-resolved IP.
Resolves https://github.com/letsencrypt/boulder/issues/3969
Staging and prod both deployed the PerformValidationRPC feature flag. All running WFE/WFE2 instances are using the more accurately named PerformValidation RPC and we can strip out the old UpdateAuthorization bits. The feature flag for PerformValidationRPC remains until we clean up the staging/prod configs.
Resolves #3947 and completes the last of #3930
When the `SimplifiedVAHTTP01` feature flag is enabled we need to
preserve query parameters when reconstructing a redirect URL for the
resolved IP address.
To add integration testing for this condition the Boulder tools images
are updated to pull in an updated `pebble-challtestsrv` command
that tracks request history.
A new Python wrapper for the `pebble-challtestsrv` HTTP API is added to
centralize interacting with the chall test srv to add mock data and to
get the history of HTTP requests that have been processed.
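A sketch of such a wrapper, assuming the documented pebble-challtestsrv management endpoints and its default management address (both are assumptions here, as is the class name):
```
import json

import requests

class ChallTestServer:
    """Thin client for the pebble-challtestsrv management API."""

    def __init__(self, url="http://localhost:8055"):
        self.url = url

    def _post(self, path, body):
        response = requests.post(self.url + path, data=json.dumps(body))
        response.raise_for_status()
        return response.text

    def add_a_record(self, host, addresses):
        # Mock DNS: answer A queries for host with these addresses.
        return self._post("/add-a", {"host": host, "addresses": addresses})

    def http_request_history(self, host):
        # Return the HTTP requests the server has processed for host.
        return json.loads(self._post("/http-request-history", {"host": host}))
```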
`pebble-challtestsrv` added a `-defaultIPv4` arg we can use to simplify
the integration tests and fix FAKE_DNS usage outside of integration
tests.
A new boulder-tools image with an updated `pebble-challtestsrv` is used
and `test/startservers.py` is changed to populate `-defaultIPv4` via the
`FAKE_DNS` env var.
Now that Pebble has a `pebble-challtestsrv` we can remove the `challtestsrv`
package and associated command from Boulder. I switched CI to use
`pebble-challtestsrv`. Notably this means that we have to add our expected mock
data using the HTTP management interface. The Boulder-tools images are
regenerated to include the `pebble-challtestsrv` command.
Using this approach also allows separating the TLS-ALPN-01 and HTTPS HTTP-01
challenges by binding each challenge type in the `pebble-challtestsrv` to
a different interface, both using the same VA HTTPS port. Mock DNS directs
the VA to the correct interface.
The load-generator command that was previously using the `challtestsrv` package
from Boulder is updated to use a vendored copy of the new
`github.com/letsencrypt/challtestsrv` package.
Vendored dependencies change in two ways:
1) Gomock is updated to the latest release (matching what the Bouldertools image
provides)
2) A couple of new subpackages in `golang.org/x/net/` are added by way of
transitive dependency through the challtestsrv package.
Unit tests are confirmed to pass for `gomock`:
```
~/go/src/github.com/golang/mock/gomock$ git log --pretty=format:'%h' -n 1
51421b9
~/go/src/github.com/golang/mock/gomock$ go test ./...
ok github.com/golang/mock/gomock 0.002s
? github.com/golang/mock/gomock/internal/mock_matcher [no test files]
```
For `/x/net` all tests pass except two `/x/net/icmp` `TestDiag.go` test cases
that we have agreed are OK to ignore.
Resolves https://github.com/letsencrypt/boulder/issues/3962 and
https://github.com/letsencrypt/boulder/issues/3951
To complete https://github.com/letsencrypt/boulder/issues/3956 the `challtestsrv` is updated such that its existing TLS-ALPN-01 challenge test server will serve HTTP-01 responses with a self-signed certificate when a non-TLS-ALPN-01 request arrives. This lets the TLS-ALPN-01 challenge server double as an HTTPS version of the HTTP challenge server. The `challtestsrv` now also supports adding/removing redirects that will be served to clients when requesting matching paths.
The existing chisel/chisel2 integration tests are updated to use the `challtestsrv` instead of starting their own standalone servers. This centralizes our mock challenge responses and lets us bind the `challtestsrv` to the VA's HTTP port in `startservers.py` without clashing ports later on.
New integration tests are added for HTTP-01 redirect scenarios using the updated `challtestsrv`. These test cases cover:
* valid HTTP -> HTTP redirect
* valid HTTP -> HTTPS redirect
* invalid HTTP -> non-HTTP/HTTPS port redirect
* invalid HTTP -> non-HTTP/HTTPS protocol scheme redirect
* invalid HTTP -> bare IP redirect
* invalid HTTP redirect loop
The new integration tests shook out two fixes that were required for the legacy VA HTTP-01 code (afad22b) and one fix for the challtestsrv mock DNS (59b7d6d).
Resolves https://github.com/letsencrypt/boulder/issues/3956
The problem here was that we were doing revocation tests in the
v2 integration file that didn't block on getting the revoked OCSP
status. This meant that if the OCSP responder was running slow it
could execute a revoked-cert tick between resetting the akamai test
server in the next test and sending another purge request, which
meant we saw two purge requests when we expected to see one.
The fix was to add the blocking and purge checking/resetting to the
v2 tests. Doing this without duplicating a bunch of code required
factoring a number of functions out into a third helpers file. (I
think more code could be abstracted out to this file, but I just
wanted to start with what was needed for this change.)
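A sketch of the blocking step, assuming a verify_ocsp helper that shells out to `openssl ocsp` and returns the reported status (the helper name and signature are assumptions):
```
import time

def wait_for_ocsp_revoked(cert_file, issuer_file, url):
    # Poll until the responder serves a revoked response, so the
    # following purge-count assertions can't race the OCSP tick.
    while verify_ocsp(cert_file, issuer_file, url) != "revoked":
        time.sleep(0.25)
```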
The existing RA `UpdateAuthorization` RPC needs replacing for
two reasons:
1. The name isn't accurate - `PerformValidation` better captures
the purpose of the RPC.
2. The `core.Challenge` argument is superfluous since Key
Authorizations are not sent in the initiation POST from the client
anymore. The corresponding unmarshaling and verification are now
removed. Notably this means broken clients that were POSTing
the wrong thing and failing pre-validation will now likely fail
post-validation.
To remove `UpdateAuthorization` the new `PerformValidation`
RPC is added alongside the old one. WFE and WFE2 are
updated to use the new RPC when the perform validation
feature flag is enabled. We can remove
`UpdateAuthorization` and its associated wrappers once all
WFE instances have been updated.
Resolves https://github.com/letsencrypt/boulder/issues/3930
Removes superfluous usage of `UpdatePendingAuthorization` in the RA to update the key authorization and to test whether the authorization is pending; instead, the WFE uses the result of its initial `GetAuthorization` call.
Fixes #3923.
This adds support for the account-uri CAA parameter as specified by
section 3 of https://tools.ietf.org/html/draft-ietf-acme-caa-04, allowing
issuance to be restricted to one or more ACME accounts as specified by CAA
records.
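As an illustration (zone-file syntax; the account URI here is made up), such a record might look like:
```
example.com. IN CAA 0 issue "letsencrypt.org; accounturi=https://acme-v02.api.letsencrypt.org/acme/acct/12345"
```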
We see a fair number of ACME accounts/registrations with contact
addresses for the RFC2606 Section 3 "Reserved Example Second Level
Domain Names" (`example.com`, `example.net`, `example.org`). These are
not real contact addresses and are likely the result of the user
copy-pasting example configuration. These users will miss out on
expiration emails and other subscriber communications :-(
This commit updates the RA's `validateEmail` function to reject any
contact addresses for reserved example domain names. The corresponding
unit test is updated accordingly.
Resolves https://github.com/letsencrypt/boulder/issues/3719
When performing CAA checking respect the validation-methods parameter (if
present) and restrict the allowed authorization methods to those specified.
This allows a domain to restrict authorization methods that can be used with
Let's Encrypt.
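As an illustration, a record permitting only the dns-01 method might look like:
```
example.com. IN CAA 0 issue "letsencrypt.org; validationmethods=dns-01"
```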
This is largely based on PR #3003 (by @lukaslihotzki), which was landed and
then later reverted due to issue #3143. The bug that resulted in the previous
code being reverted has been addressed (likely inadvertently) by 76973d0f.
This implementation also includes integration tests for CAA validation-methods.
Fixes issue #3143.
We'd like to start using the DNS load balancer in the latest version of gRPC. That means putting all IPs for a service under a single hostname (or using a SRV record, but we're not taking that path). This change adds an sd-test-srv to act as our service discovery DNS service. It returns both Boulder IP addresses for any A lookup ending in ".boulder". This change also sets up the Docker DNS for our boulder container to defer to sd-test-srv when it doesn't know an answer.
sd-test-srv doesn't know how to resolve public Internet names like `github.com`. Resolving public names is required for the `godep-restore` test phase, so this change breaks out a copy of the boulder container that is used only for `godep-restore`.
This change implements a shim of a DNS resolver for gRPC, so that we can switch to DNS-based load balancing with the currently vendored gRPC, then when we upgrade to the latest gRPC we won't need a simultaneous config update.
Also, this change introduces a check at the end of the integration test that each backend received at least one RPC, ensuring that we are not sending all load to a single backend.
The existing CAA tests only test the CAA checks on the validation path and
not the CAA rechecking in the case where an existing authorization is present
(but older than the 8 hour window).
This extends the CAA integration tests to also cover the CAA rechecking
code path, by reusing older authorizations and rejecting issuance via CAA.
The test_expired_authz_404() test is currently broken in two ways - firstly,
there is no way for it to distinguish between a 404 from an expired authz
and a 404 from a non-existent authz. Secondly, the test_expired_authz_purger()
test runs and wipes out all of the existing authorizations, including the one
that was set up from setup_seventy_days_ago(), before the expired test runs.
Avoid this by running the expired authorization purger test later in main().
Also, add a minimal canary that will detect if all authorizations have been purged
(although this still does not guarantee that we got a 404 due to expiration).
## load-generator: send correct ACMEv2 Content-Type on POST.
This PR updates the Boulder load-generator to send the correct ACMEv2 Content-Type header when POSTing to the ACME server. This is required for ACMEv2; without it, all POST requests to a WFE2 running the test/config-next configuration result in 400 "malformed" errors. While only required by ACMEv2, this commit sends it for ACMEv1 requests as well. No harm no foul.
## integration tests: allow running just the load generator.
Prior to this PR an omission in an if statement in integration-test.py meant that you couldn't invoke test/integration-test.py with just the --load argument to run only the load generator. This commit updates the if statement to allow this use case.
Previously we updated the RA's issueCertificateInner function to prefix errors returned from the CA with meaningful information about which CA RPC caused the failure. Unfortunately, by using fmt.Errorf to do this we're discarding the underlying error type. This can cause unexpected server internal errors downstream if (for example) the CA rejects a CSR with a malformed error (see #3632).
This PR updates the issueCertificateInner error message prefixing to maintain the error type if the underlying error is a berrors.BoulderError. An RA unit test with several mock CAs is added to test that the prefixing occurs as expected without loss of error type.
This PR also adds an integration test that ensures we reject a CSR with >100 names with a malformed error. This is not strictly related to this PR but since I wrote it while debugging the root issue I thought I'd include it. To allow this test to pass, the pendingAuthorizationsPerAccount in test/rate-limit-policies.yml and associated tests had to be adjusted.
Resolves #3632
This allows these tools to easily be run in command line mode from
the host machine against a Boulder running inside docker-compose up
without modifying the FAKE_DNS field in docker-compose.yml. This
allows for easier testing of various conditions.
In publisher and in the integration test, check that SCTs are in a
reasonable range. Also, update CreateTestingSignedSCT (used by
ct-test-srv) to produce SCTs correctly with a timestamp in Unix epoch
milliseconds.
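For reference, RFC 6962 SCT timestamps count milliseconds since the Unix epoch; in Python:
```
import time

# An SCT timestamp: milliseconds (not seconds) since the Unix epoch.
timestamp_ms = int(time.time() * 1000)
```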
- Remove acme-v2 test phase.
- Rename integration-test-v2.py to v2_integration, so it can be imported.
- Import all symbols from v2_integration before running test_*.
- In chisel2:
- Rename DIRECTORY so it doesn't collide.
- Incidental logging and error fixes.
- Merge v1 and v2 load testing into a single function.
- Run cert-checker just once, after all other test cases.
- In v2_integration:
- Remove unnecessary imports.
- Import chisel2 methods in the chisel2 namespace so they don't
collide with chisel methods.
- Remove main and shutdown code.
Previously, each time we defined a new test case in integration-test.py, we had to explicitly call it.
This made it easy to leave out cases without realizing it. After this change, we will automatically
find all functions named "test_" and call them. As a result, I found that we weren't calling
`test_revoked_by_account`, and it was failing. So I fixed it as part of this PR.
Fixes #3518
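A sketch of name-based discovery (the real code in integration-test.py may differ in details):
```
import inspect

def run_tests(module, filter_substring=None):
    """Call every function in module whose name starts with "test_"."""
    for name, func in inspect.getmembers(module, inspect.isfunction):
        if not name.startswith("test_"):
            continue
        if filter_substring and filter_substring not in name:
            continue
        func()
```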
Adds SCT embedding to the certificate issuance flow. When an issuance is requested, a precertificate (the requested certificate but poisoned with the critical CT extension) is issued and submitted to the required CT logs. Once the SCTs for the precertificate have been collected, a new certificate is issued with the poison extension replaced by an SCT list extension containing the retrieved SCTs.
Fixes #2244, fixes #3492 and fixes #3429.
This commit adds short 15s runs of the load generator against the V1 and
V2 APIs during the three integration test runs (v1 config, v1
config-next, and v2). 15s was selected because 30s produced so much
output that the build log was truncated.
Presently the latency output is *not* being checked for errors. This was
too flaky in practice.
A fix for a race condition in the load-generator code itself related to
HTTP status code tracking is included in this commit.
The pending authz rate limit also needed to be adjusted to keep the
load-generator from failing requests after hitting 429s.
The test for the certificates_per_name rate limit uses an exact domain
name that has an override in the rate limit config file to have a limit
of 0. This works correctly most of the time. However, if that mechanism
fails once (due to some bug), future runs of the integration test will
continue to fail, because there will now be an issued certificate for
"lim.it" in the DB, and subsequent attempts will be considered renewals.
This change adds a random subdomain to the test, so that it's not
eligible for the renewal exemption.
Boulder is fairly noisy about gRPC connection errors. This is a mixed
blessing: Our gRPC configuration will try to reconnect until it hits
an RPC deadline, and most likely eventually succeed. In that case,
we don't consider those to really be errors. However, in cases where
a connection is repeatedly failing, we'd like to see errors in the
logs about connection failure, rather than "deadline exceeded." So
we want to keep logging of gRPC errors.
However, right now we get a lot of these errors logged during
integration tests. They make the output hard to read, and may disguise
more serious errors. So we'd like to avoid causing such errors in
normal integration test operation.
This change reorders the startup of Boulder components by their gRPC
dependencies, so each component's backends are likely to be up and running
before it starts. It also reverses that order for clean shutdowns,
and waits for each process to exit before signalling the next one.
With these changes, I still got connection errors. Taking listenbuddy
out of the gRPC path fixed them. I believe the issue is that
listenbuddy is not a truly transparent proxy. In particular, it
accepts an inbound TCP connection before opening an outbound TCP
connection. If opening that outbound connection results in "connection
refused," it closes the inbound connection. That means gRPC sees a
"connection closed" (or "connection reset"?) rather than "connection
refused". I'm guessing it handles those cases differently, explaining
the different error results.
We've been using listenbuddy to trigger disconnects while Boulder is
running, to ensure that gRPC's reconnect code works. I think we can
probably rely on gRPC's reconnect to work. The initial problem that
led us to start testing this was a configuration problem; now that
we have the configuration we want, we should be fine and don't need
to keep testing reconnects on every integration test run.
Previously, there was a disagreement between WFE and CA as to what the correct
issuer certificate was. Consolidate on test-ca2.pem (h2ppy h2cker fake CA).
Also, the CA configs contained an outdated entry for "IssuerCert", which was not
being used: The CA configs now use an "Issuers" array to allow signing by
multiple issuer certificates at once (for instance when rolling intermediates).
Removed this outdated entry, and the config code for CA to load it. I've
confirmed these changes match what is currently in production.
Added an integration test to check for this problem in the future.
Fixes#3309, thanks to @icing for bringing the issue to our attention!
This also includes changes from #3321 to clarify certificates for WFE.
Now, rather than LIMIT / OFFSET, this uses the highest id from the last batch in each new batch's query. This makes efficient use of the index, and means the database does not have to scan over a large number of non-expired rows before starting to find any expired rows.
This also changes the structure of the purge function to continually push ids for deletion onto a channel, to be processed by goroutines consuming that channel.
Also, remove the --yes flag and prompting.
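A sketch of the keyset-pagination pattern in Python (the real purger is Go; the table name and DB handle here are assumptions):
```
def purge_expired(db, purge_before, batch_size, handle_id):
    """Keyset pagination: seed each batch's query with the highest id
    seen so far, so the database walks the index instead of scanning
    and discarding non-expired rows the way LIMIT/OFFSET does."""
    last_id = 0
    while True:
        rows = db.execute(
            "SELECT id FROM pendingAuthorizations"
            " WHERE expires <= %s AND id > %s ORDER BY id LIMIT %s",
            (purge_before, last_id, batch_size)).fetchall()
        if not rows:
            return
        for (authz_id,) in rows:
            handle_id(authz_id)  # e.g. push the id onto the deletion channel
        last_id = rows[-1][0]
```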
The go-grpc-prometheus package by default registers its metrics with Prometheus' global registry. In #3167, when we stopped using the global registry, we accidentally lost our gRPC metrics. This change adds them back.
Specifically, it adds two convenience functions, one for clients and one for servers, that makes the necessary metrics object and registers it. We run these in the main function of each server.
I considered adding these as part of StatsAndLogging, but the corresponding ClientMetrics and ServerMetrics objects (defined by go-grpc-prometheus) need to be subsequently made available during construction of the gRPC clients and servers. We could add them as fields on Scope, but this seemed like a little too much tight coupling.
Also, update go-grpc-prometheus to get the necessary methods.
```
$ go test github.com/grpc-ecosystem/go-grpc-prometheus/...
ok github.com/grpc-ecosystem/go-grpc-prometheus 0.069s
? github.com/grpc-ecosystem/go-grpc-prometheus/examples/testproto [no test files]
```
There were two bugs in #3167:
- All process-level stats got prefixed with "boulder", which broke dashboards.
- All request_time stats got dropped, because measured_http was using the Prometheus DefaultRegisterer.
To fix, this PR plumbs through a scope object to measured_http, and uses an empty prefix when calling NewProcessCollector().
In 2fb247488f we consolidated the
`regModelV2` and `regModelv1` structs to one `regModel` type. In the
process we accidentally lost the explicit assignment of the
to-be-updated registration model's `LockCol` with the value of the
existing registration's `LockCol`. This meant that the Update was
occurring with a where clause `LockCol=0` (the default value).
In practice this meant that the first reg update would succeed (since
the reg row starts with LockCol=0) but any regs that had already been
updated once before would modify 0 rows in the update (because the where
clause on `LockCol` failed) and this in turn was translated into
a ServerInternal error since we knew the reg being updated did exist.
This commit updates the SA's `UpdateRegistration` function to properly
set the `LockCol` on the to-be-updated row.
This commit additionally adds an integration test for registration
contact information updating to ensure we don't fall into this trap in
the future.
Previously, CAA problems were lumped in under "ConnectionProblem" or
"Unauthorized". This should make things clearer and easier to differentiate.
Fixes #3043
Fixes #3020.
In order to write integration tests for some features, especially related to rate limiting, rechecking of CAA, and expiration of authzs, orders, and certs, we need to be able to fake the passage of time in integration tests.
To do so, this change switches out all clock.Default() instances for cmd.Clock(), which can be set manually with the FAKECLOCK environment variable. integration-test.py now starts up all servers once before the main body of tests, with FAKECLOCK set to a date 70 days ago, and does some initial setup for a new integration test case. That test case tries to fetch a 70-day-old authz URL, and expects it to 404.
In order to make this work, I also had to change a number of our test binaries to shut down cleanly in response to SIGTERM. Without that change, stopping the servers between the setup phase and the main tests caused startservers.check() to fail, because some processes exited with nonzero status.
Note: This is an initial stab at things, to prove out the technique. Long-term, I think we will want to use an idiom where test cases are classes that have a number of optional setup phases that may be run at e.g. 70 days prior and 5 days prior. This could help us avoid a proliferation of global state as we add more time-dependent test cases.
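A sketch of the fake-time plumbing on the Python side (the exact format string is an assumption; it must match what the Go servers parse):
```
import datetime

def fakeclock(date):
    # Rendered into the FAKECLOCK environment variable before
    # startservers runs; cmd.Clock() picks it up in each Go server.
    return date.strftime("%a %b %d %H:%M:%S UTC %Y")

seventy_days_ago = datetime.datetime.utcnow() - datetime.timedelta(days=70)
# e.g. os.environ["FAKECLOCK"] = fakeclock(seventy_days_ago)
```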
As described in Boulder issue #2800 the implementation of the SA's
`countCertificates` function meant that the renewal exemption for the
Certificates Per Domain rate limit was difficult to work with. To
maximize allotted certificates clients were required to perform all new
issuances first, followed by the "free" renewals. This arrangement was
difficult to coordinate.
In this PR `countCertificates` is updated such that renewals are
excluded from the count reliably. To do so the SA takes the serials it
finds for a given domain from the issuedNames table and cross references
them with the FQDN sets it can find for the associated serials. With the
FQDN sets a second query is done to find all the non-renewal FQDN sets
for the serials, giving a count of the total non-renewal issuances to
use for rate limiting.
Resolves #2800
The `submissions_b` count in the integration test `test_ct_submission` function was being populated initially by using `url_a` when it _should_ be initialized using `url_b` since it's the count of submissions to log b.
This resolves https://github.com/letsencrypt/boulder/issues/2723
I tested this fix with a branch that ran this test 12 times per build. Prior to this fix multiple builds out of 20 (~4-5) would fail. With this fix, all 20 passed.
In 18f4c5c we introduced a workaround for the CT submission integration
test to allow exactly expected, or twice as many CT log submissions as
expected to account for the case where the ocsp-updater and the CA race.
This didn't completely patch over the issue because the number of
submissions can fall between `n` and `2n`.
This commit updates the hack to be even hackier (twice as hacky or your
money back). Now we consider any value *between* `n` and `2n` as a test
pass.
Instead of running it at the current time to clean out left-over cruft, run it with a FAKECLOCK of +1 year so that we catch everything that could get in the way.
Fixes #2148.
Instead of just doing a blanket `DELETE FROM ...` this changes the `expired-authz-purger` to select all of the expired IDs (for both pending and finalized authorizations) then loop over them deleting each and its associated challenges from their respective tables.
Local testing indicates the performance of this is not awful but we should do a test run on staging to verify. If it ends up taking way too long to run there the easiest optimization would be to turn the slice of IDs into a channel and run multiple workers looping over the channel deleting stuff instead of just a single one.
Makes a few small integration test changes in order to facilitate deleting both pending and finalized authorizations.
Presently the CA and the ocsp-updater can race on the initial
submission of a certificate to the configured logs. This results
in double submitting certificates. In integration tests with the fake CT
server this manifests as an occasional failure of the
`test_ct_submission` test (Issue #2579).
The race we currently experience is expected to be fixed in
the future by a planned redesign so for now this commit works around the
failure by allowing either the expected number of submissions, or
exactly double the expected. This fixes #2579. The need to fix the
underlying race was captured in #2610.
The workaround was verified by submitting 10 builds to travis, all
succeeded.
Simply rely on exceptions from check_output.
Also, factor out common params for check_output into a `run` helper function.
Makes sure we always capture stderr into stdout.
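A sketch of that helper (the argument handling is an assumption):
```
import subprocess

def run(cmd, **kwargs):
    """Run cmd, raising CalledProcessError on nonzero exit, with stderr
    folded into the captured output for better failure messages."""
    return subprocess.check_output(
        cmd, shell=True, stderr=subprocess.STDOUT, **kwargs)
```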
This PR has three primary contributions:
1. The existing code for using the V4 safe browsing API introduced in #2446 had some bugs that are fixed in this PR.
2. A gsb-test-srv is added to provide a mock Google Safebrowsing V4 server for integration testing purposes.
3. A short integration test is added to test end-to-end GSB lookup for an "unsafe" domain.
For 1) most notably Boulder was assuming the new V4 library accepted a directory for its database persistence when it instead expects an existing file to be provided. Additionally the VA wasn't properly instantiating feature flags preventing the V4 api from being used by the VA.
For 2) the test server is designed to have a fixed set of "bad" domains (Currently just honest.achmeds.discount.hosting.com). When asked for a database update by a client it will package the list of bad domains up & send them to the client. When the client is asked to do a URL lookup it will check the local database for a matching prefix, and if found, perform a lookup against the test server. The test server will process the lookup and increment a count for how many times the bad domain was asked about.
For 3) the Boulder startservers.py was updated to start the gsb-test-srv and the VA is configured to talk to it using the V4 API. The integration test consists of attempting issuance for a domain pre-configured in the gsb-test-srv as a bad domain. If the issuance succeeds we know the GSB lookup code is faulty. If the issuance fails, we check that the gsb-test-srv received the correct number of lookups for the "bad" domain and fail if the count doesn't match expectations.
Notes for reviewers:
* The gsb-test-srv has to be started before anything will use it. Right now the v4 library handles database update request failures poorly and will not retry for 30min. See google/safebrowsing#44 for more information.
* There's not an easy way to test for "good" domain lookups, only hits against the list. The design of the V4 API is such that a list of prefixes is delivered to the client in the db update phase and if the domain in question matches no prefixes then the lookup is deemed unnecessary and not performed. I experimented with sending 256 1 byte prefixes to try and trick the client to always do a lookup, but the min prefix size is 4 bytes and enumerating all possible prefixes seemed gross.
* The test server has a /add endpoint that could be used by integration tests to add new domains to the block list, but it isn't being used presently. The trouble is that the client only updates its database every 30 minutes at present, and so adding a new domain will only take effect after the client updates the database.
Resolves #2448
Add a new tiny client called chisel, in place of test.js. This reduces the
number of language runtimes Boulder depends on for its tests. Also, since chisel
uses the acme Python library, we get more testing of that library, which
underlies Certbot. This also gives us more flexibility to hook different parts
of the issuance flows in our tests.
Reorganize integration-test.py itself. There was no clear separation of
specific test cases. Some test cases were added as part of run_node_test; some
were wrapped around it. There is now much closer to one function per test case.
Eventually we may be able to adopt Python's test infrastructure for these test
cases.
Remove some unused imports; consolidate on urllib2 instead of urllib.
For getting serial number and expiration date, replace shelling out to OpenSSL
with using pyOpenSSL, since we already have an in-memory parsed certificate.
Replace ISSUANCE_FAILED, REVOCATION_FAILED, MAILER_FAILED with simple die, since
we don't use these. Later, I'd like to remove the other specific exit codes. We
don't make very good use of them, and it would be more effective to just use
stack traces or, even better, reporting of which test cases failed.
Make single_ocsp_sign responsible for its own subprocess lifecycle.
Skip running startservers if WFE is already running, to make it easier to
iterate against a running Boulder (saves a few seconds of Boulder startup).
We turn arrays into maps with a range command. Previously, we were taking the
address of the iteration variable in that range command, which meant incorrect
results since the iteration variable gets reassigned.
Also change the integration test to catch this error.
Fixes #2496
Previously, the expiration-mailer would always run with the default config, even
if BOULDER_CONFIG_DIR was used to point at config-next. This led to missing some
config parse problems that should have been test failures.
* Use labels ending in _key for private key labels.
* Create two separate slots in make-softhsm rather than overwriting a single slot.
* Update make-softhsm instructions to point out both files to edit.
* Improve error output from integration test.
* Output base64-encoded DER, as expected by ocsp-responder.
* Use flags instead of template for Status, ThisUpdate, NextUpdate.
* Provide better help.
* Remove old test (wasn't run automatically).
* Add it to integration test, and use its output for integration test of issuer ocsp-responder.
* Add another slot to boulder-tools HSM image, to store root key.
Under the new defaults, if the syslog section is missing, we'll use the default
config that we use in prod: no logs to stdout, INFO and below to syslog.
This allows us to remove the syslog section from prod configs, and potentially
move it to individual service configs in the future.
* Improve syslog defaults.
* Add stdout logging for purger test.
* Use plain int for sysloglevel.
* Fix JSON syntax
* Fix syslog config for expired-authz-purger.
https://github.com/letsencrypt/boulder/pull/1932
* Add cmd/expired-authz-purger with integration test
* Return count
* gofmt >.>
* add to boulder-config-next.json
* Commit missing file
* Exec on the dbMap
* fprintf the error message
* Review fixes + test
* Review fixes pt. 1
* Review fixes pt. 2 (actually add test file this time :|)
* Fix prompt
* Switch to using flag lib
* Use COUNT(1)
* Revert config -> flag stuff
* Review fixes
* Revert-revert COUNT(1) change
* Review fixes pt. 1
* Nest config struct
* Test review fixes
* Factor out getting future output with FAKECLOCK
* Review fixes pt. 2
* Review fixes pt. 3