The Boulder WFE accepts incoming connections (from our load balancers)
via either TLS or plain HTTP. When those connections are made over TLS,
it already enforces that the client be using TLS 1.3 or above. When those
connections are made over plain HTTP, the load balancer includes the TLS
version as a header, and Boulder was performing filtering based on that.
Our load balancers are now configured to reject older TLS versions, so we
can remove this check.
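For illustration, a minimal sketch of the kind of header-based check being removed; the header name and accepted values here are assumptions, since the real ones depend on the load balancer configuration:
```go
package main

import (
	"fmt"
	"net/http"
)

// tlsVersionOK illustrates the removed check: the load balancer forwards the
// negotiated TLS version in a header, and the WFE filtered on it. Header name
// and values are assumptions, not Boulder's actual configuration.
func tlsVersionOK(r *http.Request) bool {
	switch r.Header.Get("TLS-Version") {
	case "", "TLSv1.3": // direct TLS connections carry no header
		return true
	default:
		return false
	}
}

func main() {
	req, _ := http.NewRequest("GET", "https://example.com/directory", nil)
	req.Header.Set("TLS-Version", "TLSv1.1")
	fmt.Println(tlsVersionOK(req)) // false
}
```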
Fixes https://github.com/letsencrypt/boulder/issues/7710
Clean up how we handle identifiers throughout the Boulder codebase by
- moving the Identifier protobuf message definition from sa.proto to
core.proto;
- adding support for IP identifiers to the "identifier" package;
- renaming the "identifier" package's exported names to be clearer; and
- ensuring we use the identifier package's helper functions everywhere
we can.
This will make future work to actually respect identifier types (such as
in Authorization and Order protobuf messages) simpler and easier to
review.
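As a rough illustration of the direction, here is a hedged sketch of what such helpers look like; the actual exported names and types in the identifier package may differ from these stand-ins:
```go
package main

import (
	"fmt"
	"net/netip"
)

// Stand-ins for the identifier package's cleaned-up API; real names may differ.
type IdentifierType string

const (
	TypeDNS IdentifierType = "dns"
	TypeIP  IdentifierType = "ip"
)

type ACMEIdentifier struct {
	Type  IdentifierType
	Value string
}

// NewDNS and NewIP are the kind of helpers callers should use everywhere,
// instead of constructing identifiers by hand.
func NewDNS(domain string) ACMEIdentifier { return ACMEIdentifier{Type: TypeDNS, Value: domain} }
func NewIP(ip netip.Addr) ACMEIdentifier  { return ACMEIdentifier{Type: TypeIP, Value: ip.String()} }

func main() {
	fmt.Println(NewDNS("example.com"))
	fmt.Println(NewIP(netip.MustParseAddr("198.51.100.1")))
}
```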
Part of https://github.com/letsencrypt/boulder/issues/7311
Begin testing on go1.23. To facilitate this, also update golang.org/x/net,
golangci-lint, staticcheck, and pebble-challtestsrv to versions which
support go1.23. As a result of these updates, also fix a handful of new
lint findings, mostly regarding passing non-static (i.e. potentially
user-controlled) format strings into Sprintf-style functions.
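For example, the typical shape of those lint fixes looks like this (illustrative code, not taken from the diff):
```go
package main

import "fmt"

// describe shows the shape of the lint fix: a potentially user-controlled
// string must not be used as the format string itself.
func describe(userInput string) error {
	// Before (flagged): return fmt.Errorf(userInput)
	// After:
	return fmt.Errorf("%s", userInput)
}

func main() {
	fmt.Println(describe("unexpected %d in input"))
}
```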
Additionally, delete one VA unittest that was duplicating the checks
performed by a different VA unittest, but with a context timeout bug
that caused it to break when go1.23 subtly changed DialContext behavior.
Have the WFE ask the RA for authorizations, rather than asking the SA
directly. This extra layer of indirection allows us to filter out
challenges which have been disabled, so that clients don't think they
can attempt challenges that we have disabled.
Also shuffle the order of challenges within the authz objects rendered
by the API. We used to have code which does this at authz creation time,
but of course that was completely ineffectual once we stored the
challenges as just a bitmap in the database.
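A minimal sketch of the shuffle now applied when rendering an authz (the surrounding WFE plumbing is elided):
```go
package main

import (
	"fmt"
	"math/rand"
)

type Challenge struct {
	Type string
}

// shuffleChallenges randomizes the order of challenges in the rendered
// authz so clients don't see a fixed, database-derived ordering.
func shuffleChallenges(chals []Challenge) {
	rand.Shuffle(len(chals), func(i, j int) {
		chals[i], chals[j] = chals[j], chals[i]
	})
}

func main() {
	chals := []Challenge{{"http-01"}, {"dns-01"}, {"tls-alpn-01"}}
	shuffleChallenges(chals)
	fmt.Println(chals)
}
```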
Update the WFE unit tests to mock RA.GetAuthorization instead of
SA.GetAuthorization2. This includes making the mock more accurate, so
that (e.g.) valid authorizations contain valid challenges, and the
challenges have their correct types (e.g. "http-01" instead of just
"http"). Also update the OTel tracing test to account for the new RPC.
Part of https://github.com/letsencrypt/boulder/issues/5913
- Add feature flag `UseKvLimitsForNewOrder`
- Add feature flag `UseKvLimitsForNewAccount`
- Flush all Redis shards before running integration or unit tests; this
avoids false positives between local testing runs
Fixes #7664
Blocked by #7676
- Check `CertificatesPerDomain` at newOrder and spend at Finalize time.
- Check `CertificatesPerAccountPerDomain` at newOrder and spend at
Finalize time.
- Check `CertificatesPerFQDNSet` at newOrder and spend at Finalize time.
- Fix a bug in
`FailedAuthorizationsPerDomainPerAccountSpendOnlyTransaction()` which
resulted in failed authorizations being spent for the exact FQDN, not
the eTLD+1.
- Remove redundant "max names" check at transaction construction time
- Enable key-value rate limits in the RA
Find all gRPC fields which represent DNS Names -- sometimes called
"identifier", "hostname", "domain", "identifierValue", or other things
-- and unify their naming. This naming makes it very clear that these
values are strings which may be included in the SAN extension of a
certificate with type dnsName.
As we move towards issuing IP Address certificates, all of these fields
will need to be replaced by fields which carry both an identifier type
and value, not just a single name. This unified naming makes it very
clear which messages and methods need to be updated to support
non-dnsName identifiers.
Part of https://github.com/letsencrypt/boulder/issues/7647
Have the RA's UnpauseAccount gRPC method forward the requested account
ID to the SA's corresponding method, and in turn forward the SA's count
of unpaused identifiers back to the caller in the response.
Changing the response message from emptypb.Empty to a new
rapb.UnpauseAccountResponse is safe, because message names are not
transmitted on the wire, only message field numbers.
While we're here, drastically simplify the wfe_test and sfe_test Mock
RAs, so they don't have to implement methods that aren't actually used
by the tests.
Fixes https://github.com/letsencrypt/boulder/issues/7536
Change the way profiles are configured at the WFE to allow them to be
accompanied by descriptive strings. Augment the construction of the
directory resource's "meta" sub-object to include these profile names
and descriptions.
This config swap is safe, since no Boulder WFE instance is configured
with `CertificateProfileNames` yet.
Fixes https://github.com/letsencrypt/boulder/issues/7602
Requests to the new-nonce endpoint make up about 20% of our WFE log
lines, but they're uninteresting and largely useless for debugging.
Suppress the log event for successful requests to reduce our log volume.
- Rename `NewOrderRequest` field `LimitsExempt` to `IsARIRenewal`
- Introduce a new `NewOrderRequest` field, `IsRenewal`
- Introduce a new (temporary) feature flag, `CheckRenewalExemptionAtWFE`
WFE:
- Perform renewal detection in the WFE when `CheckRenewalExemptionAtWFE`
is set
- Skip (key-value) `NewOrdersPerAccount` and `CertificatesPerDomain`
limit checks when renewal detection indicates the order is a
renewal.
RA:
- Leave renewal detection in the RA intact
- Skip renewal detection and (legacy) `NewOrdersPerAccount` and
`CertificatesPerDomain` limit checks when `CheckRenewalExemptionAtWFE`
is set and the `NewOrderRequest` indicates that the order is a renewal.
Fixes #7508
Part of #5545
- Shrink the number of public `ratelimits` methods by relocating two
sizeable transaction constructors. Simplify the spend and refund
call-sites in the WFE.
- Spend calls now block instead of being called asynchronously.
In #7530, `wfe.NewOrder` [began constructing a rate limit
transaction](https://github.com/letsencrypt/boulder/pull/7530/files#diff-3f950e720c205ce9fa8dea12c6fd7fd44272c2671f19d0e06962abfbea00d491R2340-R2344)
with a precondition that all names must be lower-cased; however, the
actual implementation of the precondition was accidentally overlooked.
This fix corrects that and adds a unit test to prevent a future
regression.
Other changes:
- Only normalized names count towards max names limit
- Only normalized names will be logged in the web.RequestEvent
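For reference, a hedged sketch of the kind of normalization the transaction now assumes has already happened (Boulder's real helper may differ in name and details):
```go
package main

import (
	"fmt"
	"slices"
	"strings"
)

// normalizeNames lower-cases, trims, sorts, and de-duplicates DNS names.
func normalizeNames(names []string) []string {
	out := make([]string, 0, len(names))
	for _, n := range names {
		out = append(out, strings.ToLower(strings.TrimSpace(n)))
	}
	slices.Sort(out)
	return slices.Compact(out)
}

func main() {
	fmt.Println(normalizeNames([]string{"Example.COM", "example.com", "www.example.com"}))
}
```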
---------
Co-authored-by: Samantha Frank <hello@entropy.cat>
Add the `ra.UnpauseAccount` which takes an `rapb.UnpauseAccountRequest`
input parameter. The method is just a stub to allow downstream SFE
development to continue. There is relevant ongoing work in the SA which
will eventually reside in this stub method.
Change how goodkey.KeyPolicy keeps track of allowed RSA and ECDSA key
sizes, to make it slightly more flexible while still retaining the very
locked-down allowlist of only 6 acceptable key sizes (RSA 2048, 3072,
and 4096, and ECDSA P256, P384, and P521). Add a new constructor which
takes in a collection of allowed key sizes, so that users of the goodkey
package can customize which keys they accept. Rename the existing
constructor to make it clear that it uses hardcoded default values.
With these new constructors available, make all of the goodkey.KeyPolicy
member fields private, so that a KeyPolicy can only be built via these
constructors.
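A purely hypothetical sketch of the new shape, with made-up constructor and field names, just to show the allowlist-of-sizes idea:
```go
package main

import "fmt"

// keyPolicy and its constructor are made-up names; only the shape matters:
// callers pass in the key sizes they accept, and the fields stay private.
type keyPolicy struct {
	allowedRSABits     map[int]bool
	allowedECDSACurves map[string]bool
}

func newKeyPolicyWithAllowedKeys(rsaBits []int, curves []string) keyPolicy {
	kp := keyPolicy{
		allowedRSABits:     make(map[int]bool),
		allowedECDSACurves: make(map[string]bool),
	}
	for _, b := range rsaBits {
		kp.allowedRSABits[b] = true
	}
	for _, c := range curves {
		kp.allowedECDSACurves[c] = true
	}
	return kp
}

func main() {
	kp := newKeyPolicyWithAllowedKeys([]int{2048, 3072, 4096}, []string{"P-256", "P-384", "P-521"})
	fmt.Println(kp.allowedRSABits[2048], kp.allowedECDSACurves["P-521"])
}
```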
This allows us to give a user-meaningful error about malformed names
early on, instead of propagating internal errors from the new rate
limiting system.
This moves the well-formedness logic from `WillingToIssue` into a new
function `WellFormedDomainNames`, which calls `ValidDomain` on each name
and combines the errors into suberrors if there is more than one.
`WillingToIssue` now calls `WellFormedDomainNames` to keep the existing
behavior. Additionally, WFE calls `WellFormedDomainNames` before
checking rate limits.
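A hedged sketch of the new flow, with stand-in validation and error-combining logic (Boulder uses its own suberror machinery rather than errors.Join):
```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// validDomain stands in for the real per-name syntax check.
func validDomain(name string) error {
	if !strings.Contains(name, ".") || strings.Contains(name, "_") {
		return fmt.Errorf("%q is not a well-formed domain name", name)
	}
	return nil
}

// wellFormedDomainNames checks every name and reports all failures together.
func wellFormedDomainNames(names []string) error {
	var errs []error
	for _, n := range names {
		if err := validDomain(n); err != nil {
			errs = append(errs, err)
		}
	}
	if len(errs) == 1 {
		return errs[0]
	}
	// Boulder combines multiple failures into suberrors of one malformed
	// problem; errors.Join is a stand-in for that here.
	return errors.Join(errs...)
}

func main() {
	fmt.Println(wellFormedDomainNames([]string{"example.com", "bad_name", "nodots"}))
}
```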
This creates a slight behavior change: If an order contains both
malformed domain names and well-formed but blocked domain names,
suberrors will only be generated for the malformed domain names. This is
reflected in the changes to `TestWillingToIssue_Wildcard`.
Adds a WFE test case for receiving malformed identifiers in a new-order
request.
Follows up on #3323 and #7218. Fixes #7526
Some small incidental fixes:
- checkWildcardHostList was checking `pa.blocklist` for `nil` before
accessing `pa.wildcardExactBlocklist`. Fix that.
- move table test for WillingToIssue into a new test case for
WellFormedDomainNames
- move two standalone test cases into the big table test
The core.Challenge.ProvidedKeyAuthorization field is problematic, both
because it is poorly named (which is admittedly easily fixable) and
because it is a field which we never expose to the client yet it is held
on a core type. Deprecate this field, and replace it with a new
vapb.PerformValidationRequest.ExpectedKeyAuthorization field.
Within the VA, this also simplifies the primary logic methods to just
take the expected key authorization, rather than taking a whole (largely
unnecessary) challenge object. This has large but wholly mechanical
knock-on effects on the unit tests.
While we're here, improve the documentation on core.Challenge itself,
and remove Challenge.URI, which was deprecated long ago and is wholly
unused.
Part of https://github.com/letsencrypt/boulder/issues/7514
Everything logged at the error level is implicitly given the audit tag
as well. That's not merited in this case, so downgrade these error logs
to be info logs instead.
Replace "mocks.StorageAuthority" with "sapb.StorageAuthorityClient" in
our test mocks. This improves them by removing implementations of the
methods the tests don't actually need, instead of inheriting lots of
extraneous methods from the huge and cumbersome mocks.StorageAuthority.
This reduces our usage of mocks.StorageAuthority to only the WFE tests
(which create one in the frequently-used setup() function), which will
make refactoring those mocks in the pursuit of
https://github.com/letsencrypt/boulder/issues/7476 much easier.
Part of https://github.com/letsencrypt/boulder/issues/7476
Remove the redis-tls, wfe-tls, and mail-test-srv keys which were
generated by minica and then checked in to the repo. All three are
replaced by the dynamically-generated ipki directory.
Part of https://github.com/letsencrypt/boulder/issues/7476
[draft-ietf-acme-ari-03 section
4.1](https://www.ietf.org/archive/id/draft-ietf-acme-ari-03.html#section-4.1)
states the following, indicating that it is the client's responsibility
to add a `/` after the `renewalInfoPath`, not the server's.
> Thus the full request url is constructed as follows, where the "||"
operator indicates string concatenation and the renewalInfo url is taken
from the Directory object:
```
url = renewalInfo || '/' || base64url(AKI) || '.' || base64url(Serial)
```
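In Go terms, with the server-side path carrying no trailing slash, the client-side construction looks roughly like this (ARI uses unpadded base64url):
```go
package main

import (
	"encoding/base64"
	"fmt"
)

// ariURL builds the request URL per the draft: the client, not the server,
// supplies the separating "/". aki and serial are the raw octets from the
// certificate's Authority Key Identifier and serial number.
func ariURL(renewalInfoBase string, aki, serial []byte) string {
	enc := base64.RawURLEncoding
	return renewalInfoBase + "/" + enc.EncodeToString(aki) + "." + enc.EncodeToString(serial)
}

func main() {
	fmt.Println(ariURL("https://example.com/acme/renewal-info",
		[]byte{0xaa, 0xbb, 0xcc}, []byte{0x01, 0x02, 0x03}))
}
```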
Fixes https://github.com/letsencrypt/boulder/issues/7481
Update the hierarchy which the integration tests auto-generate inside
the ./hierarchy folder to include three intermediates of each key type,
two to be actively loaded and one to be held in reserve. To facilitate
this:
- Update the generation script to loop, rather than hard-coding each
intermediate we want
- Improve the filenames of the generated hierarchy to be more readable
- Replace the WFE's AIA endpoint with a thin aia-test-srv so that we
don't have to have NameIDs hardcoded in our ca.json configs
Having this new hierarchy will make it easier for our integration tests
to validate that new features like "unpredictable issuance" are working
correctly.
Part of https://github.com/letsencrypt/boulder/issues/729
Upgrade from the old go-jose v2.6.1 to the newly minted go-jose v4.0.1.
Cleans up old code now that `jose.ParseSigned` can take a list of
supported signature algorithms.
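For example, parsing now passes the acceptable algorithms directly (the algorithm list shown here is illustrative, not necessarily the exact set Boulder accepts):
```go
package main

import (
	"fmt"

	jose "github.com/go-jose/go-jose/v4"
)

// parseJWS shows the v4 shape: the acceptable signature algorithms are
// passed to ParseSigned, so no separate post-parse algorithm check is needed.
func parseJWS(body string) (*jose.JSONWebSignature, error) {
	return jose.ParseSigned(body, []jose.SignatureAlgorithm{
		jose.RS256, jose.ES256, jose.ES384, jose.ES512,
	})
}

func main() {
	_, err := parseJWS(`{"payload":"","protected":"","signature":""}`)
	fmt.Println(err)
}
```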
Fixes https://github.com/letsencrypt/boulder/issues/7390
---------
Co-authored-by: Aaron Gable <aaron@letsencrypt.org>
- Parse and validate the `profile` field in `newOrder` requests.
- Pass the `profile` field from `newOrder` calls to the resulting
`RA.NewOrder` call.
- When the client requests a specific profile, ensure that the profile
field is populated in the order returned.
Fixes #7332
Part of #7309
- Update the failed authorizations limit to use 'enum:regId:domain' for
transactions while maintaining 'enum:regId' for overrides.
- Modify the failed authorizations transaction builder to generate a
transaction for each order name.
- Rename the `FailedAuthorizationsPerAccount` enum to
`FailedAuthorizationsPerDomainPerAccount` to align with its corrected
implementation. This change is possible because the limit isn't yet
deployed in staging or production.
Blocks #7346
Part of #5545
Implement draft-ietf-acme-ari-02 changes in WFE newOrder:
- Add a `replaces` field to the newOrder request object
- Ensure that `replaces` values provided by subscribers are vetted
according to the requirements set out in the draft specification
- When a NewOrder request falls inside the suggested RenewalWindow,
exempt from rate limits in the WFE and indicate exemption in the RA
NewOrder request
Part of #7038
Remove three deprecated feature flags which have been removed from all
production configs:
- StoreLintingCertificateInsteadOfPrecertificate
- LeaseCRLShards
- AllowUnrecognizedFeatures
Deprecate three flags which are set to true in all production configs:
- CAAAfterValidation
- AllowNoCommonName
- SHA256SubjectKeyIdentifier
IN-9879 tracked the removal of these flags.
Make NewRegistration more consistent with the implementation in NewOrder
(#7201):
- Construct transactions just once,
- use batched spending instead of multiple spend calls, and
- do not attempt a refund for requests that fail due to RateLimit
errors.
Part of #5545
Rename "IssuerNameID" to just "NameID". Similarly rename the standalone
functions which compute it to better describe their function. Add a
.NameID() directly to issuance.Issuer, so that callers in other packages
don't have to directly access the .Cert member of an Issuer. Finally,
rearrange the code in issuance.go to be sensibly grouped as concerning
NameIDs, Certificates, or Issuers, rather than all mixed up between the
three.
Fixes https://github.com/letsencrypt/boulder/issues/5152
Enable the atomicalign, deepequalerrors, findcall, nilness,
reflectvaluecompare, sortslice, timeformat, and unusedwrite go vet
analyzers, which golangci-lint does not enable by default. Additionally,
enable new go vet analyzers by default as they become available.
The fieldalignment and shadow analyzers remain disabled because they
report so many errors that they should be fixed in a separate PR.
Note that the nilness analyzer appears to have found one very real bug
in tlsalpn.go.
Add support for draft-ietf-acme-ari-02 format alongside the existing
draft-ietf-acme-ari-01 implementation. Both formats are interchangeable.
Fixes #7037
Replace the current three-piece setup (enum of feature variables, map of
feature vars to default values, and autogenerated bidirectional maps of
feature variables to and from strings) with a much simpler one-piece
setup: a single struct with one boolean-typed field per feature. This
preserves the overall structure of the package -- a single global
feature set protected by a mutex, and Set, Reset, and Enabled methods --
although the exact function signatures have all changed somewhat.
The executable config format remains the same, so no deployment changes
are necessary. This change does deprecate the AllowUnrecognizedFeatures
feature, as we cannot tell the json config parser to ignore unknown
field names, but that flag is set to False in all of our deployment
environments already.
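A minimal sketch of the one-piece setup, with an illustrative feature name and accessor (the real package's method names and signatures differ somewhat, as noted above):
```go
package features

import "sync"

// Config has one boolean field per feature flag; the JSON config decodes
// directly into it. The field here is illustrative.
type Config struct {
	ExampleFeature bool
}

var (
	mu      sync.RWMutex
	current Config
)

// Set replaces the global feature set.
func Set(fs Config) {
	mu.Lock()
	defer mu.Unlock()
	current = fs
}

// Reset restores every feature to its zero (disabled) value.
func Reset() {
	mu.Lock()
	defer mu.Unlock()
	current = Config{}
}

// Get returns a copy of the current feature set; callers check fields
// directly, e.g. features.Get().ExampleFeature.
func Get() Config {
	mu.RLock()
	defer mu.RUnlock()
	return current
}
```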
Fixes https://github.com/letsencrypt/boulder/issues/6802
Fixes https://github.com/letsencrypt/boulder/issues/5229
- Move default and override limits, and associated methods, out of the
Limiter to new limitRegistry struct, embedded in a new public
TransactionBuilder.
- Export Transaction and add corresponding Transaction constructor
methods for each limit Name, making Limiter and TransactionBuilder the
API for interacting with the ratelimits package.
- Implement batched Spends and Refunds on the Limiter, the new methods
accept a slice of Transactions.
- Add new boolean fields check and spend to Transaction to support more
complicated cases that can arise in batches:
1. the InvalidAuthorizations limit is checked at New Order time in a
batch with many other limits, but should only be spent when an
Authorization is first considered invalid.
2. the CertificatesPerDomain limit is overridden by
CertficatesPerDomainPerAccount, when this is the case, spends of the
CertificatesPerDomain limit should be "best-effort" but NOT deny the
request if capacity is lacking.
- Modify the existing Spend/Refund methods to support
Transaction.check/spend and 0 cost Transactions.
- Make bucketId private and add a constructor for each bucket key format
supported by ratelimits.
- Move domainsForRateLimiting() from the ra.go to ratelimits. This
avoids a circular import issue in ra.go.
Part of #5545
This is a cleanup PR finishing the migration from int64 timestamps to
protobuf `*timestamppb.Timestamps` by removing all usage of the old
int64 fields. In the previous PR
https://github.com/letsencrypt/boulder/pull/7121 all fields were
switched to read from the protobuf timestamppb fields.
Adds a new case to `core.IsAnyNilOrZero` to check various properties of
a `*timestamppb.Timestamp`, reducing the visual complexity for receivers.
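Roughly, the new case behaves like this sketch (the exact properties checked may differ):
```go
package main

import (
	"fmt"

	"google.golang.org/protobuf/types/known/timestamppb"
)

// timestampUnset sketches the new case: nil pointers and zero-valued
// timestamps (the Unix epoch) are both treated as "not set".
func timestampUnset(ts *timestamppb.Timestamp) bool {
	return ts == nil || !ts.IsValid() || ts.AsTime().Unix() == 0
}

func main() {
	fmt.Println(timestampUnset(nil), timestampUnset(timestamppb.Now()))
}
```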
Fixes https://github.com/letsencrypt/boulder/issues/7060
The `Limiter` API has been adjusted significantly to improve both
safety and ergonomics, and two `Limit` types have been corrected to match
the legacy implementations.
**Safety**
Previously, the key used for looking up limit overrides and for fetching
individual buckets from the key-value store was constructed within the
WFE. This posed a risk: if the key was malformed, the default limit
would still be enforced, but individual overrides would fail to function
properly. This has been addressed by the introduction of a new
`BucketId` type along with a `BucketId` constructor for each `Limit`
type. Each constructor is responsible for producing a well-formed bucket
key which undergoes the very same validation as any potentially matching
override key.
**Ergonomics**
Previously, each of the `Limiter` methods took a `Limit` name, a bucket
identifier, and a cost to be spent/refunded. To simplify this, each
method now accepts a new `Transaction` type which provides a cost, and
wraps a `BucketId` identifying the specific bucket.
The two changes above, when taken together, make the implementation of
batched rate limit transactions considerably easier, as a batch method
can accept a slice of `Transaction`.
**Limit Corrections**
PR #6947 added all of the existing rate limits which could be made
compatible with the key-value approach. Two of these were improperly
implemented: `CertificatesPerDomain` and `CertificatesPerFQDNSet` were
implemented as `CertificatesPerDomainPerAccount` and
`CertificatesPerFQDNSetPerAccount`.
Since we do not actually associate these limits with a particular ACME
account, the `regID` portion of each of their bucket keys has been
removed.
The RequireCommonName feature flag was our only "inverted" feature flag,
which defaulted to true and had to be explicitly set to false. This
inversion can lead to confusion, especially to readers who expect all Go
default values to be zero values. We plan to remove the ability for our
feature flag system to support default-true flags, which the existence
of this flag blocked. Since this flag has not been set in any real
configs, inverting it is easy.
Part of https://github.com/letsencrypt/boulder/issues/6802
* Adds new `google.protobuf.Timestamp` fields to each .proto file where
we had been using `int64` fields as a timestamp.
* Updates relevant gRPC messages to populate the new
`google.protobuf.Timestamp` fields in addition to the old `int64`
timestamp fields.
* Added tests for each `<x>ToPB` and `PBto<x>` function to ensure that
new fields passed into a gRPC message arrive as intended.
* Removed an unused error return from `PBToCert` and `PBToCertStatus`
and cleaned up each call site.
Built on-top of https://github.com/letsencrypt/boulder/pull/7069
Part 2 of 4 related to
https://github.com/letsencrypt/boulder/issues/7060
Integrate the key-value rate limits from #6947 into the WFE. Rate limits
are backed by the Redis source added in #7016, and use the SRV record
shard discovery added in #7042.
Part of #5545
Fix an issue related to the custom gRPC Picker implementation introduced
in #6618. When a nonce contained a prefix not associated with a known
backend, the Picker would continuously rebuild, re-resolve DNS, and
eventually throw a 500 "Server Error" at RPC timeout. The Picker now
promptly returns a 400 "Bad Nonce" error as expected; in response, the
requesting client should retry their request with a fresh nonce.
Additionally:
- WFE unit tests use derived nonces when `"BOULDER_CONFIG_DIR" ==
"test/config-next"`.
- `Balancer.Build()` in "noncebalancer" forces a rebuild until non-zero
backends are available. This matches the
grpc-go `balancer/roundrobin` implementation.
- Nonces with no matching backend increment "jose_errors" with label
`"type": "JWSInvalidNonce"` and "nonce_no_backend_found".
- Nonces of incorrect length are now rejected at the WFE and increment
"jose_errors" with label `"type": "JWSMalformedNonce"` instead of
`"type": "JWSInvalidNonce"`.
- Nonces not encoded as base64url are now rejected at the WFE and
increment "jose_errors" with label `"type": "JWSMalformedNonce"` instead
of `"type": "JWSInvalidNonce"`.
Fixes #6969
Part of #6974
This was introduced early in Boulder development when we had the concept
of a "short serial" (monotonically increasing) which would be prepended
to random bytes to form the full serial. We wanted to specially report
the case that there were duplicates of a given short serial since it
meant a problem with our monotonicity.
We've long since abandoned that idea, and also this code can't be
exercised because sa.SelectCertificate does a LIMIT 1 anyhow.
Add a helper which takes two values (typically the return values of a
two-value function), panics if the error is non-nil, and returns the
interesting value. This is particularly useful for cases where we
statically know the call will succeed.
Thanks to @mcpherrinm for the idea!
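The helper's actual name and package aren't spelled out here; a minimal generic sketch of the idea:
```go
package main

import (
	"fmt"
	"net/netip"
)

// must panics if err is non-nil, otherwise returns the value. Handy where
// the call is statically known to succeed (e.g. parsing a constant).
func must[T any](v T, err error) T {
	if err != nil {
		panic(err)
	}
	return v
}

func main() {
	addr := must(netip.ParseAddr("192.0.2.1"))
	fmt.Println(addr)
}
```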
The contacts field of an account can be very verbose, and is irrelevant
to the vast majority -- e.g. creating orders, validating challenges, and
downloading certificates -- of requests made by an account. To reduce
the length of our WFE log lines, remove the Contacts field from all
logs. When we actually need it, we can get it from the database.
Also remove the RequestEvent.TLS field, which is unused.
In WFE, we do a goodkey check when validating a self-authenticated POST
(i.e. when creating an account). For a while, that was a purely local
check, looking at a list of bad keys or bad moduluses, or checking for
factorability. At some point we also added a backend check, querying the
SA to see if a key was blocked. However, we did not update this one code
path to distinguish "bad key" from "timeout querying SA." That meant
that sometimes we would give a badPublicKey Problem Document when we
should have given an internalServerError.
Related:
https://github.com/letsencrypt/boulder/issues/6795#issuecomment-1574217398
Define a `bJSONWebSignature` struct which embeds a
`*jose.JSONWebSignature`. The only method that can produce a
`bJSONWebSignature` is `wfe.parseJWS` so that we can ensure
safety/sanity checks are performed on the incoming data. Restricts
several methods and functions to take a `jose.Header` as an input
parameter, rather than a full JWS.
Fixes https://github.com/letsencrypt/boulder/issues/5676.
Remove the remaining divergences from RFC8555 regarding what error types
we use in certain situations. Specifically:
- use "invalidContact" instead of "invalidEmail";
- use "unsupportedContact" for contact addresses that use a protocol
other than "mailto:"; and
- use "unsupportedIdentifier" for identifiers that specify a type other
than "dns".
This PR adds a new configuration block specifically for the otelhttp
instrumentation. This block is separate from the existing
"opentelemetry" configuration, and is only relevant when using otelhttp
instrumentation. It does not share any codepath with the existing
configuration, so it is at the top level to indicate which services it
applies to.
There's a bit of plumbing new configuration through. I've adopted the
measured_http package to also set up opentelemetry instead of just
metrics, which should hopefully allow any future changes to be smaller
(just config & there) and more consistent between the wfe2 and ocsp
responder.
There's one option here now, which disables setting
[otelhttp.WithPublicEndpoint](https://pkg.go.dev/go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp#WithPublicEndpoint).
This option is designed to do exactly what we want: Don't accept
incoming spans as parents of the new span created in the server.
Previously we had a setting to disable parent-based sampling to help
with this problem, which doesn't really make sense anymore, so let's
just remove it and simplify that setup path. The default of "false" is
designed to be the safe option. It's set to true in the test/ configs
for integration tests that use traces, and I expect we'll likely set it
true in production eventually once the LBs are configured to handle
tracing themselves.
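A minimal sketch of wiring this up with otelhttp (handler names and the operation label are illustrative):
```go
package main

import (
	"log"
	"net/http"

	"go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp"
)

func main() {
	mux := http.NewServeMux()
	mux.HandleFunc("/directory", func(w http.ResponseWriter, r *http.Request) {
		w.Write([]byte("{}"))
	})

	// WithPublicEndpoint links any incoming trace context instead of using
	// it as the parent, so external callers can't become parents of our
	// server spans. The new config option controls whether it is applied.
	handler := otelhttp.NewHandler(mux, "wfe", otelhttp.WithPublicEndpoint())

	log.Fatal(http.ListenAndServe(":8080", handler))
}
```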
Fixes #6851
Make minor, non-user-visible changes to how we structure the probs
package. Notably:
- Add new problem types for UnsupportedContact and
UnsupportedIdentifier, which are specified by RFC8555 and which we will
use in the future, but haven't been using historically.
- Sort the problem types and constructor functions to match the
(alphabetical) order given in RFC8555.
- Rename some of the constructor functions to better match their
underlying problem types (e.g. "TLSError" to just "TLS").
- Replace the redundant ProblemDetailsToStatusCode function with simply
always returning a 500 if we haven't properly set the problem's
HTTPStatus.
- Remove the ability to use either the V1 or V2 error namespace prefix;
always use the proper RFC namespace prefix.
Update the document number to the latest version, and remove the /get/
prefix since it now supports both the GET and POST portions of the spec.
Also update one piece of tooling to properly get the ARI URL from the
directory, rather than hard-coding it.
Add a new shared config stanza which all boulder components can use to
configure their Open Telemetry tracing. This allows components to
specify where their traces should be sent, what their sampling ratio
should be, and whether or not they should respect their parent's
sampling decisions (so that web front-ends can ignore sampling info
coming from outside our infrastructure). It's likely we'll need to
evolve this configuration over time, but this is a good starting point.
Add basic Open Telemetry setup to our existing cmd.StatsAndLogging
helper, so that it gets initialized at the same time as our other
observability helpers. This sets certain default fields on all
traces/spans generated by the service. Currently these include the
service name, the service version, and information about the telemetry
SDK itself. In the future we'll likely augment this with information
about the host and process.
Finally, add instrumentation for the HTTP servers and grpc
clients/servers. This gives us a starting point of being able to monitor
Boulder, but is fairly minimal as this PR is already somewhat unwieldy:
It's really only enough to understand that everything is wired up
properly in the configuration. In subsequent work we'll enhance those
spans with more data, and add more spans for things not automatically
traced here.
Fixes https://github.com/letsencrypt/boulder/issues/6361
---------
Co-authored-by: Aaron Gable <aaron@aarongable.com>
When sending an ARI response, write the Retry-After header before
writing the JSON response body. This is necessary because
http.ResponseWriter implicitly calls WriteHeader whenever Write is
called, flushing all headers to the network and preventing any
additional headers from being written. Unfortunately, the unittests use
httptest.ResponseRecorder, which doesn't seem to enforce this invariant
(it's happy to report headers which were written after the body). Add a
header check to the integration tests, to make up for this deficiency.
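A small sketch of the required ordering (the Retry-After value is illustrative):
```go
package main

import (
	"fmt"
	"net/http"
)

func serveRenewalInfo(w http.ResponseWriter, r *http.Request) {
	// Headers must be set before the first Write: Write implicitly calls
	// WriteHeader(200), after which header changes are silently dropped.
	w.Header().Set("Retry-After", "21600") // value illustrative
	w.Header().Set("Content-Type", "application/json")
	fmt.Fprint(w, `{"suggestedWindow":{}}`) // body elided
}

func main() {
	http.HandleFunc("/renewal-info/", serveRenewalInfo)
	http.ListenAndServe(":8080", nil)
}
```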
Change the SetCommonName flag, introduced in #6706, to
RequireCommonName. Rather than having the flag control both whether or
not a name is hoisted from the SANs into the CN *and* whether or not the
CA is willing to issue certs with no CN, this updated flag now only
controls the latter. By default, the new flag is true, and continues our
current behavior of failing issuance if we cannot set a CN in the cert.
When the flag is set to false, then we are willing to issue certificates
for which the CSR contains no CN and there is no SAN short enough to be
hoisted into the CN field.
When we have rolled out this change, we can move on to the next flag in
this series: HoistCommonName, which will control whether or not a SAN is
hoisted at all, effectively giving the CSRs (and therefore the clients)
full control over whether their certificate contains a SAN.
This change is safe because no environment explicitly sets the
SetCommonName flag to false yet.
Fixes #5112
Remove tracing using Beeline from Boulder. The only remnant left behind
is the deprecated configuration, to ensure deployability.
We had previously planned to swap in OpenTelemetry in a single PR, but
that adds significant churn in a single change, so we're doing this as
multiple steps that will each be significantly easier to reason about
and review.
Part of #6361
For simple 404s, there's no need to log an InternalError in addition to
the user-facing error, so pass `nil` to sendError as the internalError
parameter. This cleans up Certificate, GetOrder, and FinalizeOrder; all
the places I could find that checked for `NotFound` and also logged an
unnecessary InternalError.
I also removed a redundant and unnecessary error wrapping, and a
reference to "short serial", which is not a concept we have anymore.
This was mostly unused. The only caller was orphan-finder, which used it
to determine if a certificate was already in the database. But this is
not particularly important functionality, so I've removed it.
[RFC 8555 section
7.4](https://www.rfc-editor.org/rfc/rfc8555.html#section-7.4) states
regarding Orders in the "processing" state:
> "processing": The certificate is being issued. Send a POST-as-GET
> request after the time given in the Retry-After header field of
> the response, if any.
Add a Retry-After header when serving Order objects that are in the
"processing" state. This may help control clients which implement Order
polling but without any built-in backoff. The retry interval is
hard-coded to be 3s, slightly above our current 99th percentile Finalize
latency.
Remove the `MandatoryPOSTasGET` flag from the WFE2.
Update the ACMEv2 divergence doc to note that neither staging nor
production use MandatoryPOSTasGET.
Fixes #6582.
Also remove CSRDNSNames, CSRIPAddresses and CSREmailAddresses.
And add a new log field "DNSNames", for use in new-order, finalize, and
revoke requests.
Add a "RevocationReason" field in the "Extra" section for revoke
requests.
Give ARI improved error messages when no request path is specified and
when parsing of the request path blob fails.
Also, add a tool which can be used to quickly generate ARI requests and
print their results, to make manual spot-checking easier.
Fixes #6629
Assign nonce prefixes for each nonce-service by taking the first eight
characters of the base64url-encoded HMAC-SHA256 hash of the RPC
listening address using a provided key, as sketched below. The provided
key must be the same across all boulder-wfe and nonce-service instances.
- Add a custom `grpc-go` load balancer implementation (`nonce`) which
can route nonce redemption RPC messages by matching the prefix to the
derived prefix of the nonce-service instance which created it.
- Modify the RPC client constructor to allow the operator to override
the default load balancer implementation (`round_robin`).
- Modify the `srv` RPC resolver to accept a comma separated list of
targets to be resolved.
- Remove unused nonce-service `-prefix` flag.
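The derivation described above, as a sketch (padding doesn't matter here since only the first eight characters are kept):
```go
package main

import (
	"crypto/hmac"
	"crypto/sha256"
	"encoding/base64"
	"fmt"
)

// derivePrefix computes a nonce prefix: the first eight characters of the
// base64url-encoded HMAC-SHA256 of the gRPC listening address, keyed with
// the shared key.
func derivePrefix(listenAddr string, key []byte) string {
	mac := hmac.New(sha256.New, key)
	mac.Write([]byte(listenAddr))
	return base64.RawURLEncoding.EncodeToString(mac.Sum(nil))[:8]
}

func main() {
	fmt.Println(derivePrefix("10.0.0.5:9101", []byte("shared-secret")))
}
```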
Fixes #6404
Originally, WFEs had a built-in nonce service. Then we added a "remote
nonce service" via gRPC, but we kept a fallback path for when the remote
nonce service was not configured, to use a built-in nonce service. This
PR removes that fallback path.
Since the fallback path was relied on by the unittests, this also
refactors the unittests to use a gRPC-style nonce service (but in-memory
for the unittests).
Fixes #6530
Update our implementation of ARI to return a renewal window entirely in
the past (i.e., suggesting immediate renewal) if the certificate in
question has been revoked for any reason. This will allow clients which
implement ARI to discover that they need to replace their certificate
without having to query OCSP directly, especially as we move into a
future where OCSP is mostly supplanted by aggregated CRLs.
Fixes #6503
In the WFE, ocsp-responder, and crl-updater, switch from using
StorageAuthorityClients to StorageAuthorityReadOnlyClients. This ensures
that these services cannot call methods which write to our database.
Fixes #6454
When a client cancels their HTTP request, that propagates into
gRPC-level cancellation. But we don't want those canceled RPCs to show
up as 500s, we want them to show up as 408s. We do that by producing a
special ProblemDetails at the gRPC level. However, for that trick to
work, we need to make sure errors from RPC methods get passed through
web.ProblemDetailsForError. There were some places where we didn't do
this and instead created a from-scratch ProblemDetails. This resulted in
spurious 500s.
My methodology was to look at every method call on each of the WFE's
fields that represents a gRPC backend: `ra`, `sa`, `accountGetter`,
`nonceService`, `remoteNonceServe`. If the error handling for that call
did not use web.ProblemDetailsForError, I changed it to use that.
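A toy sketch of why the distinction matters; gRPC status-code handling is elided, and Boulder's real mapping lives in web.ProblemDetailsForError:
```go
package main

import (
	"context"
	"errors"
	"fmt"
	"net/http"
)

// statusForError shows the mapping that matters here: a client-cancelled
// request should surface as a 408, not a 500. Boulder gets this by routing
// RPC errors through web.ProblemDetailsForError.
func statusForError(err error) int {
	if errors.Is(err, context.Canceled) {
		return http.StatusRequestTimeout // 408
	}
	return http.StatusInternalServerError // 500
}

func main() {
	fmt.Println(statusForError(context.Canceled), statusForError(errors.New("boom")))
}
```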
Fixes #6524
Clean up several spots where we were behaving differently on
go1.18 and go1.19, now that we're using go1.19 everywhere. Also
re-enable the lint and generate tests, and fix the various places where
the two versions disagreed on how comments should be formatted.
Also clean up the OldTLS codepaths, now that both go1.19 and our
own feature flags have forbidden TLS < 1.2 everywhere.
Fixes #6011
- Add a new field, `RetryAfter` to `BoulderError`s
- Add logic to wrap/unwrap the value of the `RetryAfter` field in our gRPC
error interceptor
- Plumb `RetryAfter` for `DuplicateCertificateError` emitted by RA to the WFE
client response header
Part of #6256
Suggest that subscribers with certificates impacted by an ongoing revocation
incident renew immediately.
- Make SA method `IncidentsForSerial` a callable RPC
Resolves #6282
Update our ACME Renewal Info implementation to parse
the CertID-based request format specified in the current
version of the draft specification.
Part of #6033