boulder

Commit Graph

Author	SHA1	Message	Date
Aaron Gable	8da52f74c3	Remove v1 `NewCertificate` code path from WFE and RA (#5842 ) The NewCertificate codepath was the ACME v1 API's equivalent of the modern Finalize endpoint. Remove the bodies of the WFE's and the RA's `NewCertificate` functions. Remove the functions which were called only from those functions. One of the removed functions is the old `checkAuthorizations`, so update some tests which were calling that directly to instead use different entry points. Part of #5681	2021-12-13 17:45:45 -08:00
Jacob Hoffman-Andrews	add5cfdb22	ra: check failed authorizations limit before attempting authz reuse (#5827 ) Part of #5826	2021-12-03 14:35:58 -08:00
Aaron Gable	99035226d8	Add name and key hashes to issuance.Certificate (#5812 ) These hashes are useful for OCSP computations, as they are the two values that are used to uniquely identify the issuer of the given cert in an OCSP request. Here, they are restricted to SHA1 only, as Boulder only supports SHA1 for OCSP, as per RFC 5019. In addition, because the `ID`, `NameID`, `NameHash`, and `KeyHash` are relatively expensive to compute, introduce a new constructor for `issuance.Certificate` that computes all four values at startup time and then simply returns the precomputed values when asked.	2021-12-01 12:11:10 -08:00
Aaron Gable	2a2629493d	Enable administrative revocation of malformed certs (#5813 ) Today, the revocation codepaths involve parsing the to-be-revoked certificate multiple times: inside `admin-revoker` itself, inside the RA's `AdministrativelyRevokeCertificate` method, and again in its helper `revokeCertificate`. In addition, we use the fact that we have the full certificate multiple times: to log various attributes of it, to compute its `IssuerNameID`, and more. All of this will fail if we ever issue a cert that is malformed to the point that it cannot be parsed. Add a new argument to the `AdministrativelyRevokeCertificateRequest` that allows the certificate to be identified by serial only, instead of by full certificate bytes. Add support for this in the gRPC handler by using the serial to construct a dummy in-memory Certificate object. Support this in the `revokeCertificate` codepath by checking to see if the passed-in cert has any underlying raw DER bytes, and if not, triggering the new codepath that does everything via the serial. In order to support this, unfortunately we have to add a second in-memory map to the RA, so that it can look up issuer certs by either name ID or old-style ID, as the IDs gleaned from the database (instead of from the cert itself) may still be old-style. This will be removed when the old-style Issuer IDs have aged out. Fixes #5759	2021-11-29 11:28:19 -08:00
Andrew Gabbitas	98d9a12ccd	Use authorization attemptedAt date for CAA recheck (#5746 ) When a valid authorization is stored in the database the authorization column attemptedAt is set based on the challenge `Validated` value. Use this value in `checkAuthorizationsCAA` to determine if an authorization is sufficiently stale to need a recheck of the CAA DNS record. Error if the time is nil. Keeps old codepath for safety check and increments a metric if the old codepath is used.	2021-11-04 14:50:11 -06:00
Aaron Gable	d7f143e979	Deprecate StreamlineOrderAndAuthzs flag (#5679 )	2021-10-11 14:34:03 -06:00
Aaron Gable	bab688b98f	Remove sa-wrappers.go (#5663 ) Remove the last of the gRPC wrapper files. In order to do so: - Remove the `core.StorageGetter` interface. Replace it with a new interface (whose methods include the `...grpc.CallOption` arg) inside the `sa/proto/` package. - Remove the `core.StorageAdder` interface. There's no real use-case for having a write-only interface. - Remove the `core.StorageAuthority` interface, as it is now redundant with the autogenerated `sapb.StorageAuthorityClient` interface. - Replace the `certificateStorage` interface (which appears in two different places) with a single unified interface also in `sa/proto/`. - Update all test mocks to include the `_ ...grpc.CallOption` arg in their method signatures so they match the gRPC client interface. - Delete many methods from mocks which are no longer necessary (mostly because they're mocking old authz1 methods that no longer exist). - Move the two `test/inmem/` wrappers into their own sub-packages to avoid an import cycle. - Simplify the `satest` package to satisfy one of its TODOs and to avoid an import cycle. - Add many methods to the `test/inmem/sa/` wrapper, to accommodate all of the methods which are called in unittests. Fixes #5600	2021-09-27 13:25:41 -07:00
Aaron Gable	6c85ae0f2c	expiration-mailer: improve search for renewals (#5673 ) Overhaul how expiration-mailer checks to see if a `certIsRenewed`. First, change the helper function to take the list of names (which can be hashed into an fqdnSet) and the issuance date. This allows the search for renewals to be a much simpler linear scan rather than an ugly outer left join. Second, update the query to examine both the `fqdnSets` and `fqdnSets_old` tables, to account for the fact that this code cares about more time (~90d) than the `fqdnSets` table currently holds. Also export the SA's `HashNames` method so it can be used by the mailer, and update the mailer's tests to use correct name hashes instead of fake human-readable hashes. Fixes #5672	2021-09-24 13:36:57 -07:00
Aaron Gable	4ef9fb1b4f	Add new SA.NewOrderAndAuthzs gRPC method (#5602 ) Add a new method to the SA's gRPC interface which takes both an Order and a list of new Authorizations to insert into the database, and adds both (as well as the various ancillary rows) inside a transaction. To enable this, add a new abstraction layer inside the `db/` package that facilitates inserting many rows at once, as we do for the `authz2`, `orderToAuthz2`, and `requestedNames` tables in this operation. Finally, add a new codepath to the RA (and a feature flag to control it) which uses this new SA method instead of separately calling the `NewAuthorization` method multiple times. Enable this feature flag in the config-next integration tests. This should reduce the failure rate of the new-order flow by reducing the number of database operations by coalescing multiple inserts into a single multi-row insert. It should also reduce the incidence of new authorizations being created in the database but then never exposed to the subscriber because of a failure later in the new-order flow, both by reducing failures overall and by adding those authorizations in a transaction which will be rolled back if there is a later failure. Fixes #5577	2021-09-03 13:48:04 -07:00
Samantha	d1d04c950e	GRCP: Replace `CountByNames_MapElement` with a real proto map (#5621 ) Fixes #5614	2021-09-03 13:12:52 -07:00
Andrew Gabbitas	818e01d3db	GRPC Unwrap: Make sa.NewOrder passthrough (#5615 ) * Make `sa.NewOrder` passthrough. * Create a new proto message `sapb.NewOrderRequest` that includes only the information needed to store a new order. Part of: #5533	2021-08-31 21:35:38 -06:00
Aaron Gable	1bf857ac09	Unwrap SA FQDNSet and PreviousCertificate existence methods (#5618 ) Fixes #5532	2021-08-31 09:22:16 -06:00
Samantha	5e8744c425	GRPC: Unwrap SA Count methods (#5616 ) - Make `CountRegistrationsByIP` a pass-through - Make `CountRegistrationsByIPRange` a pass-through - Make `CountOrders` a pass-through - Make `CountFQDNSets` a pass-through - Make `CountPendingAuthorizations2` a pass-through - Make `CountInvalidAuthorizations2` a pass-through Fixes #5535	2021-08-30 15:54:42 -07:00
Samantha	9d840f9b2f	GRPC: Unwrap sa.CountCertificatesByNames (#5612 ) Part of #5535	2021-08-30 15:02:44 -07:00
Samantha	279c759ca2	GRPC: Unwrap SA Authorization methods (#5589 ) - Make `GetAuthorization2` a pass-through - Make `GetAuthorizations2` a pass-through - Make `GetPendingAuthorization2` a pass-through - Make `GetValidOrderAuthorizations2` a pass-through - Make `GetValidAuthorizations2` a pass-through - Make `NewAuthorizations2` a pass-through - Make `FinalizeAuthorization2` a pass-through - Make `DeactivateAuthorization2` a pass-through Fixes #5534	2021-08-26 15:31:23 -07:00
Samantha	53b89707d5	GRPC: Unwrap ra.DeactivateAuthorization (#5567 ) - Move `DeactivateAuthorization` logic from `grpc` to `ra` and `wfe` - Update `ra` mocks in `wfe` tests - Remove unnecessary marshalling between `core.Authorization` and `corepb.Authorization` in `ra` tests. Fixes #5562	2021-08-12 11:30:57 -07:00
Aaron Gable	b7ce627572	Remove SA Registration gRPC wrappers (#5551 ) Remove all error checking and type transformation from the gRPC wrappers for the following methods on the SA: - GetRegistration - GetRegistrationByKey - NewRegistration - UpdateRegistration - DeactivateRegistration Update callers of these methods to construct the appropriate protobuf request messages directly, and to consume the protobuf response messages directly. In many cases, this requires changing the way that clients handle the `Jwk` field (from expecting a `JSONWebKey` to expecting a slice of bytes) and the `Contacts` field (from expecting a possibly-nil pointer to relying on the value of the `ContactsPresent` boolean field). Implement two new methods in `sa/model.go` to convert directly between database models and protobuf messages, rather than round-tripping through `core` objects in between. Delete the older methods that converted between database models and `core` objects, as they are no longer necessary. Update test mocks to have the correct signatures, and update tests to not rely on `JSONWebKey` and instead use byte slices. Fixes #5531	2021-08-04 13:33:41 -07:00
Samantha	2a5b9f651a	GRPC: Make ra.AdministrativelyRevokeCertificate a pass-through (#5558 ) - Move `AdministrativelyRevokeCertificate` logic from `grpc` to `ra` - Test new error conditions in `ra/ra_test.go` - Update `ra` mocks in `wfe` tests Fixes #5529	2021-08-02 13:52:00 -07:00
Andrew Gabbitas	f599da27cc	Make ra.NewAuthorization wrapper passthrough (#5553 ) Fixes: #5527	2021-08-02 13:09:59 -07:00
J.C. Jones	7b31bdb30a	Add read-only dbConns to SQLStorageAuthority and OCSPUpdater (#5555 ) This changeset adds a second DB connect string for the SA for use in read-only queries that are not themselves dependencies for read-write queries. In other words, this is attempting to only catch things like rate-limit `SELECT`s and other coarse-counting, so we can potentially move those read queries off the read-write primary database. It also adds a second DB connect string to the OCSP Updater. This is a little trickier, as the subsequent `UPDATE`s _are_ dependent on the output of the `SELECT`, but in this case it's operating on data batches, and a few seconds' replication latency are several orders of magnitude below the threshold for update frequency, so any certificates that aren't caught on run `n` can be caught on run `n+1`. Since we export DB metrics to Prometheus, this also refactors `InitDBMetrics` to take a DB Address (host:port tuple) and User out of the DB connection DSN and include those as labels in the metrics. Fixes #5550 Fixes #4985	2021-08-02 11:21:34 -07:00
Andrew Gabbitas	1681c365aa	Make ra.NewCertificate a passthrough (#5557 ) Fixes: #5528	2021-08-02 10:47:09 -07:00
Andrew Gabbitas	9133dba948	Make ra.DeactivateRegistration pass-through (#5522 ) Fixes: #5521	2021-07-13 11:40:09 -06:00
Aaron Gable	d405f9e616	Refactor lint library for go1.17 support (#5513 ) In go1.17, the `x509.CreateCertificate()` method fails if the provided Signer (private key) and Parent (cert with public key) do not match. This change both updates the lint library to create and use an issuer cert whose public key matches the throwaway private key used for lint signatures, and overhauls its public interface for readability and simplicity. Rename the `lint` library to `linter`, to allow other methods to be renamed to reduce word repetition. Reduce the linter library interface to three functions: `Check()`, `New()`, and `Linter.Check()` by making all helper functions private. Refactor the top-level `Check()` method to rely on `New()` and `Linter.Check()` behind the scenes. Finally, create a new helper method for creating a lint issuer certificate, call this new method from `New()`, and store the result in the `Linter` struct. Part of #5480	2021-07-09 10:29:10 -07:00
Andrew Gabbitas	74909367cd	RA: Make NewOrder wrapper passthrough (#5486 ) Fixes: #5436	2021-06-16 12:41:05 -06:00
Samantha	af9f1b250d	RA: Make PerformValidation wrapper a passthrough (#5478 ) - Move response validation from `RA` client wrapper to `WFE` and `WFE2` - Move request validation from `RA` server wrapper to `RA` - Refactor `RA` tests to construct valid `core.Authorization` objects - Consolidate multiple error declarations to global `errIncompleteGRPCRequest` Fixes #5439	2021-06-15 11:33:40 -07:00
Aaron Gable	8be32d3312	Use google.protobuf.Empty instead of core.Empty (#5454 ) Replace `core.Empty` with `google.protobuf.Empty` in all of our gRPC methods which consume or return an empty protobuf. The golang core proto libraries provide an empty message type, so there is no need for us to reinvent the wheel. This change is backwards-compatible and does not require a special deploy. The protobuf message descriptions of `core.Empty` and `google.protobuf.Empty` are identical, so their wire-formats are indistinguishable and therefore interoperable / cross-compatible. Fixes #5443	2021-06-03 14:17:41 -07:00
Andrew Gabbitas	5b235bd8eb	Change ra.UpdateRegistration sig to match grpc (#5449 ) Change ra.UpdateRegistration sig to match grpc Fixes: #5403	2021-06-01 11:55:33 -06:00
Aaron Gable	7455a8a32d	Make RevokeCertificateWithReg wrappers passthroughs (#5445 ) Update the signature of the RA's RevokeCertificateWithReg method to exactly match that of the gRPC method it implements. Remove all logic from the `RevokeCertificateWithReg` client and server wrappers. Move the small amount of checking they were performing directly into the server implementation. Fixes #5440	2021-06-01 08:42:32 -07:00
Andrew Gabbitas	6b45dce5f1	Make ra.UpdateRegistration wrapper pass-through (#5431 ) Part of: #5403 Fixes: #5398	2021-05-28 15:18:54 -06:00
Aaron Gable	40f9e38088	Add lower, faster duplicate certificate rate limit (#5401 ) Add a new rate limit, identical in implementation to the current `CertificatesPerFQDNSet` limit, intended to always have both a lower window and a lower threshold. This allows us to block runaway clients quickly, and give their owners the ability to fix and try again quickly (on the order of hours instead of days). Configure the integration tests to set this new limit at 2 certs per 2 hours. Also increase the existing limit from 5 to 6 certs in 7 days, to allow clients to hit the first limit three times before being fully blocked for the week. Also add a new integration test to verify this behavior. Note that the new ratelimit must have a window greater than the configured certificate backdate (currently 1 hour) in order to be useful. Fixes #5210	2021-05-17 14:50:29 -07:00
Andrew Gabbitas	5457680a9c	ra.NewRegistration: error ContactsPresent mismatch (#5399 ) Generate error if ra.NewRegistration receives RPC with ContactsPresent: false and non-empty Contacts list Fixes #5396	2021-04-23 18:05:32 -06:00
Andrew Gabbitas	5fdacbeaa6	grpc wrapper removal: Turn ra.NewRegistration into passthrough (#5397 ) Turn ra.NewRegistration into passthrough Fixes #5343	2021-04-23 13:53:22 -06:00
Aaron Gable	ef1d3c4cde	Standardize on `AssertMetricWithLabelsEquals` (#5371 ) Update all of our tests to use `AssertMetricWithLabelsEquals` instead of combinations of the older `CountFoo` helpers with simple asserts. This coalesces all of our prometheus inspection logic into a single function, allowing the deletion of four separate helper functions.	2021-04-01 15:20:43 -07:00
Aaron Gable	1f776ba768	Remove publisher gRPC wrapper (#5327 ) Delete the PublisherClientWrapper and PublisherServerWrapper. Update various structs and functions to expect a pubpb.PublisherClient instead of a core.Publisher; these two interfaces differ only in that the auto-generated PublisherClient takes a variadic CallOptions parameter. Update all mock publishers in tests to match the new interface. Finally, delete the now-unused core.Publisher interface and some already-unused mock-generating code. This deletes a single sanity check (for a nil SCT even when there is a nil error), but that check was redundant with an identical check in the only extant client code in ctpolicy.go. Fixes #5323	2021-03-11 10:50:29 -08:00
Aaron Gable	993953bf81	Remove ca gRPC wrapper (#5330 ) Delete the CertificateAuthorityClientWrapper, OCSPGeneratorClientWrapper, and CertificateAuthorityServerWrapper structs, which provided no error checking above and beyond their wrapped types. Replace them with the corresponding auto-generated gRPC types in calling code. Update some mocks to have the necessary variadic grpc.CallOption parameter. Finally, delete the now-unused core.CertificateAuthority interface. Fixes #5324	2021-03-11 10:45:46 -08:00
Andrew Gabbitas	f5362fba24	Add Validated time field to challenges (#5288 ) Move the validated timestamp to the RA where the challenge is passed to the SA for database storage. If a challenge becomes valid or invalid, take the validated timestamp and store it in the attemptedAt field of the authz2 table. Upon retrieval of the challenge from the database, add the attemptedAt value to challenge.Validated which is passed back to the WFE and presented to the user as part of the challenge as required in ACME RFC8555. Fix: #5198	2021-03-10 14:39:59 -08:00
Samantha	e0510056cc	Enhancements to SQL driver tuning via JSON config (#5235 ) Historically the only database/sql driver setting exposed via JSON config was maxDBConns. This change adds support for maxIdleConns, connMaxLifetime, connMaxIdleTime, and renames maxDBConns to maxOpenConns. The addition of these settings will give our SRE team a convenient method for tuning the reuse/closure of database connections. A new struct, DBSettings, has been added to SA. The struct, and each of it's fields has been commented. All new fields have been plumbed through to the relevant Boulder components and exported as Prometheus metrics. Tests have been added/modified to ensure that the fields are being set. There should be no loss in coverage Deployability concerns for the migration from maxDBConns to maxOpenConns have been addressed with the temporary addition of the helper method cmd.DBConfig.GetMaxOpenConns(). This method can be removed once test/config is defaulted to using maxOpenConns. Relevant sections of the code have TODOs added that link back to an newly opened issue. Fixes #5199	2021-01-25 15:34:55 -08:00
Aaron Gable	400bf3a02a	Allow WFEv1 to specify which issuer to use (#5222 ) We intend to delete the v1 API (i.e. `wfe` and its associated codepaths) in the near future, and as such are not giving it new features or capabilities. However, before then we intend to allow the v2 API to provide issuance both from our RSA and from our ECDSA intermediates. The v1 API cannot gain such capability at the same time. The CA doesn't know which frontend originated any given issuance request, so we can't simply gate the single- or double-issuer behavior based on that. Instead, this change introduces the ability for the WFE (and the RA, which sits between the WFE and the CA) to request issuance from a specific intermediate. If the specified intermediate is not available in the CA, issuance will fail. If no intermediate is specified (as is the case in requests coming from wfe2), it falls back to selecting the issuer based on the algorithm of the public key to be signed. Fixes #5216	2021-01-20 09:22:03 -08:00
Aaron Gable	cc66969bec	Remove nearly-unused WrongAuthorizationState error (#5176 ) This error class is only used in one instance, and when returned to the user it is transformed into a `probs.Malformed` anyway. It does more harm than good to keep this one-off BoulderError around, as it introduces confusion about what sorts of errors we expose to the client. Fixes #5167	2020-11-12 15:38:16 -08:00
Aaron Gable	16c7a21a57	RA: Multi-issuer support for OCSP purging (#5160 ) The RA is responsible for contacting Akamai to purge cached OCSP responses when a certificate is revoked and fresh OCSP responses need to be served ASAP. In order to do so, it needs to construct the same OCSP URLs that clients would construct, and that Akamai would cache. In order to do that, it needs access to the issuing certificate to compute a hash across its Subject Info and Public Key. Currently, the RA holds a single issuer certificate in memory, and uses that cert to compute all OCSP URLs, on the assumption that all certs we're being asked to revoke were issued by the same issuer. In order to support issuance from multiple intermediates at the same time (e.g. RSA and ECDSA), and to support rollover between different issuers of the same type (we may need to revoke certs issued by two different issuers for the 90 days in which their end-entity certs overlap), this commit changes the configuration to provide a list of issuer certificates instead. In order to support efficient lookup of issuer certs, this change also introduces a new concept, the Chain ID. The Chain ID is a truncated hash across the raw bytes of either the Issuer Info or the Subject Info of a given cert. As such, it can be used to confirm issuer/subject relationships between certificates. In the future, this may be a replacement for our current IssuerID (a truncated hash over the whole issuer certificate), but for now it is used to map revoked certs to their issuers inside the RA. Part of #5120	2020-11-06 13:58:32 -08:00
Aaron Gable	294d1c31d7	Use error wrapping for berrors and tests (#5169 ) This change adds two new test assertion helpers, `AssertErrorIs` and `AssertErrorWraps`. The former is a wrapper around `errors.Is`, and asserts that the error's wrapping chain contains a specific (i.e. singleton) error. The latter is a wrapper around `errors.As`, and asserts that the error's wrapping chain contains any error which is of the given type; it also has the same unwrapping side effect as `errors.As`, which can be useful for further assertions about the contents of the error. It also makes two small changes to our `berrors` package, namely making `berrors.ErrorType` itself an error rather than just an int, and giving `berrors.BoulderError` an `Unwrap()` method which exposes that inner `ErrorType`. This allows us to use the two new helpers above to make assertions about berrors, rather than having to hand-roll equality assertions about their types. Finally, it takes advantage of the two changes above to greatly simplify many of the assertions in our tests, removing conditional checks and replacing them with simple assertions.	2020-11-06 13:17:11 -08:00
Samantha	63eafa4a3b	ra: replacing error assertions with errors.As (#5140 ) Part of #5010	2020-10-26 15:37:00 -07:00
Jacob Hoffman-Andrews	cfe943fea5	Factor out idForIssuer. (#5102 ) This was already part done: There is an ID() method in issuance. This change extends that by: - Defining a type alias indicating something is an IssuerID. - Defining issuance.Certificate, which also has an ID() method, so that components that aren't the CA can load certificates and use the type system to mark them as issuers (and get their IDs). - Converting akamai-purger and ca to use the new types. - Removing idForIssuer from ca.go.	2020-10-06 15:04:43 -07:00
Jacob Hoffman-Andrews	bf7c80792d	core: move to proto3 (#5063 ) Builds on #5062 Part of #5050	2020-08-31 17:58:32 -07:00
Jacob Hoffman-Andrews	1d31d60450	test: improve quality of some mocks (#5054 ) As part of #5050, I'm updating some of the code in grpc/pb-marshaling.go to move from nil checks to zero checks. In the process I'm introducing some new zero checks, on things like challenge type, status, and token. This is shaking out some places where our mocks have taken shortcuts by not creating a "full" object including all fields that are normally present. This PR updates our mocks and tests to provide more realistic objects in all the places that broke when introducing those zero checks.	2020-08-27 09:42:22 -07:00
Jacob Hoffman-Andrews	8dd386b6bc	SA: Update RPC interface to proto3 (#5043 ) One slightly surprising / interesting thing: Since core types like Order and Registration are still proto2 and have pointer fields, there are actually some places in this PR where I had to add a `*` rather than delete an `&`, because I was taking a pointer field from one of those core types and passing it as a field in an SA RPC request. Fixes #5037.	2020-08-25 10:28:41 -07:00
Aaron Gable	4d72f1f60e	RA: Update RPC interface to proto3 (#5039 ) Updates the Registration Authority to use proto3 for its RPC methods. This turns out to be a fairly minimal change, as many of the RA's request and response messages are defined in core.proto, and are therefore still proto2. Fixes #4955	2020-08-24 13:00:41 -07:00
Aaron Gable	8556d8a801	Update VA RPCs to proto3 (#5005 ) This updates va.proto to use proto3 syntax, and updates all clients of the autogenerated code to use the new types. In particular, it removes indirection from built-in types (proto3 uses ints, rather than pointers to ints, for example). Depends on #5003 Fixes #4956	2020-08-17 15:20:51 -07:00
Aaron Gable	32d56ae1e6	Make ra.PerformValidation resilient to va failure (#5028 ) ra.PerformValidation's goroutine surfaces errors not by returning them, but by accumulating them into the `prob`variable and saving them to the database. This makes it possible for processing to continue even in error cases when it should (mostly) halt. This change fixes a bug where we would try to access a member of the result returned from va.PerformValidation, even if that function call had returned an error.	2020-08-17 12:29:33 -07:00
Aaron Gable	e2c8f6743a	Introduce new core.AcmeChallenge type (#5012 ) ACME Challenges are well-known strings ("http-01", "dns-01", and "tlsalpn-01") identifying which kind of challenge should be used to verify control of a domain. Because they are well-known and only certain values are valid, it is better to represent them as something more akin to an enum than as bare strings. This also improves our ability to ensure that an AcmeChallenge is not accidentally used as some other kind of string in a different context. This change also brings them closer in line with the existing core.AcmeResource and core.OCSPStatus string enums. Fixes #5009	2020-08-11 15:02:16 -07:00
Aaron Gable	0f5d2064a8	Remove logic from VA PerformValidation wrapper (#5003 ) Updates the type of the ValidationAuthority's PerformValidation method to be identical to that of the corresponding auto-generated grpc method, i.e. directly taking and returning proto message types, rather than exploded arguments. This allows all logic to be removed from the VA wrappers, which will allow them to be fully removed after the migration to proto3. Also updates all tests and VA clients to adopt the new interface. Depends on #4983 (do not review first four commits) Part of #4956	2020-08-06 10:45:35 -07:00
Roland Bracewell Shoemaker	7853b12cb3	Remove support for issuing certificates with no CN (#5008 ) We'd like to issue certs with no CN eventually, but it's not going to happen any time soon. In the mean time, the existing code never gets exercised and is rather complex, so this removes it.	2020-08-05 09:15:30 -07:00
Aaron Gable	46d7ed0a29	Anchor all referenced loop variables (#4991 )	2020-07-29 12:57:30 -07:00
Aaron Gable	ffdae2d338	Return proto from ca.IssueCertificateFromPrecertificate (#4982 ) This is the only method on the ca which uses a non-proto type as its request or response value. Changing this to use a proto removes the last logic from the wrappers, allowing them to be removed in a future CL. It also makes the interface more uniform and easier to reason about. Issue: #4940	2020-07-23 18:39:10 -07:00
Aaron Gable	3a03e86e89	Standardize all proto import names (#4970 ) We previously used mixed case names for proto imports (e.g. both `caPB` and `rapb`), sometimes in the same file. This change standardizes on the all-lowercase spelling, which was predominant throughout the codebase.	2020-07-20 16:29:17 -07:00
Aaron Gable	7e626b63a6	Temporarily revert CA and VA proto3 migrations (#4962 )	2020-07-16 14:29:42 -07:00
Aaron Gable	281575433b	Switch VA RPCs to proto3 (#4960 ) This updates va.proto to use proto3 syntax, and updates all clients of the autogenerated code to use the new types. In particular, it removes indirection from built-in types (proto3 uses ints, rather than pointers to ints, for example). Fixes #4956	2020-07-16 09:16:23 -07:00
Phil Porada	2afb087183	Add revocation reason metric to RA (#4957 )	2020-07-15 13:30:15 -07:00
Aaron Gable	91d4e235ad	Deprecate the BlockedKeyTable feature flag (#4881 ) This commit consists of three classes of changes: 1) Changing various command main.go files to always behave as they would have when features.BlockedKeyTable was true. Also changing one test in the same manner. 2) Removing the BlockedKeyTable flag from configuration in config-next, because the flag is already live. 3) Moving the BlockedKeyTable flag to the "deprecated" section of features.go, and regenerating featureflag_strings.go. A future change will remove the BlockedKeyTable flag (and other similarly deprecated flags) from features.go entirely. Fixes #4873	2020-06-22 16:35:37 -07:00
Jacob Hoffman-Andrews	6f4966cc0f	Check email address validity in notify-mailer. (#4841 ) This required a refactoring: Move validateEmail from the RA to ValidEmail in the `policy` package. I also moved `ValidDomain` from a method on PolicyAuthority to a standalone function so that ValidEmail can call it. notify-mailer will now log invalid addresses and skip them without attempting to send mail. Since @example.com addresses are invalid, I updated the notify-mailer test, which used a lot of such addresses. Also, now when notify-mailer receives an unrecoverable error sending mail, it logs the email address and what offset within the list it was.	2020-06-04 18:28:02 -07:00
Roland Bracewell Shoemaker	97390560a3	Handful of revocation pkg cleanups (#4801 ) When we originally added this package (4 years ago) x/crypto/ocsp didn't have its own list of revocation reasons, so we added our own. Now it does have its own list, so just use that list instead of duplicating code for no real reason. Also we build a list of the revocation reasons we support so that we can tell users when they try to use an unsupported one. Instead of building this string every time, just build it once it during package initialization. Finally return the same error message in wfe that we use in wfe2 when a user requests an unsupported reason.	2020-04-30 17:29:42 -07:00
Roland Bracewell Shoemaker	9df97cbf06	Add a blocked keys table, and use it (#4773 ) Fixes #4712 and fixes #4711.	2020-04-15 13:42:51 -07:00
Jacob Hoffman-Andrews	36c1f1ab2d	Deprecate some feature flags (#4771 ) Deprecate some feature flags. These are all enabled in production.	2020-04-13 15:49:55 -07:00
Jacob Hoffman-Andrews	72deb5b798	gofmt code with -s (simplify) flag (#4763 ) Found by golangci-lint's `gofmt` linter.	2020-04-08 17:25:35 -07:00
Jacob Hoffman-Andrews	3a1a08a10b	Remove unused code. (#4722 ) Found by staticcheck.	2020-03-27 11:55:42 -07:00
alexzorin	0dd8f41c1d	ra: forbid mailto contacts that contain hfields (#4694 ) https://tools.ietf.org/html/rfc8555#section-7.3 Clients MUST NOT provide a "mailto" URL in the "contact" field that contains "hfields" [RFC6068] or more than one "addr-spec" in the "to" component. If a server encounters a "mailto" contact URL that does not meet these criteria, then it SHOULD reject it as invalid.	2020-03-11 17:15:23 -07:00
Daniel McCarney	f1894f8d1d	tidy: typo fixes flagged by codespell (#4634 )	2020-01-07 14:01:26 -05:00
Daniel McCarney	97bd7c53dc	RA: Don't extend valid authorization time by pending expiry. (#4619 ) The RA should set the expiry of valid authorizations based only on the current time and the configured authorizationLifetime. It should not extend the pending authorization's lifetime by the authorizationLifetime. Resolves #4617 I didn't gate this with a feature flag. If we think this needs an API announcement and gradual rollout (I don't personally think this change deserves that) then I think we should change the RA config's authorizationLifetimeDays value to 37 days instead of adding a feature flag that we'll have to clean up after the flag date. We can change it back to 30 after the flag date.	2019-12-19 10:11:23 -08:00
Roland Bracewell Shoemaker	5b2f11e07e	Switch away from old style statsd metrics wrappers (#4606 ) In a handful of places I've nuked old stats which are not used in any alerts or dashboards as they either duplicate other stats or don't provide much insight/have never actually been used. If we feel like we need them again in the future it's trivial to add them back. There aren't many dashboards that rely on old statsd style metrics, but a few will need to be updated when this change is deployed. There are also a few cases where prometheus labels have been changed from camel to snake case, dashboards that use these will also need to be updated. As far as I can tell no alerts are impacted by this change. Fixes #4591.	2019-12-18 11:08:25 -05:00
Daniel McCarney	6ed62cf746	RA: reject Contacts that marshal too long for DB. (#4575 ) In the deep dark history of Boulder we ended up jamming contacts into a VARCHAR db field. We need to make sure that when contacts are marshaled the resulting bytes will fit into the column or a 500 will be returned to the user when the SA RPC fails. One day we should fix this properly and not return a hacky error message that's hard for users to understand. Unfortunately that will likely require a migration or a new DB table. In the shorter term this hack will prevent 500s which is a clear improvement.	2019-11-22 15:13:53 -05:00
Daniel McCarney	fde145ab96	RA: implement stricter email validation. (#4574 ) Prev. we weren't checking the domain portion of an email contact address very strictly in the RA. This updates the PA to export a function that can be used to validate the domain the same way we validate domain portions of DNS type identifiers for issuance. This also changes the RA to use the `invalidEmail` error type in more places. A new Go integration test is added that checks these errors end-to-end for both account creation and account update.	2019-11-22 13:39:31 -05:00
Daniel McCarney	a86ed0f753	RA: fix error returned through WFE2 for too big NewOrders. (#4572 ) We need the RA's `NewOrder` RPC to return a `berrors.Malformed` instance when there are too many identifiers. A bare error will be turned into a server internal problem by the WFE2's `web.ProblemDetailsForError` call while a `berrors.Malformed` will produce the expected malformed problem. This commit fixes the err, updates the unit test, and adds an end-to-end integration test so we don't mess this up again.	2019-11-21 13:54:49 -05:00
Roland Bracewell Shoemaker	e402156c1c	Revert "Revert "Remove remaining old format authorization code from SA/… (#4502 )" (#4524 ) This reverts commit `dc2ce4ca84`.	2019-11-04 09:45:19 -05:00
stilez	2461c9fc5b	PA: make user-facing error messages more intuitive (#4513 ) LE is popular and aims to popularise certificate issuance. End users who see error messages cannot be assumed to be as DNS-experienced as previously. The user-facing error messages in the policy authority file are terse and unobvious to the point that they are often unlikely to be well understood by those they are intended to inform, who may be "just trying to get a LE cert for their domain".	2019-11-01 10:25:55 -04:00
Jacob Hoffman-Andrews	329e4154cd	Deprecate EarlyOrderRateLimit and FasterGetOrder (#4497 ) These feature flags are already turned on in production.	2019-10-24 10:47:29 -07:00
Roland Bracewell Shoemaker	dc2ce4ca84	Revert "Remove remaining old format authorization code from SA/… (#4502 ) We need to apply some fixes for bugs introduced in #4476 before it can be deployed, as such we need to revert #4495 as there needs to be a full deploy cycle between these two changes. This reverts commit `3ae1ae1`. 😭	2019-10-23 10:45:29 -07:00
Roland Bracewell Shoemaker	3ae1ae1493	Remove remaining old format authorization code from SA/protos (#4495 )	2019-10-23 09:08:38 -04:00
Roland Bracewell Shoemaker	46e0468220	Make authz2 the default storage format (#4476 ) This change set makes the authz2 storage format the default format. It removes most of the functionality related to the previous storage format, except for the SA fallbacks and old gRPC methods which have been left for a follow-up change in order to make these changes deployable without introducing incompatibilities. Fixes #4454.	2019-10-21 15:29:15 -04:00
Roland Bracewell Shoemaker	fba6104d8e	RA: Check the number of names in NewOrder (#4444 )	2019-09-23 09:31:07 -04:00
Roland Bracewell Shoemaker	9df9c21ddc	Use sub-problems for the certificates per name rate limit (#4416 ) Fixes #4360.	2019-09-09 09:20:05 -07:00
Roland Bracewell Shoemaker	a585f23365	Add feature flag for disabling new domain validations in the V1… (#4385 ) Fixes #4307.	2019-08-05 11:34:51 -07:00
Jacob Hoffman-Andrews	4628c79239	Check invalid authorization limit in parallel. (#4348 ) Fixes #3069.	2019-07-19 13:37:12 -07:00
Roland Bracewell Shoemaker	0d9b48e280	PA: restructure error for single bad name in multi-name req (#4319 )	2019-07-03 13:47:31 -04:00
Jacob Hoffman-Andrews	9094862051	ra/sa: clean up CountCertificatesExact. (#4309 ) In #4179 we added a different method of counting the certificatesPerName rate limit that can provide the correct behavior for exact public suffix matches without the need for a separate RPC call. This cleans up the separate code paths in the SA and RA that are no longer necesary.	2019-06-28 12:57:14 -04:00
Daniel McCarney	2d1a0d8e48	ra/pa: fix suberrors for single error case. (#4305 ) If there is only one overall error then there is no reason to include it as a sub-error, just return a top level error without any sub-errors.	2019-06-27 13:22:38 -04:00
Daniel McCarney	daf311f41b	RA: fix suberror identifier bug in CAA recheck. (#4259 ) In the RA's recheckCAA function we loop through a list of core.Authorizations, dispatching each to a Go routine that checks CAA for the authz and writes an error to a results channel. Later, we iterate the same core.Authorization list and read errors from the channel. If we get a non-nil error, then the current iteration's *core.Authorization is used as the identifier for the suberror created with the non-nil error. This is a flawed approach and relies on the scheduling of recheck goroutines matching the iteration of the authorizations. When the goroutines write error results to the channel in an order that doesn't match the loop over the authorizations the RA will construct a suberror with the wrong identifier. This manifests as making the TestRecheckCAAFail unit test appear flaky, because it specifically checks the expected identifiers in the returned subproblems. The fix involves writing both the checked authorization and the error result to the results channel. Later instead of iterating the authorizations we just read the correct number of results from the channel and use the attached authorization from the result when constructing a suberror. Resolves #4248 Take away lessons: Write unit tests and always verify expected values! Always investigate flaky unit tests! Sometimes there's a real bug and not just a subpar test :-)	2019-06-12 09:13:47 -07:00
Daniel McCarney	caf655353e	RA: Use suberrors when rechecking CAA. (#4240 ) When Boulder's RA rechecks CAA for a set of authorization identifiers it should use suberrors to make it easy to identify which of a possible 100 identifiers had a CAA issue at order finalization time. Updates #4193 Resolves #4235	2019-05-31 15:36:47 -07:00
Daniel McCarney	7dd176e9a4	Implement suberrors for policy blocked names. (#4234 ) When validating a CSR's identifiers, or a new order's identifiers there may be more than one identifier that is blocked by policy. We should return an error that has suberrors identifying each bad identifier individually in this case. Updates https://github.com/letsencrypt/boulder/issues/4193 Resolves https://github.com/letsencrypt/boulder/issues/3727	2019-05-31 15:00:17 -07:00
Roland Bracewell Shoemaker	6f93942a04	Consistently used stdlib context package (#4229 )	2019-05-28 14:36:16 -04:00
Roland Bracewell Shoemaker	824d0c4ab0	Include email address in parsing error (#4231 ) * Include email address in parsing error * limit address length in error to 254 chars	2019-05-28 14:09:52 -04:00
Daniel McCarney	ea9871de1e	core: split identifier types into separate package. (#4225 ) This will allow implementing sub-problems without creating a cyclic dependency between `core` and `problems`. The `identifier` package is somewhat small/single-purpose and in the future we may want to move more "ACME" bits beyond the `identifier` types into a dedicated package outside of `core`.	2019-05-23 13:24:41 -07:00
Jacob Hoffman-Andrews	0d2e41fdc8	Bring back errcheck, accidentally removed from tests. (#4212 )	2019-05-14 17:07:03 -04:00
Daniel McCarney	443c949180	tidy: cleanup JSON hostname policy support. (#4214 ) We transitioned this data to YAML to have support for comments and can remove the legacy JSON support/test data.	2019-05-14 17:06:36 -04:00
Roland Bracewell Shoemaker	6e06f36309	Use new SA authorization methods in RA (#4184 ) Fixes #4177.	2019-05-13 12:40:13 -04:00
Jacob Hoffman-Andrews	0759d2d440	cmd: Split out config structs (#4200 ) This follows up on some refactoring we had done previously but not completed. This removes various binary-specific config structs from the common cmd package, and moves them into their appropriate packages. In the case of CT configs, they had to be moved into their own package to avoid a dependency loop between RA and ctpolicy.	2019-05-06 11:11:08 -04:00
Jacob Hoffman-Andrews	825277f62e	tests: Remove global SupportedChallenges in ra_test.go (#4198 ) In general the RA test has an anti-pattern of too much global state. This piece in particular got a little outdated over time, and there were some incorrect comments littered around the code about what SupportedChallenges did or did not contain. This change removes most cases where it was overridden, and moves the main definition into initAuthorities.	2019-05-06 09:18:35 -04:00
Jacob Hoffman-Andrews	2d59d7ee8e	Remove an old unneeded test case in RA. (#4197 ) We introduced a test case called "TestUpdateAuthorizationNewRPC" when we added the "PerformValidation" RPC at the SA. It mostly duplicated the TestUpdateAuthorization test case that already existed. They evolved alongside over the years, but we should have cleaned up the "NewRPC" variant and removed the old one it was copied from a long time ago.	2019-05-03 15:26:47 -07:00
Jacob Hoffman-Andrews	935df44851	Move "Combinations" support to WFE1. (#4155 ) Early ACME drafts supported a notion of "combinations" of challenges that had to be completed together. This was removed from subsequent drafts. Boulder has only ever supported "combinations" that exactly map to the list of challenges, 1 for 1. This removes all the plumbing for combinations, and adds a list of combinations to the authz JSON right before marshaling it in WFE1.	2019-04-16 11:31:51 -07:00
Jacob Hoffman-Andrews	ff3129247d	Put features.Reset in unitest setup functions. (#4129 ) Previously we relied on each instance of `features.Set` to have a corresponding `defer features.Reset()`. If we forget that, we can wind up with unexpected behavior where features set in one test case leak into another test case. This led to the bug in https://github.com/letsencrypt/boulder/issues/4118 going undetected. Fix #4120	2019-04-02 10:14:38 -07:00
Jacob Hoffman-Andrews	677b9b88ad	Remove GSB support. (#4115 ) This is no longer enabled in prod; cleaning up the code. https://community.letsencrypt.org/t/let-s-encrypt-no-longer-checking-google-safe-browsing/82168	2019-03-15 10:24:44 -07:00
Jacob Hoffman-Andrews	d1e6d0f190	Remove TLS-SNI-01 (#4114 ) * Remove the challenge whitelist * Reduce the signature for ChallengesFor and ChallengeTypeEnabled * Some unit tests in the VA were changed from testing TLS-SNI to testing the same behavior in TLS-ALPN, when that behavior wasn't already tested. For instance timeouts during connect are now tested. Fixes #4109	2019-03-15 09:05:24 -04:00
Daniel McCarney	204ff0446f	RA: forbid finalizing pending orders. (#4108 ) This change was made on the WFE2 side but I missed making it in the RA.	2019-03-12 15:09:35 -04:00
Jacob Hoffman-Andrews	0bd407bd59	RA: Remove issuanceExpvar. (#4091 ) We've replaced this functionality with Prometheus variables, and there are no longer alerts that need it.	2019-02-28 08:15:31 -05:00
Daniel McCarney	16e464a37d	RA: apply certificate rate limits at NewOrder time. (#4074 ) If an order for a given set of names will fail finalization because of certificate rate limits (certs per domain, certs per fqdn set) there isn't any point in allowing an order for those names to be created. We can stop a lot of requests earlier by enforcing the cert rate limits at new order time as well as finalization time. A new RA `EarlyOrderRateLimit` feature flag controls whether this is done or not. Resolves #3975	2019-02-21 11:02:40 -08:00
Roland Bracewell Shoemaker	3e54cea295	Implement direct revocation at RA (#4043 ) Implements a feature that enables immediate revocation instead of marking a certificate revoked and waiting for the OCSP-Updater to generate the OCSP response. This means that as soon as the request returns from the WFE the revoked OCSP response should be available to the user. This feature requires that the RA be configured to use the standalone Akamai purger service. Fixes #4031.	2019-02-14 14:47:42 -05:00
Roland Bracewell Shoemaker	232a5f828f	Fix ineffectual assignments (#4052 ) * in boulder-ra we connected to the publisher and created a publisher gRPC client twice for no apparent reason * in the SA we ignored errors from `getChallenges` in `GetAuthorizations` which could result in a nil challenge being returned in an authorization	2019-02-13 15:39:58 -05:00
Roland Bracewell Shoemaker	3129c57bb8	Require email domains end in a IANA suffix (#4037 )	2019-01-28 17:05:58 -08:00
Daniel McCarney	b4c13ce6c7	RA: fix valid authz reuse control for V2 newOrder. (#4027 ) Before fixing `ra.NewOrder` the `TestNewOrderAuthzReuseDisabled` test from this branch fails as expected based on #4026: ``` === RUN TestNewOrderAuthzReuseDisabled --- FAIL: TestNewOrderAuthzReuseDisabled (0.24s) ra_test.go:2477: "reused-valid-authz" == "reused-valid-authz" FAIL FAIL github.com/letsencrypt/boulder/ra 0.270s ``` Afterwards it passes: ``` === RUN TestNewOrderAuthzReuseDisabled --- PASS: TestNewOrderAuthzReuseDisabled (0.26s) PASS ok github.com/letsencrypt/boulder/ra 1.291s ```	2019-01-23 13:00:44 -05:00
Roland Bracewell Shoemaker	93ac7fbe9e	Modify authorization creation to allow for new style storage schema (#3998 ) Adds a feature which gates creation of authorizations following the style required for the new schema (and which can be used for gating the reset of our new schema code later down the road). There was an internal discussion about an issue this creates regarding a predictable ordering of challenges within a challenge due to sequential challenge IDs which will always be static for each challenge type. It was suggested we could add some kind of obfuscation to the challenge ID when presented to the user to prevent this. This hasn't been done in this PR as it would only be focused in the WFE and would be better suited as its own changeset. Fixes #3981.	2019-01-17 17:09:38 -08:00
Daniel McCarney	8f5de538c1	RA: Add PerformValidation RPC to replace UpdateAuthorization. (#3942 ) The existing RA `UpdateAuthorization` RPC needs replacing for two reasons: 1. The name isn't accurate - `PerformValidation` better captures the purpose of the RPC. 2. The `core.Challenge` argument is superfluous since Key Authorizations are not sent in the initiation POST from the client anymore. The corresponding unmarshal and verification is now removed. Notably this means broken clients that were POSTing the wrong thing and failing pre-validation will now likely fail post-validation. To remove `UpdateAuthorization` the new `PerformValidation` RPC is added alongside the old one. WFE and WFE2 are updated to use the new RPC when the perform validation feature flag is enabled. We can remove `UpdateAuthorization` and its associated wrappers once all WFE instances have been updated. Resolves https://github.com/letsencrypt/boulder/issues/3930	2018-11-28 10:12:47 -05:00
Roland Bracewell Shoemaker	465be64f3f	Remove unnecessary usage of UpdatePendingAuthorization in RA (#3927 ) Removes superfluous usage of `UpdatePendingAuthorization` in the RA to update the key authorization and test if the authorization is pending and instead uses the result of the initial `GetAuthorization` call in the WFE. Fixes #3923.	2018-11-12 17:23:56 -08:00
Roland Bracewell Shoemaker	a9a0846ee9	Remove checks for deployed features (#3881 ) Removes the checks for a handful of deployed feature flags in preparation for removing the flags entirely. Also moves all of the currently deprecated flags to a separate section of the flags list so they can be more easily removed once purged from production configs. Fixes #3880.	2018-10-17 20:29:18 -07:00
Daniel McCarney	16a846b721	RA: Use t.Helper in assertAuthzEqual (#3864 ) This ensures the assertAuthzEqual helper doesn't obscure the location of the test that failed.	2018-09-20 09:47:44 -07:00
Roland Bracewell Shoemaker	196f019851	Add support for temporal CT logs (#3853 ) Required a little bit of rework of the RA issuance flow (to add parsing of the precert to determine the expiration date, and moving final cert parsing before final cert submission) and RA tests, but I think it shouldn't create any issues... Fixes #3197.	2018-09-14 16:14:42 -07:00
Daniel McCarney	d39babdcf3	RA: Remove vestigial DNS config/setup. (#3854 ) In `db01b0b` we removed email validation from the RA. This was the only use of the `bdns` package by the RA and so we can go one step further and delete the remaining setup, configuration and `bdns` fields.	2018-09-13 13:39:23 -04:00
Daniel McCarney	db01b0b5bc	RA: Remove email DNS validations. (#3851 ) Performing DNS lookups to check the A/AAAA/MX records of a provided contact e-mail address adds variability to the RA's NewRegistration/UpdateRegistration functions and requires that the RA be able to reach out to the EFN. Since this is simply a convenience to prevent some classes of registration errors we can remove it to improve performance and to tighten up our security posture slightly. Resolves https://github.com/letsencrypt/boulder/issues/3849	2018-09-11 18:42:34 -07:00
Daniel McCarney	c490ec457f	SA: statusForOrder shouldn't fetch authzs for expired orders. (#3843 ) If an order is expired the status is invalid and we don't need to get any of the order's authorizations. Its important to exit early in this case because expired authorizations may be purged from the DB. Fetching the authz's for an expired order may return less authz objects than expected, triggering a 500 error response. Resolves https://github.com/letsencrypt/boulder/issues/3839	2018-09-05 11:52:28 -07:00
Roland Bracewell Shoemaker	e27f370fd3	Excise code relating to pre-SCT embedding issuance flow (#3769 ) Things removed: * features.EmbedSCTs (and all the associated RA/CA/ocsp-updater code etc) * ca.enablePrecertificateFlow (and all the associated RA/CA code) * sa.AddSCTReceipt and sa.GetSCTReceipt RPCs * publisher.SubmitToCT and publisher.SubmitToSingleCT RPCs Fixes #3755.	2018-06-28 08:33:05 -04:00
Daniel McCarney	8583e42964	RA: Forbid contact addresses for IANA example domains. (#3748 ) We see a fair number of ACME accounts/registrations with contact addresses for the RFC2606 Section 3 "Reserved Example Second Level Domain Names" (`example.com`, `example.net`, `example.org`). These are not real contact addresses and are likely the result of the user copy-pasting example configuration. These users will miss out on expiration emails and other subscriber communications :-( This commit updates the RA's `validateEmail` function to reject any contact addresses for reserved example domain names. The corresponding unit test is updated accordingly. Resolves https://github.com/letsencrypt/boulder/issues/3719	2018-06-08 13:42:51 -04:00
Joel Sing	2540d59296	Implement CAA validation-methods checking. (#3716 ) When performing CAA checking respect the validation-methods parameter (if present) and restrict the allowed authorization methods to those specified. This allows a domain to restrict authorization methods that can be used with Let's Encrypt. This is largely based on PR #3003 (by @lukaslihotzki), which was landed and then later reverted due to issue #3143. The bug the resulted in the previous code being reverted has been addressed (likely inadvertently) by `76973d0f`. This implementation also includes integration tests for CAA validation-methods. Fixes issue #3143.	2018-05-23 14:32:31 -07:00
Joel Sing	5bc7fe639d	Distinguish between recheckCAA failures. (#3710 ) When rechecking CAA, the existing code maps all failures to a CAAError. This means that any other non-CAA failure (for example, an internal server error) gets hidden. Avoid this by reworking recheckCAA to return errors and if we find a non-CAAError, we return that directly. Revise tests to cover both situations. Updates issue #3143.	2018-05-15 17:57:35 -07:00
Joel Sing	f8a023e49c	Remove various unnecessary uses of fmt.Sprintf (#3707 ) Remove various unnecessary uses of fmt.Sprintf - in particular: - Avoid calls like t.Error(fmt.Sprintf(...)), where t.Errorf can be used directly. - Use strconv when converting an integer to a string, rather than using fmt.Sprintf("%d", ...). This is simpler and can also detect type errors at compile time. - Instead of using x.Write([]byte(fmt.Sprintf(...))), use fmt.Fprintf(x, ...).	2018-05-11 11:55:25 -07:00
Roland Bracewell Shoemaker	0e6713e573	Randomize order of CT logs when submitting precerts (#3660 ) * Randomize order of CT logs when submitting precerts so we maximize the chances we actually exercise all of the logs in a group and not just the first in the list. * Add metrics for winning logs	2018-04-20 15:00:10 -04:00
Jacob Hoffman-Andrews	6ca5c51e8e	Fix newOrder ready status regression, restore ready status. (#3644 ) In #3614 we adjusted the `SA.NewOrder` function to conditionally call `ssa.statusForOrder` on the new order when `features.OrderReadyStatus` was enabled. Unfortunately this call to `ssa.statusForOrder` happened before the `req.BeganProcessing` field was initialized with a pointer to a `false` bool. The `ssa.statusForOrder` function (correctly) assumes that `req.BeganProcessing == nil` is illegal and doesn't correspond to a known status. This results in `NewOrder` requests returning a 500 error of the form: > Internal error - Error creating new order - Order XXX is in an invalid state. No state known for this order's authorizations Our integration tests missed this because we didn't have a test case that issued for a set of names with one account, and then issued again for the same set of names with the same account. This PR fixes the original bug by moving the `BeganProcessing` initialization before the call to `statusForOrder`. This PR also adds an integration test to catch this sort of bug again in the future. Prior to the SA fix this test failed with the 500 server internal error observed by the Certbot team. With the SA fix in place the test passes again. Finally, this PR disables the `OrderReadyStatus` feature flag in `test/config-next/sa.json`. Certbot's ACME implementation breaks when this flag is enabled (See https://github.com/certbot/certbot/issues/5856). Since Certbot runs integration tests against Boulder with config-next we should be courteous and leave this flag disabled until we are closer to being able to turn it on for staging/prod.	2018-04-12 13:55:46 -07:00
Daniel McCarney	74d5decc67	Remove `TotalCertificates` rate limit. (#3638 ) The `TotalCertificates` rate limit serves to ensure we don't accidentally exceed our OCSP signing capacity by issuing too many certificates within a fixed period. In practice this rate limit has been fragile and the associated queries have been linked to performance problems. Since we now have better means of monitoring our OCSP signing capacity this commit removes the rate limit and associated code.	2018-04-12 13:25:47 -07:00
Daniel	ac6672bc71	Revert "Revert "V2: implement "ready" status for Order objects (#3614 )" (#3643 )" This reverts commit `3ecf841a3a`.	2018-04-12 13:20:47 -04:00
Jacob Hoffman-Andrews	3ecf841a3a	Revert "V2: implement "ready" status for Order objects (#3614 )" (#3643 ) This reverts commit `1d22f47fa2`. According to https://github.com/letsencrypt/boulder/pull/3614#issuecomment-380615172, this broke Certbot's tests. We'll investigate, and then roll forward once we understand what broke.	2018-04-12 10:46:57 -04:00
Daniel McCarney	1d22f47fa2	V2: implement "ready" status for Order objects (#3614 ) * SA: Add Order "Ready" status, feature flag. This commit adds the new "Ready" status to `core/objects.go` and updates `sa.statusForOrder` to use it conditionally for orders with all valid authorizations that haven't been finalized yet. This state is used conditionally based on the `features.OrderReadyStatus` feature flag since it will likely break some existing clients that expect status "Processing" for this state. The SA unit test for `statusForOrder` is updated with a "ready" status test case. * RA: Enforce order ready status conditionally. This commit updates the RA to conditionally expect orders that are being finalized to be in the "ready" status instead of "pending". This is conditionally enforced based on the `OrderReadyStatus` feature flag. Along the way the SA was changed to calculate the order status for the order returned in `sa.NewOrder` dynamically now that it could be something other than "pending". * WFE2: Conditionally enforce order ready status for finalization. Similar to the RA the WFE2 should conditionally enforce that an order's status is either "ready" or "pending" based on the "OrderReadyStatus" feature flag. * Integration: Fix `test_order_finalize_early`. This commit updates the V2 `test_order_finalize_early` test for the "ready" status. A nice side-effect of the ready state change is that we no longer invalidate an order when it is finalized too soon because we can reject the finalization in the WFE. Subsequently the `test_order_finalize_early` testcase is also smaller. * Integration: Test classic behaviour w/o feature flag. In the previous commit I fixed the integration test for the `config/test-next` run that has the `OrderReadyStatus` feature flag set but broke it for the `config/test` run without the feature flag. This commit updates the `test_order_finalize_early` test to work correctly based on the feature flag status in both cases.	2018-04-11 10:31:25 -07:00
Daniel McCarney	fa5c917665	RA: Don't lose CA error types when prefixing err msg. (#3633 ) Previously we updated the RA's issueCertificateInner function to prefix errors returned from the CA with meaningful information about which CA RPC caused the failure. Unfortunately by using fmt.Errorf to do this we're discarding the underlying error type. This can cause unexpected server internal errors downstream if (for e.g.) the CA rejects a CSR with a malformed error (see #3632). This PR updates the issueCertificateInner error message prefixing to maintain the error type if the underlying error is a berrors.BoulderError. A RA unit test with several mock CAs is added to test the prefixing occurs as expected without loss of error type. This PR also adds an integration test that ensures we reject a CSR with >100 names with a malformed error. This is not strictly related to this PR but since I wrote it while debugging the root issue I thought I'd include it. To allow this test to pass the pendingAuthorizationsPerAccount in test/rate-limit-policies.yml and associated tests had to be adjusted. Resolves #3632	2018-04-10 11:28:03 -07:00
Daniel McCarney	299e53b237	RA,CA: Refuse to start with MaxNames == 0. (#3634 ) This commit updates the `boulder-ra` and `boulder-ca` commands to refuse to start if their configured `MaxNames` is 0 (the default value). This should always be set to a positive number. This commit also updates `csr/csr.go` to always apply the max names check since it will never be 0 after the change above. Also refactor `FailOnError` to pull out a separate `Fail` function. Related to https://github.com/letsencrypt/boulder/issues/3632	2018-04-10 10:53:23 -07:00
Daniel McCarney	8a03a6848e	RA: Log Authz ID and solved-by Chal type at Issuance (#3601 ) This PR updates the RA such that certificateRequestEvent objects created during issuance and written to the audit log as JSON also include a new Authorizations field. This field is a map of the form map[string]certificateRequestAuthz and can be used to map from an identifier name appearing in the associated certificate to a certificateRequestAuthz object. Each of the certificateRequestAuthz objects holds an authorization ID and the type of challenge that made the authorization valid. Example Audit log output (with the JSON pulled out and pretty-printed): { "ID":"0BjPk94KlxExRRIQ3kslRVSJ68KMaTh416chRvq0wyA", "Requester":666, "SerialNumber":"ff699d91cab5bc84f1bc97fc71e4e27abc1a", "VerifiedFields":["subject.commonName","subjectAltName"], "CommonName":"rand.44634cbf.xyz", "Names":["rand.44634cbf.xyz"], "NotBefore":"2018-03-28T19:50:07Z", "NotAfter":"2018-06-26T19:50:07Z", "RequestTime":"2018-03-28T20:50:07.234038859Z", "ResponseTime":"2018-03-28T20:50:07.278848954Z", "Authorizations": { "rand.44634cbf.xyz" : { "ID":"jGt37Rnvfr0nhZn-wLkxrQxc2HBfV4t6TSraRiGnNBM", "ChallengeType":"http-01" } } } Resolves #3253	2018-04-04 20:59:38 +01:00
Daniel McCarney	21a17f0baf	Harmonize order expiry with assoc. authz expiry. (#3567 ) Prior to this commit an order's expiry was set based on ra.orderLiftime while pending and valid authorization expiry was set based on ra.pendingAuthorizationLifetime and ra.authorizationLifetime. Since orders reused existing valid/pending authorizations this can lead to a case where an order has an expiry beyond the associated authorization expiries. In this case when an authorization expired the order becomes inactionable and the extra order lifetime is not useful. This commit addresses this problem in two ways: 1. The SA GetAuthorizations function used to find authzs to reuse for ra.NewOrder is adjusted to only return authorizations at min 24hr away from expiry. 2. Order expiry is now calculated by the RA in newOrder as the min of the order's own expiry or the soonest authorization expiry. This properly reflects the order's true lifetime based on the authorization lifetime. The RA/SA unit tests are updated accordingly. Resolves #3498	2018-03-16 20:42:21 +00:00
Jacob Hoffman-Andrews	0a517aa9c0	Remove config-next gating on v2 and wildcard features (#3563 ) Also, move the last of the v2 migrations from db-next into db.	2018-03-15 13:14:25 -07:00
Jacob Hoffman-Andrews	4a961c3bc8	Ungate config-next for wfe2 and Wildcards.	2018-03-14 13:18:37 -07:00
Jacob Hoffman-Andrews	700604dda1	Overlapping wildcard errors are 400, not 500. (#3561 ) Return a malformed error for these requests. Also add an integration test. Fixes #3558	2018-03-14 13:18:25 -07:00
Roland Bracewell Shoemaker	9c9e944759	Add SCT embedding (#3521 ) Adds SCT embedding to the certificate issuance flow. When a issuance is requested a precertificate (the requested certificate but poisoned with the critical CT extension) is issued and submitted to the required CT logs. Once the SCTs for the precertificate have been collected a new certificate is issued with the poison extension replace with a SCT list extension containing the retrieved SCTs. Fixes #2244, fixes #3492 and fixes #3429.	2018-03-12 11:58:30 -07:00
Jacob Hoffman-Andrews	c621cbd33f	Reject overlaps with wildcards. (#3542 ) Requesting a certificate with "*.example.com" and "www.example.com" as separate SANs doesn't make sense, because "www.example.com" is covered by the wildcard. #3524	2018-03-10 06:49:36 +00:00
Daniel McCarney	49d55d9ab5	Make POSTing KeyAuthorization optional, V2 don't echo it. (#3526 ) This commit updates the RA to make the notion of submitting a KeyAuthorization value as part of the ra.UpdateAuthorization call optional. If set, the value is enforced against expected and an error is returned if the provided authorization isn't correct. If it isn't set the RA populates the field with the computed authorization for the VA to enforce against the value it sees in challenges. This retains the legacy behaviour of the V1 API. The V2 API will never unmarshal a provided key authorization. The ACMEv2/WFEv2 prepChallengeForDisplay function is updated to strip the ProvidedKeyAuthorization field before sending the challenge object back to a client. ACMEv1/WFEv1 continue to return the KeyAuthorization in challenges to avoid breaking clients that are relying on this legacy behaviour. For deployability ease this commit retains the name of the core.Challenge.ProvidedKeyAuthorization field even though it should be called core.Challenge.ComputedKeyAuthorization now that it isn't set based on the client's provided key authz. This will be easier as a follow-up change. Resolves #3514	2018-03-06 20:33:01 +00:00
Daniel McCarney	531d9ce52c	Run load-generator against V1 and V2 API in CI. (#3509 ) This commit adds short 15s runs of the load generator against the V1 and V2 APIs during the three integration test runs (v1 config, v1 config-next, and v2). 15s was selected because 30s caused too much output and the build log to be truncated. Presently the latency output is not being checked for errors. This was too flaky in practice. A fix for a race condition in the load-generator code itself related to HTTP status code tracking is included in this commit. The pending authz rate limit also needed to be adjusted to keep the load-generator from failing requests after hitting 429s.	2018-03-05 15:34:15 -05:00
Jacob Hoffman-Andrews	2d1d895da1	Copy authz.challenges to fix data race. (#3520 ) This race was uncovered by running the load generator as part of our CI. Also, update ra_test.go. It was previously testing that the returned authz and the stored authz should be identical, which is not actually a property of UpdateAuthorization; in general, they will not be identical.	2018-03-03 09:19:56 -08:00
Daniel McCarney	f2d3ad6d52	Enforce new orders per acct per window rate limit. (#3501 ) Previously we introduced the concept of a "pending orders per account ID" rate limit. After struggling with making an implementation of this rate limit perform well we reevaluated the problem and decided a "new orders per account per time window" rate limit would be a better fit for ACMEv2 overall. This commit introduces the new newOrdersPerAccount rate limit. The RA now checks this before creating new pending orders in ra.NewOrder. It does so after order reuse takes place ensuring the rate limit is only applied in cases when a distinct new pending order row would be created. To accomplish this a migration for a new orders field (created) and an index over created and registrationID is added. It would be possible to use the existing expires field for this like we've done in the past, but that was primarily to avoid running a migration on a large table in prod. Since we don't have that problem yet for V2 tables we can Do The Right Thing and add a column. For deployability the deprecated pendingOrdersPerAccount code & SA gRPC bits are left around. A follow-up PR will be needed to remove those (#3502). Resolves #3410	2018-03-02 10:47:39 -08:00
Roland Bracewell Shoemaker	0b53063a72	ctpolicy: Add informational logs and don't cancel remaining submissions (#3472 ) Add a set of logs which will be submitted to but not relied on for their SCTs, this allows us to test submissions to a particular log or submit to a log which is not yet approved by a browser/root program. Also add a feature which stops cancellations of remaining submissions when racing to get a SCT from a group of logs. Additionally add an informational log that always times out in config-next. Fixes #3464 and fixes #3465.	2018-02-23 21:51:50 -05:00
Roland Bracewell Shoemaker	9bd8dc5d0f	Rever to using ParseAddress instead of ParseAddressList (#3475 ) Fixes #1558.	2018-02-23 21:48:31 -05:00
Jacob Hoffman-Andrews	92c9340fe8	Deprecate CountCertificatesExact. (#3462 ) This is now enabled in prod and we can make it the default.	2018-02-20 14:34:03 -08:00
Roland Bracewell Shoemaker	8446571b46	Remove EnforceChallengeDisable (#3444 ) Removes usage of the `EnforceChallengeDisable` feature, the feature itself is not removed as it is still configured in staging/production, once that is fixed I'll submit another PR removing the actual flag. This keeps the behavior that when authorizations are retrieved from the SA they have their challenges populated, because that seems to make the most sense to me? It also retains TLS re-validation. Fixes #3441.	2018-02-14 13:21:26 -08:00
Daniel McCarney	f3d2dc50d9	Fix RA V2 wildcard authz reuse safety check. (#3434 ) Prior to this commit a logical error in the RA's `NewOrder` caused a safety check that prevents authorization reuse with a non-wildcard authz for a wildcard name to not work. This commit adds a test for the condition that the safety check is designed for and fixes the logical error. Prior to fixing the logical error the test fails. With the corrected safety check the test passes.	2018-02-08 15:35:11 -08:00
Roland Bracewell Shoemaker	9e23edf850	Use ctpolicy package in RA (#3422 ) And collect the metrics on success/failure rates. Built on top of #3414. Fixes #3413.	2018-02-08 13:33:42 -08:00
Daniel McCarney	4ac109ac25	Do not reuse legacy authzs in V2 new-order. (#3432 ) Prior to this commit when building up the authorizations for a new-order request we looked for any unexpired pending/valid authorizations owned by the account and used them for the order. This allows a client to use the V1 new-authz endpoint in combination with the V2 new-order endpoint and we do not want to support this behaviour. All V2 authorizations should be sourced from other V2 orders. This commit implements a new parameter for the SA's getAuthorizations function that allows filtering out legacy V1 authorizations by doing a JOIN on the order to authorizations join table. Resolves #3328	2018-02-08 12:31:04 -08:00
Daniel McCarney	d7bfb542c0	Handle order finalization errors. (#3404 ) This commit resolves the case where an error during finalization occurs. Prior to this commit if an error (expected or otherwise) occurred after setting an order to status processing at the start of order finalization the order would be stuck processing forever. The SA now has a `SetOrderError` RPC that can be used by the RA to persist an error onto an order. The order status calculation can use this error to decide if the order is invalid. The WFE is updated to write the error to the order JSON when displaying the order information. Prior to this commit the order protobuf had the error field as a `[]byte`. It doesn't seem like this is the right decision, we have a specific protobuf type for ProblemDetails and so this commit switches the error field to use it. The conversion to/from `[]byte` is done with the model by the SA. An integration test is included that prior to this commit left an order in a stuck processing state. With this commit the integration test passes as expected. Resolves https://github.com/letsencrypt/boulder/issues/3403	2018-02-07 16:34:07 -05:00
Daniel McCarney	eea049da40	Fix order reuse, calc order status by authz status (#3402 ) This PR is a rework of what was originally https://github.com/letsencrypt/boulder/pull/3382, integrating the design feedback proposed by @jsha: https://github.com/letsencrypt/boulder/pull/3382#issuecomment-359912549 This PR removes the stored Order status field and replaces it with a value that is calculated on-the-fly by the SA when fetching an order, based on the order's associated authorizations. In summary (and order of precedence): * If any of the order's authorizations are invalid, the order is invalid. * If any of the order's authorizations are deactivated, the order is deactivated. * If any of the order's authorizations are pending, the order is pending. * If all of the order's authorizations are valid, and there is a certificate serial, the order is valid. * If all of the order's authorizations are valid, and we have began processing, but there is no certificate serial, the order is processing. * If all of the order's authorizations are valid, and we haven't processing, then the order is pending waiting a finalization request. This avoids having to explicitly update the order status when an associated authorization changes status. The RA's implementation of new-order is updated to only reuse an existing order if the calculated status is pending. This avoids giving back invalid or deactivated orders to clients. Resolves #3333	2018-02-01 16:33:42 -05:00
Roland Bracewell Shoemaker	2adf5a54ab	Move CN to SAN in v2 API (#3394 ) Fixes #3368. Basically just adds a `csr.VerifyCSR` call in `ra.FinalizeOrder` that mirrors what we have in `ra.NewCertificate`, this moves the CN to SAN as expected if included.	2018-01-29 13:21:12 -08:00
Roland Bracewell Shoemaker	cdab3a2ef8	Improve wildcard error (#3398 )	2018-01-29 10:49:31 -08:00
Roland Bracewell Shoemaker	3e33d56d03	Remove test/config-next gating from unittests (#3395 ) The migrations are all applied and the tests can run unconditionally.	2018-01-25 09:08:07 -05:00
Roland Bracewell Shoemaker	7e4d44e172	Don't mask sa.GetValidAuthorization error in ra.NewAuthorization (#3381 )	2018-01-18 15:53:14 -05:00
Jacob Hoffman-Andrews	cfc7823cdd	Remove EnforceChallengeDisable check at issuance. (#3362 ) Per https://community.letsencrypt.org/t/2018-01-11-update-regarding-acme-tls-sni-and-shared-hosting-infrastructure/50188/3, we are planning to treat prior issuance by an account as reason to whitelist that account for reissuance via TLS-SNI. By extension, reusing validations that occurred prior to disclosure of the TLS-SNI issue is reasonably safe, so this change removes the issuance-time check for whether a challenge has been disabled. This saves us significant complexity and database load in implementing TLSSNIRevalidation (https://github.com/letsencrypt/boulder/pull/3361), since ChallengeTypeEnabled returns false, so we'd have to plumb through data about whether an issuance was based on a revalidation. Instead, we can safely delete this code. Note that "EnforceChallengeDisable" is implemented in three places: new-authz, validation time, and issuance time. We're keeping it in place at new-authz for now because it's intertwined with the account whitelisting code. We're keeping it in place at validation time, because there's a small chance that someone could have created a pending authz for a domain they don't control before the TLS-SNI issue was announced, and that authz could still be pending, and they could find out that that domain is hosted on a vulnerable provider, and use the vulnerability now that they know about it. A tiny chance, but may as well be careful.	2018-01-12 11:35:23 -08:00
Jacob Hoffman-Andrews	8153b919be	Implement TLSSNIRevalidation (#3361 ) This change adds a feature flag, TLSSNIRevalidation. When it is enabled, Boulder will create new authorization objects with TLS-SNI challenges if the requesting account has issued a certificate with the relevant domain name, and was the most recent account to do so. This setting overrides the configured list of challenges in the PolicyAuthority, so even if TLS-SNI is disabled in general, it will be enabled for revalidation. Note that this interacts with EnforceChallengeDisable. Because EnforceChallengeDisable causes additional checked at validation time and at issuance time, we need to update those two places as well. We'll send a follow-up PR with that. We chose to make this work only for the most recent account to issue, even if there were overlapping certificates, because it significantly simplifies the database access patterns and should work for 95+% of cases. Note that this change will let an account revalidate and reissue for a domain even if the previous issuance on that account used http-01 or dns-01. This also simplifies implementation, and fits within the intent of the mitigation plan: If someone previously issued for a domain using http-01, we have high confidence that they are actually the owner, and they are not going to "steal" the domain from themselves using tls-sni-01. Also note: This change also doesn't work properly with ReusePendingAuthz: true. Specifically, if you attempted issuance in the last couple days and failed because there was no tls-sni challenge, you'll still have an http-01 challenge lying around, and we'll reuse that; then your client will fail due to lack of tls-sni challenge again. This change was joint work between @rolandshoemaker and @jsha.	2018-01-12 11:00:06 -08:00
Maciej Dębski	44984cd84a	Implement regID whitelist for allowed challenge types. (#3352 ) This updates the PA component to allow authorization challenge types that are globally disabled if the account ID owning the authorization is on a configured whitelist for that challenge type.	2018-01-10 13:44:53 -05:00
Roland Shoemaker	400ffede3d	More fixes	2018-01-09 20:48:16 -08:00
Roland Shoemaker	1a3a76438c	Fix tests and GetOrderAuthorizations	2018-01-09 20:38:52 -08:00
Roland Shoemaker	dcd2b438f4	Fix previous impl, add valid authz reuse fix and existing authz validation fix	2018-01-09 19:53:48 -08:00
Roland Shoemaker	5ca646c5dd	Disallow the use of valid authorizations that used currently disabled challenges for issuance	2018-01-09 18:52:29 -08:00
Daniel McCarney	7bb16ff21e	ACMEv2: Add pending order reuse (#3290 ) This commit adds pending order reuse. Subsequent to this commit multiple add-order requests from the same account ID for the same set of order names will result in only one order being created. Orders are only reused while they are not expired. Finalized orders will not be reused for subsequent new-order requests allowing for duplicate order issuance. Note that this is a second level of reuse, building on the pending authorization reuse that's done between separate orders already. To efficiently find an appropriate order ID given a set of names, a registration ID, and the current time a new orderFqdnSets table is added with appropriate indexes and foreign keys. Resolves #3258	2018-01-02 13:27:16 -08:00
Jacob Hoffman-Andrews	e60251d8ca	Add documentation link to rate limit errors. (#3286 ) Resolves #2951	2017-12-19 15:46:18 -08:00
Daniel McCarney	de5fbbdb67	Implement CAA issueWild enforcement for wildcard names (#3266 ) This commit implements RFC 6844's description of the "CAA issuewild property" for CAA records. We check CAA in two places: at the time of validation, and at the time of issuance when an authorization is more than 8hours old. Both locations have been updated to properly enforce issuewild when checking CAA for a domain corresponding to a wildcard name in a certificate order. Resolves https://github.com/letsencrypt/boulder/issues/3211	2017-12-13 12:09:33 -05:00
Daniel McCarney	0684d5fc73	Add pending orders rate limit to new-order. (#3257 ) This commit adds a new rate limit to restrict the number of outstanding pending orders per account. If the threshold for this rate limit is crossed subsequent new-order requests will return a 429 response. Note: Since this the rate limit object itself defines an `Enabled()` test based on whether or not it has been configured there is not a feature flag for this change. Resolves https://github.com/letsencrypt/boulder/issues/3246	2017-12-04 16:36:48 -05:00
Daniel McCarney	1c99f91733	Policy based issuance for wildcard identifiers (Round two) (#3252 ) This PR implements issuance for wildcard names in the V2 order flow. By policy, pending authorizations for wildcard names only receive a DNS-01 challenge for the base domain. We do not re-use authorizations for the base domain that do not come from a previous wildcard issuance (e.g. a normal authorization for example.com turned valid by way of a DNS-01 challenge will not be reused for a .example.com order). The wildcard prefix is stripped off of the authorization identifier value in two places: When presenting the authorization to the user - ACME forbids having a wildcard character in an authorization identifier. When performing validation - We validate the base domain name without the . prefix. This PR is largely a rewrite/extension of #3231. Instead of using a pseudo-challenge-type (DNS-01-Wildcard) to indicate an authorization & identifier correspond to the base name of a wildcard order name we instead allow the identifier to take the wildcard order name with the *. prefix.	2017-12-04 12:18:10 -08:00
Daniel McCarney	171da33513	Don't 500 on missing pendingAuthz (#3248 ) As described in #3201 a concurrent challenge POST would result in 500 errors if the pending authz row was deleted by a promotion to the authz table underneath another request. This PR adjusts the SA & RA so that if a pending authz is promoted to a final authz between updates a not found error will be returned instead of a server internal error. Resolves https://github.com/letsencrypt/boulder/issues/3201	2017-11-22 15:48:42 -05:00
Daniel McCarney	2f263f8ed5	ACME v2 Finalize order support (#3169 ) This PR implements order finalization for the ACME v2 API. In broad strokes this means: * Removing the CSR from order objects & the new-order flow * Adding identifiers to the order object & new-order * Providing a finalization URL as part of orders returned by new-order * Adding support to the WFE's Order endpoint to receive finalization POST requests with a CSR * Updating the RA to accept finalization requests and to ensure orders are fully validated before issuance can proceed * Updating the SA to allow finding order authorizations & updating orders. * Updating the CA to accept an Order ID to log when issuing a certificate corresponding to an order object Resolves #3123	2017-11-01 12:39:44 -07:00
JP Phillips	e83480f86b	Return accurate error description for invalid authz limit (#3156 ) Fixes #3144	2017-10-11 22:21:51 -07:00
Daniel McCarney	1794c56eb8	Revert "Add CAA parameter to restrict challenge type (#3003 )" (#3145 ) This reverts commit `23e2c4a836`.	2017-10-04 12:00:44 -07:00
Jacob Hoffman-Andrews	dbfb48226d	Add parallelism to SA CountCertificatesByNames. (#3133 ) Since we can make up to 100 SQL queries from this method (based on the 100-SAN limit), sometimes it is too slow and we get a timeout for large certificates. By running some of those queries in parallel, we can speed things up and stop getting timeouts.	2017-10-02 15:45:08 -04:00
lukaslihotzki	23e2c4a836	Add CAA parameter to restrict challenge type (#3003 ) This commit adds CAA `issue` paramter parsing and the `challenge` parameter to permit a single challenge type only. By setting `challenge=dns-01`, the nameserver keeps control over every issued certificate.	2017-10-02 11:59:47 -07:00
Roland Bracewell Shoemaker	b7bca87134	Batch fetching of existing authorizations and creation of pending authorizations (#3058 ) For the new-order endpoint only. This does some refactoring of the order of operations in `ra.NewAuthorization` as well in order to reduce the duplication of code relating to creating pending authorizations, existing tests still seem to work as intended... A close eye should be given to this since we don't have integration tests yet that test it end to end. This also changes the inner type of `grpc.StorageAuthorityServerWrapper` to `core.StorageAuthority` so that we can avoid a circular import that is created by needing to import `grpc.AuthzToPB` and `grpc.PBToAuthz` in `sa/sa.go`. This is a big change but should considerably improve the performance of the new-order flow. Fixes #2955.	2017-09-25 09:10:59 -07:00
Daniel McCarney	f0a9e9aa0e	Fix duplicate cert limit off-by-one. (#3117 ) The RA's `checkCertificatesPerFQDNSetLimit` function was using `>` where it should have been using `>=` when evaluating a FQDNSet count against the rate limit threshold. This resulted in an off by one error where we allowed 1 more duplicate certificate than intended. This commit fixes the off-by-one error and adds a short unit test. The unit test failed the `TestCheckExactCertificateLimit/FQDN_set_issuances_equal_to_limit` subtest before the fix was applied and passes afterwards.	2017-09-22 14:45:48 -07:00
Kleber Correia	72b1c69761	Remove RecheckCAA feature flag (#3103 ) Updates #2692.	2017-09-18 11:59:58 -07:00
Jacob Hoffman-Andrews	9ab2ff4e03	Add CAA-specific error. (#3051 ) Previously, CAA problems were lumped in under "ConnectionProblem" or "Unauthorized". This should make things clearer and easier to differentiate. Fixes #3043	2017-09-14 14:11:41 -07:00
Jacob Hoffman-Andrews	3b6a5ff63c	Fix a bad error return in RA (#3061 ) RA.NewRegistration checked for an error return from SA.NewRegistration, but failed to properly propagate the error. It was just setting the err variable and falling through to the successful return case. This fixes that issue and adds a unittest.	2017-09-08 13:15:09 -07:00
Daniel McCarney	d18e1dbcff	Add WrongAuthorizationState error code for UpdateAuthorization (#3053 ) This commit adds a new boulder error type WrongAuthorizationState. This error type is returned by the SA when UpdateAuthorization is provided an authz that isn't pending. The RA and WFE are updated accordingly such that this error percolates back through the API to the user as a Malformed problem with a sensible description. Previously this behaviour resulted in a ServerInternal error. Resolves #3032	2017-09-07 11:22:02 -07:00
Kleber Correia	02864c11bf	Remove AllowAccountDeactivation flag (#2927 ) Part of #2712	2017-09-06 11:11:40 -07:00
Kleber Correia	710c814720	Remove AllowKeyRollover flag (#3037 )	2017-09-06 08:43:15 -04:00
Jacob Hoffman-Andrews	45f42f6583	Switch recheckCAA error to Unauthorized. (#3040 ) ConnectionFailure is only used during validation, and so isn't handled by WFE's problemDetailsFromBoulderError. This led to returning ServerInternal instead of the intended error code, and hiding the error detail. Unauthorized is probably a better error type for now anyhow, but long-term we should switch to a specific CAA error type. This PR will allow clients to see the detailed list of problem domains when new-cert returns an error due to CAA rechecking.	2017-09-05 10:53:47 -07:00
Jacob Hoffman-Andrews	b0c7bc1bee	Recheck CAA for authorizations older than 8 hours (#3014 ) Fixes #2889. VA now implements two gRPC services: VA and CAA. These both run on the same port, but this allows implementation of the IsCAAValid RPC to skip using the gRPC wrappers, and makes it easier to potentially separate the service into its own package in the future. RA.NewCertificate now checks the expiration times of authorizations, and will call out to VA to recheck CAA for those authorizations that were not validated recently enough.	2017-08-28 16:40:57 -07:00
Roland Bracewell Shoemaker	90ba766af9	Add NewOrder RPCs + methods to SA and RA (#2907 ) Fixes #2875, #2900 and #2901.	2017-08-11 14:24:25 -04:00
Jacob Hoffman-Andrews	8bc1db742c	Improve recycling of pending authzs (#2896 ) The existing ReusePendingAuthz implementation had some bugs: It would recycle deactivated authorizations, which then couldn't be fulfilled. (#2840) Since it was implemented in the SA, it wouldn't get called until after the RA checks the Pending Authorizations rate limit. Which means it wouldn't fulfill its intended purpose of making accounts less likely to get stuck in a Pending Authorizations limited state. (#2831) This factors out the reuse functionality, which used to be inside an "if" statement in the SA. Now the SA has an explicit GetPendingAuthorization RPC, which gets called from the RA before calling NewPendingAuthorization. This happens to obsolete #2807, by putting the recycling logic for both valid and pending authorizations in the RA.	2017-07-26 14:00:30 -07:00
Daniel McCarney	2a84bc2495	Replace go-jose v1 with go-jose v2. (#2899 ) This commit replaces the Boulder dependency on gopkg.in/square/go-jose.v1 with gopkg.in/square/go-jose.v2. This is necessary both to stay in front of bitrot and because the ACME v2 work will require a feature from go-jose.v2 for JWS validation. The largest part of this diff is cosmetic changes: Changing import paths jose.JsonWebKey -> jose.JSONWebKey jose.JsonWebSignature -> jose.JSONWebSignature jose.JoseHeader -> jose.Header Some more significant changes were caused by updates in the API for for creating new jose.Signer instances. Previously we constructed these with jose.NewSigner(algorithm, key). Now these are created with jose.NewSigner(jose.SigningKey{},jose.SignerOptions{}). At present all signers specify EmbedJWK: true but this will likely change with follow-up ACME V2 work. Another change was the removal of the jose.LoadPrivateKey function that the wfe tests relied on. The jose v2 API removed these functions, moving them to a cmd's main package where we can't easily import them. This function was reimplemented in the WFE's test code & updated to fail fast rather than return errors. Per CONTRIBUTING.md I have verified the go-jose.v2 tests at the imported commit pass: ok gopkg.in/square/go-jose.v2 14.771s ok gopkg.in/square/go-jose.v2/cipher 0.025s ? gopkg.in/square/go-jose.v2/jose-util [no test files] ok gopkg.in/square/go-jose.v2/json 1.230s ok gopkg.in/square/go-jose.v2/jwt 0.073s Resolves #2880	2017-07-26 10:55:14 -07:00
Jacob Hoffman-Andrews	ef81f13f7b	Remove "DONE" logging in unittest. (#2895 ) This is not necessary because the framework prints a done message for us.	2017-07-25 13:37:58 -04:00
Roland Bracewell Shoemaker	05d869b005	Rename DNSResolver -> DNSClient (#2878 ) Fixes #639. This resolves something that has bugged me for two+ years, our DNSResolverImpl is not a DNS resolver, it is a DNS client. This change just makes that obvious.	2017-07-18 08:37:45 -04:00
Jacob Hoffman-Andrews	d47d3c5066	Recycle pending authorizations (#2797 ) If the feature flag "ReusePendingAuthz" is enabled, a request to create a new authorization object from an account that already has a pending authorization object for the same identifier will return the already-existing authorization object. This should make it less common for people to get stuck in the "too many pending authorizations" state, and reduce DB storage growth. Fixes #2768	2017-06-19 13:35:36 -04:00
Daniel McCarney	fbd87b1757	Splits CountRegistrationsByIP to exact-match and by /48. (#2782 ) Prior to this PR the SA's `CountRegistrationsByIP` treated IPv6 differently than IPv4 by counting registrations within a /48 for IPv6 as opposed to exact matches for IPv4. This PR updates `CountRegistrationsByIP` to treat IPv4 and IPv6 the same, always matching exactly. The existing RegistrationsPerIP rate limit policy will be applied against this exact matching count. A new `CountRegistrationsByIPRange` function is added to the SA that performs the historic matching process, e.g. for IPv4 it counts exactly the same as `CountRegistrationsByIP`, but for IPv6 it counts within a /48. A new `RegistrationsPerIPRange` rate limit policy is added to allow configuring the threshold/window for the fuzzy /48 matching registration limit. Stats for the "Exceeded" and "Pass" events for this rate limit are separated into a separate `RegistrationsByIPRange` stats scope under the `RateLimit` scope to allow us to track it separate from the exact registrations per IP rate limit. Resolves https://github.com/letsencrypt/boulder/issues/2738	2017-05-30 15:12:20 -07:00
Daniel McCarney	7393db6d59	Fixes RPC bug for CountCertificatesExact feature flag. (#2759 ) With the CountCertificatesExact feature flag enabled if the RA's checkCertificatesPerNameLimit was called with names only containing domains exactly matching a public suffix entry then the legacy ra.enforceNameCounts function will be called with an empty tldNames argument. This in turn will cause the RA->SA RPC to fail with an "incomplete gRPC request message error". This commit fixes this bug by only calling ra.enforceNameCounts when len(tldNames) > 0. Resolves #2758	2017-05-12 15:21:00 -04:00
Daniel McCarney	b9369a4814	Uses `UniqueLowerNames` for domain/suffix rl funcs. (#2725 ) Both `ra.domainsForRateLimiting` and `ra.suffixesForRateLimiting` were doing their own "unique"ing with a `map[string]struct{}` when they could have used `core.UniqueLowerNames`. This commit updates them both to do so and adjusts the unit tests to expect the new sorted order return.	2017-05-04 12:37:05 +01:00
Daniel McCarney	1ed34a4a5d	Fixes cert count rate limit for exact PSL matches. (#2703 ) Prior to this PR if a domain was an exact match to a public suffix list entry the certificates per name rate limit was applied based on the count of certificates issued for that exact name and all of its subdomains. This PR introduces an exception such that exact public suffix matches correctly have the certificate per name rate limit applied based on only exact name matches. In order to accomplish this a new RPC is added to the SA `CountCertificatesByExactNames`. This operates similar to the existing `CountCertificatesByNames` but does not include subdomains in the count, only exact matches to the names provided. The usage of this new RPC is feature flag gated behind the "CountCertificatesExact" feature flag. The RA unit tests are updated to test the new code paths both with and without the feature flag enabled. Resolves #2681	2017-05-02 13:43:35 -07:00
David Calavera	cc5ee3906b	Refactor IsSane and IsSane* to return useful errors. (#2685 ) This change changes the returning values from boolean to error. It makes `checkConsistency` an internal function and removes the optional argument in favor of making checks explicit where they are used. It also renames those functions to CheckConsistency* to not give the impression of still returning boolean values. Signed-off-by: David Calavera <david.calavera@gmail.com>	2017-04-19 12:08:47 -04:00
Roland Bracewell Shoemaker	fd561ef842	Block issuance on first OCSP response generation (#2633 ) Generate first OCSP response in ca.IssueCertificate instead of ocsp-updater.newCertificateTick if features.GenerateOCSPEarly is enabled. Adds a new field to the sa.AddCertiifcate RPC for the OCSP response and only adds it to the certificate status + sets ocspLastUpdated if it is a non-empty slice. ocsp-updater.newCertificateTick stays the same so we can catch certificates that were successfully signed + stored but a OCSP response couldn't be generated (for whatever reason). Fixes #2477.	2017-04-04 11:28:09 -07:00
Roland Bracewell Shoemaker	e2b2511898	Overhaul internal error usage (#2583 ) This patch removes all usages of the `core.XXXError` and almost all usages of `probs` outside of the WFE and VA and replaces them with a unified internal error type. Since the VA uses `probs.ProblemDetails` quite extensively in challenges, and currently stores them in the DB I've saved this change for another change (it'll also require a migration). Since `ProblemDetails` should only ever be exposed to end-users all of its related logic should be moved into the `WFE` but since it still needs to be exposed to the VA and SA I've left it in place for now. The new internal `errors` package offers the same convenience functions as `probs` does as well as a new simpler type testing method. A few small changes have also been made to error messages, mainly adding the library and function name to internal server errors for easier debugging (i.e. where a number of functions return the exact same errors and there is no other way to distinguish which method threw the error). Also adds proper encoding of internal errors transferred over gRPC (the current encoding scheme is kept for `core` and `probs` errors since it'll be ideally be removed after we deploy this and follow-up changes) using `grpc/metadata` instead of the gRPC status codes. Fixes #2507. Updates #2254 and #2505.	2017-03-22 23:27:31 -07:00
Jacob Hoffman-Andrews	6c93b41f20	Add a limit on failed authorizations (#2513 ) Fixes #976. This implements a new rate limit, InvalidAuthorizationsPerAccount. If a given account fails authorization for a given hostname too many times within the window, subsequent new-authz attempts for that account and hostname will fail early with a rateLimited error. This mitigates the misconfigured clients that constantly retry authorization even though they always fail (e.g., because the hostname no longer resolves). For the new rate limit, I added a new SA RPC, CountInvalidAuthorizations. I chose to implement this only in gRPC, not in AMQP-RPC, so checking the rate limit is gated on gRPC. See #2406 for some description of the how and why. I also chose to directly use the gRPC interfaces rather than wrapping them in core.StorageAuthority, as a step towards what we will want to do once we've moved fully to gRPC. Because authorizations don't have a created time, we need to look at the expires time instead. Invalid authorizations retain the expiration they were given when they were created as pending authorizations, so we use now + pendingAuthorizationLifetime as one side of the window for rate limiting, and look backwards from there. Note that this means you could maliciously bypass this rate limit by stacking up pending authorizations over time, then failing them all at once. Similarly, since this limit is by (account, hostname) rather than just (hostname), you can bypass it by creating multiple accounts. It would be more natural and robust to limit by hostname, like our certificate limits. However, we currently only have two indexes on the authz table: the primary key, and (`registrationID`,`identifier`,`status`,`expires`) Since this limit is intended mainly to combat misconfigured clients, I think this is sufficient for now. Corresponding PR for website: letsencrypt/website#125	2017-01-23 11:22:51 -08:00
Josh Soref	8adf9d41cf	Spelling (#2500 ) Various spelling fixes.	2017-01-16 10:44:52 -05:00
Daniel	5f7ec2217e	Spelling fixes	2016-11-30 13:17:56 -05:00
Roland Bracewell Shoemaker	c5f99453a9	Switch CT submission RPC from CA -> RA (#2304 ) With the current gRPC design the CA talks directly to the Publisher when calling SubmitToCT which crosses security bounadries (secure internal segment -> internet facing segment) which is dangerous if (however unlikely) the Publisher is compromised and there is a gRPC exploit that allows memory corruption on the caller end of a RPC which could expose sensitive information or cause arbitrary issuance. Instead we move the RPC call to the RA which is in a less sensitive network segment. Switching the call site from the CA -> RA is gated on adding the gRPC PublisherService object to the RA config. Fixes #2202.	2016-11-08 11:39:02 -08:00
Daniel McCarney	a6f2b0fafb	Updates `go-jose` dep to v1.1.0 (#2314 ) This commit updates the `go-jose` dependency to [v1.1.0](https://github.com/square/go-jose/releases/tag/v1.1.0) (Commit: aa2e30fdd1fe9dd3394119af66451ae790d50e0d). Since the import path changed from `github.com/square/...` to `gopkg.in/square/go-jose.v1/` this means removing the old dep and adding the new one. The upstream go-jose library added a `[]x509.Certificate` member to the `JsonWebKey` struct that prevents us from using a direct equality test against two `JsonWebKey` instances. Instead we now must compare the inner `Key` members. The `TestRegistrationContactUpdate` function from `ra_test.go` was updated to populate the `Key` members used in testing instead of only using KeyID's to allow the updated comparisons to work as intended. The `Key` field of the `Registration` object was switched from `jose.JsonWebKey` to `jose.JsonWebKey ` to make it easier to represent a registration w/o a Key versus using a value with a nil `JsonWebKey.Key`. I verified the upstream unit tests pass per contributing.md: ``` daniel@XXXXX:~/go/src/gopkg.in/square/go-jose.v1$ git show commit aa2e30fdd1fe9dd3394119af66451ae790d50e0d Merge: 139276c e18a743 Author: Cedric Staub <cs@squareup.com> Date: Thu Sep 22 17:08:11 2016 -0700 Merge branch 'master' into v1 * master: Better docs explaining embedded JWKs Reject invalid embedded public keys Improve multi-recipient/multi-sig handling daniel@XXXXX:~/go/src/gopkg.in/square/go-jose.v1$ go test ./... ok gopkg.in/square/go-jose.v1 17.599s ok gopkg.in/square/go-jose.v1/cipher 0.007s ? gopkg.in/square/go-jose.v1/jose-util [no test files] ok gopkg.in/square/go-jose.v1/json 1.238s ```	2016-11-08 13:56:50 -05:00

... 2 3 4 5 6 ...

384 Commits