boulder

Commit Graph

Author	SHA1	Message	Date
Aaron Gable	294d1c31d7	Use error wrapping for berrors and tests (#5169 ) This change adds two new test assertion helpers, `AssertErrorIs` and `AssertErrorWraps`. The former is a wrapper around `errors.Is`, and asserts that the error's wrapping chain contains a specific (i.e. singleton) error. The latter is a wrapper around `errors.As`, and asserts that the error's wrapping chain contains any error which is of the given type; it also has the same unwrapping side effect as `errors.As`, which can be useful for further assertions about the contents of the error. It also makes two small changes to our `berrors` package, namely making `berrors.ErrorType` itself an error rather than just an int, and giving `berrors.BoulderError` an `Unwrap()` method which exposes that inner `ErrorType`. This allows us to use the two new helpers above to make assertions about berrors, rather than having to hand-roll equality assertions about their types. Finally, it takes advantage of the two changes above to greatly simplify many of the assertions in our tests, removing conditional checks and replacing them with simple assertions.	2020-11-06 13:17:11 -08:00
Aaron Gable	17e9e7fbb7	SA: Ensure that IssuerID is set when adding precertificates (#5099 ) This change adds `req.IssuerID` to the set of fields that the SA's `AddPrecertificate` method requires be non-zero. As a result, this also updates many tests, both unit and integration, to ensure that they supply a value (usually just 1) for that field. The most complex part of the test changes is a slight refactoring to the orphan-finder code, which makes it easier to reason about the separation between log line parsing and building and sending the request. Based on #5096 Fixes #5097	2020-09-23 16:45:19 -07:00
Jacob Hoffman-Andrews	8dd386b6bc	SA: Update RPC interface to proto3 (#5043 ) One slightly surprising / interesting thing: Since core types like Order and Registration are still proto2 and have pointer fields, there are actually some places in this PR where I had to add a `*` rather than delete an `&`, because I was taking a pointer field from one of those core types and passing it as a field in an SA RPC request. Fixes #5037.	2020-08-25 10:28:41 -07:00
Aaron Gable	82e9e41597	Update CA RPC interface to proto3 (#4983 )	2020-07-31 13:23:55 -07:00
Aaron Gable	7e626b63a6	Temporarily revert CA and VA proto3 migrations (#4962 )	2020-07-16 14:29:42 -07:00
Aaron Gable	24e782e8b4	Update CA RPC interface to proto3 (#4951 ) This updates the ca.proto to use proto3 syntax, and updates all clients of the autogenerated code to use the new types. In particular, it removes indirection from built-in types (proto3 uses ints, rather than pointers to ints, for example). It also updates a few instances where tests were being conducted to see if various object fields were nil to instead check for those fields' new zero-value. Fixes #4940	2020-07-13 18:02:18 -07:00
Jacob Hoffman-Andrews	0b0917cea6	Revert "Remove StoreIssuerInfo flag in CA (#4850 )" (#4868 ) This reverts commit `6454513ded`. We actually need to wait 90 days to ensure the issuerID field of the certificateStatus table is non-nil for all extant certificates.	2020-06-12 12:50:24 -07:00
Jacob Hoffman-Andrews	6454513ded	Remove StoreIssuerInfo flag in CA (#4850 ) As part of that, add support for issuer IDs in orphan-finder's and RA's calls to GenerateOCSP. This factors out the idForIssuer logic from ca/ca.go into a new issuercerts package. orphan-finder refactors: Add a list of issuers in config. Create an orphanFinder struct to hold relevant fields, including the newly added issuers field. Factor out a storeDER function to reduce duplication between the parse-der and parse-ca-log cases. Use test certificates generated specifically for orphan-finder tests. This was necessary because the issuers of these test certificates have to be configured for the orphan finder.	2020-06-09 12:25:13 -07:00
Jacob Hoffman-Andrews	2d7337dcd0	Remove newlines from log messages. (#4777 ) Since Boulder's log system adds checksums to lines, but log-validator processes entries on a per-line basis, including newlines in log messages can cause a validation failure.	2020-04-16 16:49:08 -07:00
Jacob Hoffman-Andrews	bef02e782a	Fix nits found by staticcheck (#4726 ) Part of #4700	2020-03-30 10:20:20 -07:00
Roland Bracewell Shoemaker	5b2f11e07e	Switch away from old style statsd metrics wrappers (#4606 ) In a handful of places I've nuked old stats which are not used in any alerts or dashboards as they either duplicate other stats or don't provide much insight/have never actually been used. If we feel like we need them again in the future it's trivial to add them back. There aren't many dashboards that rely on old statsd style metrics, but a few will need to be updated when this change is deployed. There are also a few cases where prometheus labels have been changed from camel to snake case, dashboards that use these will also need to be updated. As far as I can tell no alerts are impacted by this change. Fixes #4591.	2019-12-18 11:08:25 -05:00
Jacob Hoffman-Andrews	d4168626ad	Fix orphan-finder (#4507 ) This creates the correct type of backend service for the OCSP generator. It also adds an invocation of orphan-finder during the integration tests. This also adds a minor safety check to SA that I hit while writing the test. Without this safety check, passing a certificate with no DNSNames to AddCertificate would result in an obscure MariaDB syntax error without enough context to track it down. In normal circumstances this shouldn't be hit, but it will be good to have a solid error message if we hit it in tests sometime. Also, this tweaks the .travis.yml so it explicitly sets BOULDER_CONFIG_DIR to test/config in the default case. Because the docker-compose run command uses -e BOULDER_CONFIG_DIR="${BOULDER_CONFIG_DIR}", we were setting a blank BOULDER_CONFIG_DIR in default case. Since the Python startservers script sets a default if BOULDER_CONFIG_DIR is not set, we haven't noticed this before. But since this test case relies on the actual environment variable, it became an issue. Fixes #4499	2019-10-25 09:51:14 -07:00
Jacob Hoffman-Andrews	672bdcfdcb	orphan-finder: Rename CAService in config. (#4496 ) OCSPGeneratorService matches the semantics better, and is what ocsp-updater uses. It also matches what's in the config-next. This wasn't caught by integration tests because we don't currently run orphan-finder in the integration tests. We don't have a good way to induce failures in the SA on demand.	2019-10-22 09:25:11 -07:00
Daniel McCarney	7b513de6a5	orphan-finder: adopt orphan precerts. (#4483 ) Since `9906c93` the CA has logged orphan log lines for precertificates as well as certificates. The orphan-finder needs to handle them similar to final certificates. Resolves https://github.com/letsencrypt/boulder/issues/4479	2019-10-17 13:14:57 -07:00
Jacob Hoffman-Andrews	d3b9107059	orphan-finder: add OCSP generation (#4457 ) Fixes #4428	2019-10-07 14:40:36 -04:00
Roland Bracewell Shoemaker	6f93942a04	Consistently used stdlib context package (#4229 )	2019-05-28 14:36:16 -04:00
Joel Sing	8ebdfc60b6	Provide formatting logger functions. (#3699 ) A very large number of the logger calls are of the form log.Function(fmt.Sprintf(...)). Rather than sprinkling fmt.Sprintf at every logger call site, provide formatting versions of the logger functions and call these directly with the format and arguments. While here remove some unnecessary trailing newlines and calls to String/Error.	2018-05-10 11:06:29 -07:00
Daniel McCarney	aa810a3142	gRPC: publish RPC latency stat in server interceptor. (#3665 ) We may see RPCs that are dispatched by a client but do not arrive at the server for some time afterwards. To have insight into potential request latency at this layer we want to publish the time delta between when a client sent an RPC and when the server received it. This PR updates the gRPC client interceptor to add the current time to the gRPC request metadata context when it dispatches an RPC. The server side interceptor is updated to pull the client request time out of the gRPC request metadata. Using this timestamp it can calculate the latency and publish it as an observation on a Prometheus histogram. Accomplishing the above required wiring a clock through to each of the client interceptors. This caused a small diff across each of the gRPC aware boulder commands. A small unit test is included in this PR that checks that a latency stat is published to the histogram after an RPC to a test ChillerServer is made. It's difficult to do more in-depth testing because using fake clocks makes the latency 0 and using real clocks requires finding a way to queue/delay requests inside of the gRPC mechanisms not exposed to Boulder. Updates https://github.com/letsencrypt/boulder/issues/3635 - Still TODO: Explicitly logging latency in the VA, tracking outstanding RPCs as a gauge.	2018-04-25 15:37:22 -07:00
Daniel McCarney	f8f9a158c7	orphan-finder: set cert issued date based on notbefore. (#3651 ) The Boulder orphan-finder command uses the SA's AddCertificate RPC to add orphaned certificates it finds back to the DB. Prior to this commit this RPC always set the core.Certificate.Issued field to the current time. For the orphan-finder case this meant that the Issued date would incorrectly be set to when the certificate was found, not when it was actually issued. This could cause cert-checker to alarm based on the unusual delta between the cert NotBefore and the core.Certificate.Issued value. This PR updates the AddCertificate RPC to accept an optional issued timestamp in the request arguments. In the SA layer we address deployability concerns by setting a default value of the current time when none is explicitly provided. This matches the classic behaviour and will let an old RA communicate with a new SA. This PR updates the orphan-finder to provide an explicit issued time to sa.AddCertificate. The explicit issued time is calculated using the found certificate's NotBefore and the configured backdate. This lets the orphan-finder set the true issued time in the core.Certificate object, avoiding any cert-checker alarms. Resolves #3624	2018-04-19 10:25:12 -07:00
Jacob Hoffman-Andrews	9da5a7e1fc	Cleanup: TLS and GRPC configs are mandatory. (#3476 ) Our various main.go functions gated some key code on whether the TLS and/or GRPC config fields were present. Now that those fields are fully deployed in production, we can simplify the code and require them. Also, rename tls to tlsConfig everywhere to avoid confusion with the tls package. Avoid assigning to the same err from two different goroutines in boulder-ca (fix a race).	2018-02-26 10:16:50 -05:00
Jacob Hoffman-Andrews	68d5cc3331	Restore gRPC metrics (#3265 ) The go-grpc-prometheus package by default registers its metrics with Prometheus' global registry. In #3167, when we stopped using the global registry, we accidentally lost our gRPC metrics. This change adds them back. Specifically, it adds two convenience functions, one for clients and one for servers, that makes the necessary metrics object and registers it. We run these in the main function of each server. I considered adding these as part of StatsAndLogging, but the corresponding ClientMetrics and ServerMetrics objects (defined by go-grpc-prometheus) need to be subsequently made available during construction of the gRPC clients and servers. We could add them as fields on Scope, but this seemed like a little too much tight coupling. Also, update go-grpc-prometheus to get the necessary methods. ``` $ go test github.com/grpc-ecosystem/go-grpc-prometheus/... ok github.com/grpc-ecosystem/go-grpc-prometheus 0.069s ? github.com/grpc-ecosystem/go-grpc-prometheus/examples/testproto [no test files] ```	2017-12-07 15:44:55 -08:00
Jacob Hoffman-Andrews	f366e45756	Remove global state from metrics gathering (#3167 ) Previously, we used prometheus.DefaultRegisterer to register our stats, which uses global state to export its HTTP stats. We also used net/http/pprof's behavior of registering to the default global HTTP ServeMux, via DebugServer, which starts an HTTP server that uses that global ServeMux. In this change, I merge DebugServer's functions into StatsAndLogging. StatsAndLogging now takes an address parameter and fires off an HTTP server in a goroutine. That HTTP server is newly defined, and doesn't use DefaultServeMux. On it is registered the Prometheus stats handler, and handlers for the various pprof traces. In the process I split StatsAndLogging internally into two functions: makeStats and MakeLogger. I didn't port across the expvar variable exporting, which serves a similar function to Prometheus stats but which we never use. One nice immediate effect of this change: Since StatsAndLogging now requires and address, I noticed a bunch of commands that called StatsAndLogging, and passed around the resulting Scope, but never made use of it because they didn't run a DebugServer. Under the old StatsD world, these command still could have exported their stats by pushing, but since we moved to Prometheus their stats stopped being collected. We haven't used any of these stats, so instead of adding debug ports to all short-lived commands, or setting up a push gateway, I simply removed them and switched those commands to initialize only a Logger, no stats.	2017-10-13 11:58:01 -07:00
Jacob Hoffman-Andrews	b17b5c72a6	Remove statsd from Boulder (#2752 ) This removes the config and code to output to statsd. - Change `cmd.StatsAndLogging` to output a `Scope`, not a `Statter`. - Remove the prefixing of component name (e.g. "VA") in front of stats; this was stripped by `autoProm` but now no longer needs to be. - Delete vendored statsd client. - Delete `MockStatter` (generated by gomock) and `mocks.Statter` (hand generated) in favor of mocking `metrics.Scope`, which is the interface we now use everywhere. - Remove a few unused methods on `metrics.Scope`, and update its generated mock. - Refactor `autoProm` and add `autoRegisterer`, which can be included in a `metrics.Scope`, avoiding global state. `autoProm` now registers everything with the `prometheus.Registerer` it is given. - Change va_test.go's `setup()` to not return a stats object; instead the individual tests that care about stats override `va.stats` directly. Fixes #2639, #2733.	2017-05-15 10:19:54 -04:00
Roland Bracewell Shoemaker	636a1fc878	Remove core.XXXError type checks	2017-05-03 22:18:13 +00:00
Jacob Hoffman-Andrews	d99800ecb1	Remove some last traces of AMQP. (#2687 ) Fixes #2665	2017-04-20 10:43:17 -07:00
Roland Bracewell Shoemaker	fd561ef842	Block issuance on first OCSP response generation (#2633 ) Generate first OCSP response in ca.IssueCertificate instead of ocsp-updater.newCertificateTick if features.GenerateOCSPEarly is enabled. Adds a new field to the sa.AddCertiifcate RPC for the OCSP response and only adds it to the certificate status + sets ocspLastUpdated if it is a non-empty slice. ocsp-updater.newCertificateTick stays the same so we can catch certificates that were successfully signed + stored but a OCSP response couldn't be generated (for whatever reason). Fixes #2477.	2017-04-04 11:28:09 -07:00
Jacob Hoffman-Andrews	6719dc17a6	Remove AMQP config and code (#2634 ) We now use gRPC everywhere.	2017-04-03 10:39:39 -04:00
Roland Bracewell Shoemaker	e2b2511898	Overhaul internal error usage (#2583 ) This patch removes all usages of the `core.XXXError` and almost all usages of `probs` outside of the WFE and VA and replaces them with a unified internal error type. Since the VA uses `probs.ProblemDetails` quite extensively in challenges, and currently stores them in the DB I've saved this change for another change (it'll also require a migration). Since `ProblemDetails` should only ever be exposed to end-users all of its related logic should be moved into the `WFE` but since it still needs to be exposed to the VA and SA I've left it in place for now. The new internal `errors` package offers the same convenience functions as `probs` does as well as a new simpler type testing method. A few small changes have also been made to error messages, mainly adding the library and function name to internal server errors for easier debugging (i.e. where a number of functions return the exact same errors and there is no other way to distinguish which method threw the error). Also adds proper encoding of internal errors transferred over gRPC (the current encoding scheme is kept for `core` and `probs` errors since it'll be ideally be removed after we deploy this and follow-up changes) using `grpc/metadata` instead of the gRPC status codes. Fixes #2507. Updates #2254 and #2505.	2017-03-22 23:27:31 -07:00
Daniel McCarney	00d11f126b	Parse feature flags in all cmd's (#2534 ) If you are the first person to add a feature to a Boulder command its very easy to forget to update the command's config structure to accommodate a `map[string]bool` entry and to pass it to `features.Set` in `main()`. See https://github.com/letsencrypt/boulder/issues/2533 for one example. I've fallen into this trap myself a few times so I'm going to try and save myself some future grief by fixing it across the board once and for all! This PR adds a `Features` config entry and a corresponding `features.Set` to: * ocsp-updater (resolves #2533) * admin-revoker * boulder-publisher * contact-exporter * expiration-mailer * expired-authz-purger * notify-mailer * ocsp-responder * orphan-finder These components were skipped because they already had features supported: * boulder-ca * boulder-ra * boulder-sa * boulder-va * boulder-wfe * cert-checker I deliberately skipped adding Feature support to: * single-ocsp (Its only configuration comes from the pkcs11key library and doesn't support features) * rabbitmq-setup (No configuration/features and we'll likely soon be rming this since the gRPC migration) * notafter-backfill (This is a one-off that will be deleted soon)	2017-01-27 16:29:46 -05:00
Jacob Hoffman-Andrews	510e279208	Simplify gRPC TLS configs. (#2470 ) Previously, a given binary would have three TLS config fields (CA cert, cert, key) for its gRPC server, plus each of its configured gRPC clients. In typical use, we expect all three of those to be the same across both servers and clients within a given binary. This change reuses the TLSConfig type already defined for use with AMQP, adds a Load() convenience function that turns it into a *tls.Config, and configures it for use with all of the binaries. This should make configuration easier and more robust, since it more closely matches usage. This change preserves temporary backwards-compatibility for the ocsp-updater->publisher RPCs, since those are the only instances of gRPC currently enabled in production.	2017-01-06 14:19:18 -08:00
Jacob Hoffman-Andrews	e25138b21c	Update orphan finder. (#2409 ) The log format changed slightly: We log hex instead of base64.	2016-12-09 12:06:19 -08:00
Jacob Hoffman-Andrews	27a1446010	Move timeouts into client interceptor. (#2387 ) Previously we had custom code in each gRPC wrapper to implement timeouts. Moving the timeout code into the client interceptor allows us to simplify things and reduce code duplication.	2016-12-05 10:42:26 -05:00
Roland Bracewell Shoemaker	03fdd65bfe	Add gRPC server to SA (#2374 ) Adds a gRPC server to the SA and SA gRPC Clients to the WFE, RA, CA, Publisher, OCSP updater, orphan finder, admin revoker, and expiration mailer. Also adds a CA gRPC client to the OCSP Updater which was missed in #2193. Fixes #2347.	2016-12-02 17:24:46 -08:00
Roland Bracewell Shoemaker	c8f1fb3e2f	Remove direct usages of go-statsd-client in favor of using metrics.Scope (#2136 ) Fixes #2118, fixes #2082.	2016-09-07 19:35:13 -04:00
Ben Irving	f73328b3cb	Split up boulder-config.json (Orphan Finder) (#2059 )	2016-07-21 09:30:31 -04:00
Ben Irving	1336c42813	Replace all log.Err calls with log.AuditErr (#1891 ) * remove calls to log.Err() * go fmt * remove more occurrences * change AuditErr argument to string and replace occurrences	2016-06-06 16:27:16 -04:00
Kane York	b7cf618f5d	context.Context as the first parameter of all RPC calls (#1741 ) Change core/interfaces to put context.Context as the first parameter of all RPC calls in preparation for gRPC.	2016-04-19 11:34:36 -07:00
Jacob Hoffman-Andrews	e6c17e1717	Switch to new vendor style (#1747 ) * Switch to new vendor style. * Fix metrics generate command. * Fix miekg/dns types_generate. * Use generated copies of files. * Update miekg to latest. Fixes a problem with `go generate`. * Set GO15VENDOREXPERIMENT. * Build in letsencrypt/boulder. * fix travis more. * Exclude vendor instead of godeps. * Replace some ... * Fix unformatted cmd * Fix errcheck for vendorexp * Add GO15VENDOREXPERIMENT to Makefile. * Temp disable errcheck. * Restore master fetch. * Restore errcheck. * Build with 1.6 also. * Match statsd." Skip errcheck unles Go1.6. * Add other ignorepkg. * Fix errcheck. * move errcheck * Remove go1.6 requirement. * Put godep-restore with errcheck. * Remove go1.6 dep. * Revert master fetch revert. * Remove -r flag from godep save. * Set GO15VENDOREXPERIMENT in Dockerfile and remove _worskpace. * Fix Godep version.	2016-04-18 12:51:36 -07:00
Kane York	25b45a45ec	Errcheck errors fixed (#1677 ) * Fix all errcheck errors * Add errcheck to test.sh * Add a new sa.Rollback method to make handling errors in rollbacks easier. This also causes a behavior change in the VA. If a HTTP connection is abruptly closed after serving the headers for a non-200 response, the reported error will be the read failure instead of the non-200.	2016-04-12 16:54:01 -07:00
Jacob Hoffman-Andrews	ecc04e8e61	Refactor log package (#1717 ) - Remove error signatures from log methods. This means fewer places where errcheck will show ignored errors. - Pull in latest cfssl to be compatible with errorless log messages. - Reduce the number of message priorities we support to just those we actually use. - AuditNotice -> AuditInfo - Remove InfoObject (only one use, switched to Info) - Remove EmergencyExit and related functions in favor of panic - Remove SyslogWriter / AuditLogger separate types in favor of a single interface, Logger, that has all the logging methods on it. - Merge mock log into logger. This allows us to unexport the internals but still override them in the mock. - Shorten names to be compatible with Go style: New, Set, Get, Logger, NewMock, etc. - Use a shorter log format for stdout logs. - Remove "... Starting" log messages. We have better information in the "Versions" message logged at startup. Motivation: The AuditLogger / SyslogWriter distinction was confusing and exposed internals only necessary for tests. Some components accepted one type and some accepted the other. This made it hard to consistently use mock loggers in tests. Also, the unnecessarily fat interface for AuditLogger made it hard to meaningfully mock out.	2016-04-08 16:12:20 -07:00
Jacob Hoffman-Andrews	090565a711	Accept = in orphan-finder. Also, when a certificate already exists, treat that as info, not error. Update mock logger to allow matching by log level, and fix WFE and VA tests correspondingly.	2016-04-05 17:46:51 -07:00
Jacob Hoffman-Andrews	3018c00519	Testing and logging improvements Pass log as an argument to SA. This allows us to mock it out. Use a mockSA in CA test. Use mockSA in orphan-finder test. Improve logging from assert functions: Use our own printing style plus FailNow() so that each failure message isn't prefixed by "test-tools.go:60" Remove duplicate TraceOn. Part of #1642. https://github.com/letsencrypt/boulder/pull/1683	2016-04-04 18:42:42 -07:00
Roland Shoemaker	29127d5779	Add tool to find orphaned certificates in boulder-ca logs	2016-01-26 15:43:23 -08:00

43 Commits