Things removed:
* features.EmbedSCTs (and all the associated RA/CA/ocsp-updater code, etc.)
* ca.enablePrecertificateFlow (and all the associated RA/CA code)
* sa.AddSCTReceipt and sa.GetSCTReceipt RPCs
* publisher.SubmitToCT and publisher.SubmitToSingleCT RPCs
Fixes #3755.
We're currently stuck on gRPC v1.1 because of a breaking change to certificate validation in gRPC 1.8. Our gRPC balancer uses a static list of multiple hostnames and expects to validate against those hostnames. However, gRPC assumes that a service is one hostname with multiple IP addresses, and validates all of those IP addresses against that single hostname. See grpc/grpc-go#2012.
If we follow gRPC's assumptions, we can rip out our custom Balancer and custom TransportCredentials, and will probably have a lower-friction time in general.
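As a sketch of where we want to end up (assuming a newer grpc-go; the service name and port here are hypothetical), a client dials one DNS name and validates every resolved backend against that single name:

```go
package main

import (
	"crypto/tls"
	"log"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials"
)

func main() {
	// One service name, many IPs: the dns resolver returns every address
	// for sa.boulder, and TLS validates each connection against that same
	// hostname. No custom Balancer or TransportCredentials needed; a
	// round-robin balancer would then spread RPCs across the addresses.
	// "sa.boulder:9095" is a hypothetical name and port.
	creds := credentials.NewTLS(&tls.Config{ServerName: "sa.boulder"})
	conn, err := grpc.Dial(
		"dns:///sa.boulder:9095",
		grpc.WithTransportCredentials(creds),
	)
	if err != nil {
		log.Fatalf("dial: %v", err)
	}
	defer conn.Close()
}
```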
This PR is the first step in doing so. In order to satisfy the "multiple IPs, one port" property of gRPC backends in our Docker container infrastructure, we switch to Docker's user-defined networking. This allows us to give the Boulder container multiple IP addresses on different local networks, and gives it different DNS aliases in each network.
In startservers.py, each shard of a service listens on a different DNS alias for that service, and therefore a different IP address. The listening port for each shard of a service is now identical.
This change also updates the gRPC service certificates. Now, each certificate that is used in a gRPC service (as opposed to something that is "only" a client) has three names. For instance, sa1.boulder, sa2.boulder, and sa.boulder (the generic service name). For now, we are validating against the specific hostnames. When we update our gRPC dependency, we will begin validating against the generic service name.
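For illustration, here is how a certificate with all three SANs could be minted with crypto/x509 (self-signed only for brevity; the real certificates are generated under test/grpc-creds):

```go
package main

import (
	"crypto/ecdsa"
	"crypto/elliptic"
	"crypto/rand"
	"crypto/x509"
	"crypto/x509/pkix"
	"encoding/pem"
	"log"
	"math/big"
	"os"
	"time"
)

func main() {
	key, err := ecdsa.GenerateKey(elliptic.P256(), rand.Reader)
	if err != nil {
		log.Fatal(err)
	}
	tmpl := &x509.Certificate{
		SerialNumber: big.NewInt(1),
		Subject:      pkix.Name{CommonName: "sa.boulder"},
		// Each shard's specific name, plus the generic service name that
		// clients will validate against once our gRPC dependency is updated.
		DNSNames:  []string{"sa1.boulder", "sa2.boulder", "sa.boulder"},
		NotBefore: time.Now(),
		NotAfter:  time.Now().Add(90 * 24 * time.Hour),
		KeyUsage:  x509.KeyUsageDigitalSignature,
		ExtKeyUsage: []x509.ExtKeyUsage{
			x509.ExtKeyUsageServerAuth,
			x509.ExtKeyUsageClientAuth, // the same cert is also used as a client cert
		},
	}
	der, err := x509.CreateCertificate(rand.Reader, tmpl, tmpl, &key.PublicKey, key)
	if err != nil {
		log.Fatal(err)
	}
	pem.Encode(os.Stdout, &pem.Block{Type: "CERTIFICATE", Bytes: der})
}
```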
Incidentally, the DNS aliases feature of Docker allows us to get rid of some hackery in entrypoint.sh that inserted entries into /etc/hosts.
Note: Boulder now has a dependency on the DNS aliases feature in Docker. By default, docker-compose run creates a temporary container and doesn't assign any aliases to it. We now need to specify docker-compose run --use-aliases to get the correct behavior. Without --use-aliases, Boulder won't be able to resolve the hostnames it wants to bind to.
Boulder is fairly noisy about gRPC connection errors. This is a mixed
blessing: our gRPC configuration will retry a failed connection until
it hits an RPC deadline, and will most likely eventually succeed. In
that case, we don't consider the transient failures to be real errors.
However, in cases where a connection is repeatedly failing, we'd like
the logs to show connection failures rather than just "deadline
exceeded." So we want to keep logging gRPC errors.
However, right now we get a lot of these errors logged during
integration tests. They make the output hard to read, and may disguise
more serious errors. So we'd like to avoid causing such errors in
normal integration test operation.
This change reorders the startup of Boulder components by their gRPC
dependencies, so that each component's backends are likely to be up
and running before it starts. It also reverses that order for clean
shutdowns, and waits for each process to exit before signalling the
next one.
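The real orchestration lives in startservers.py; the shape of it, sketched in Go with hypothetical component names, is roughly:

```go
package main

import (
	"log"
	"os/exec"
	"syscall"
)

// startAll launches components in dependency order (backends first),
// so each component's gRPC backends are up before its clients start.
func startAll(commands [][]string) []*exec.Cmd {
	var running []*exec.Cmd
	for _, args := range commands {
		cmd := exec.Command(args[0], args[1:]...)
		if err := cmd.Start(); err != nil {
			log.Fatalf("starting %s: %v", args[0], err)
		}
		running = append(running, cmd)
	}
	return running
}

// stopAll shuts down in reverse order, waiting for each process to
// exit before signalling the next, so nothing loses its backend early.
func stopAll(running []*exec.Cmd) {
	for i := len(running) - 1; i >= 0; i-- {
		running[i].Process.Signal(syscall.SIGTERM)
		running[i].Wait()
	}
}

func main() {
	// Hypothetical component list, backends before their clients.
	procs := startAll([][]string{
		{"boulder-sa"}, {"boulder-publisher"}, {"ocsp-updater"},
	})
	stopAll(procs)
}
```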
With these changes, I still got connection errors. Taking listenbuddy
out of the gRPC path fixed them. I believe the issue is that
listenbuddy is not a truly transparent proxy. In particular, it
accepts an inbound TCP connection before opening an outbound TCP
connection. If opening that outbound connection results in "connection
refused," it closes the inbound connection. That means gRPC sees a
"connection closed" (or "connection reset"?) rather than "connection
refused". I'm guessing it handles those cases differently, explaining
the different error results.
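A minimal sketch of that accept-then-dial shape (not listenbuddy's actual code; the addresses are hypothetical):

```go
// A proxy that accepts the inbound connection before dialing the backend:
// the client's connect succeeds as soon as we Accept, so a refused backend
// can only surface to the client as a close/reset on an established
// connection, never as "connection refused".
package main

import (
	"io"
	"log"
	"net"
)

func main() {
	ln, err := net.Listen("tcp", ":9095") // hypothetical front address
	if err != nil {
		log.Fatal(err)
	}
	for {
		in, err := ln.Accept() // client already sees a successful connect here
		if err != nil {
			log.Fatal(err)
		}
		out, err := net.Dial("tcp", "127.0.0.1:19095") // hypothetical backend
		if err != nil {
			in.Close() // client gets "connection closed", not "connection refused"
			continue
		}
		go io.Copy(out, in)
		go io.Copy(in, out)
	}
}
```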
We've been using listenbuddy to trigger disconnects while Boulder is
running, to ensure that gRPC's reconnect code works. I think we can
probably rely on gRPC's reconnect to work. The initial problem that
led us to start testing this was a configuration problem; now that
we have the configuration we want, we should be fine and don't need
to keep testing reconnects on every integration test run.
Deletes github.com/streadway/amqp and the various RabbitMQ setup tools, etc. Changes how listenbuddy is used to proxy all of the gRPC client -> server connections, so that we still exercise the reconnection logic.
Fixes #2640 and #2562.
We have a number of stats already expressed using the statsd interface. During
the switchover period to direct Prometheus collection, we'd like to make those
stats available both ways. This change automatically exports any stats exported
using the statsd interface via Prometheus as well.
This is a little tricky because Prometheus expects every stat to be registered
exactly once. Prometheus does offer a mechanism to recover gracefully from
registering a stat twice, by checking for a specific error type
(AlreadyRegisteredError), but that pattern is not safe for concurrent access.
So I added a concurrency-safe wrapper that creates Prometheus stats on demand
and memoizes them.
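A minimal sketch of that wrapper, handling only counters (the real one covers the rest of the statsd-style interface):

```go
package metrics

import (
	"sync"

	"github.com/prometheus/client_golang/prometheus"
)

// promCounters creates counters on first use and memoizes them, so the
// statsd-style interface can call Inc concurrently without tripping over
// Prometheus's register-exactly-once rule.
type promCounters struct {
	mu       sync.Mutex
	counters map[string]prometheus.Counter
}

func newPromCounters() *promCounters {
	return &promCounters{counters: make(map[string]prometheus.Counter)}
}

func (p *promCounters) get(name string) prometheus.Counter {
	p.mu.Lock()
	defer p.mu.Unlock()
	if c, ok := p.counters[name]; ok {
		return c
	}
	c := prometheus.NewCounter(prometheus.CounterOpts{Name: name, Help: name})
	if err := prometheus.Register(c); err != nil {
		// Recover gracefully if this name was already registered elsewhere.
		if are, ok := err.(prometheus.AlreadyRegisteredError); ok {
			c = are.ExistingCollector.(prometheus.Counter)
		} else {
			panic(err)
		}
	}
	p.counters[name] = c
	return c
}

func (p *promCounters) Inc(name string, value int64) {
	p.get(name).Add(float64(value))
}
```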
In the process, made a few small required side changes:
- Clean "/" from method names in the gRPC interceptors. They are allowed in
statsd but not in Prometheus.
- Replace "127.0.0.1" with "boulder" as the name of our testing CT log.
Prometheus stats can't start with a number.
- Remove ":" from the CT-log stat names emitted by Publisher. Prometheus stats
can't include it.
- Remove a stray "RA" in front of some rate limit stats, since it was
duplicative (we were emitting "RA.RA..." before).
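As referenced above, the name cleaning looks roughly like this hypothetical helper (the helper name and the leading-digit handling are illustrative; the actual fix for "127.0.0.1" was renaming the log):

```go
package metrics

import "strings"

// promSafe is a hypothetical sketch of the kind of cleaning applied:
// strip "/" and ":" and avoid names that start with a digit.
func promSafe(name string) string {
	name = strings.Replace(name, "/", "_", -1)
	name = strings.Replace(name, ":", "", -1)
	if len(name) > 0 && name[0] >= '0' && name[0] <= '9' {
		name = "_" + name
	}
	return name
}
```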
Note that this means two stat groups in particular are duplicated:
- Gostats* is duplicated by the default process-level stats exported by the
Prometheus library.
- gRPCClient* are duplicated by the stats generated by the go-grpc-prometheus
package.
When writing dashboards and alerts in the Prometheus world, we should be careful
to avoid these two categories, as they will disappear eventually. As a general
rule, if a stat is available with an all-lowercase name, choose that one, as it
is probably the Prometheus-native version.
In the long run we will want to create most stats using the native Prometheus
stat interface, since it allows us to add labels to metrics, which is very
useful. For instance, currently our DNS stats distinguish types of queries by
appending the type to the stat name. This would be more natural as a label in
Prometheus.
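For example, a hypothetical Prometheus-native version of the DNS query counter (metric and stat names here are illustrative, not Boulder's actual ones):

```go
package metrics

import "github.com/prometheus/client_golang/prometheus"

// One metric with a "type" label instead of one stat name per query type.
var dnsQueries = prometheus.NewCounterVec(
	prometheus.CounterOpts{
		Name: "dns_queries",
		Help: "Count of DNS queries, labelled by query type.",
	},
	[]string{"type"},
)

func init() {
	prometheus.MustRegister(dnsQueries)
}

// Instead of something like stats.Inc("DNS.Rate.A", 1):
func recordQuery(qtype string) {
	dnsQueries.WithLabelValues(qtype).Inc()
}
```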
This allows finer-grained control of which components can request issuance. The OCSP Updater should not be able to request issuance.
Also, update test/grpc-creds/generate.sh to reissue the certs properly.
Resolves #2417.
In https://github.com/letsencrypt/boulder/pull/2453 we created
individual client certificates for each gRPC client. The "clientNames"
list in `config-next/publisher.json` was updated for the new
component-specific SANs, but we neglected to update
`config/publisher.json`. This caused the `ocsp-updater` (which uses gRPC
in the base `config/` to talk to the `publisher`) to fail to connect.
This commit updates `config/publisher.json` to have the same clientNames
as `config-next/publisher.json` and resolves #2465.
Previously, all gRPC services used the same client and server certificates. Now,
each service has its own certificate, which it uses for both client and server
authentication, more closely simulating production.
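A sketch of the resulting TLS setup, with hypothetical file paths; the same keypair appears in both the server and client configs:

```go
package main

import (
	"crypto/tls"
	"crypto/x509"
	"io/ioutil"
	"log"
)

func main() {
	// The service's own keypair, used in both directions. Paths hypothetical.
	cert, err := tls.LoadX509KeyPair("sa.boulder/cert.pem", "sa.boulder/key.pem")
	if err != nil {
		log.Fatal(err)
	}
	caPEM, err := ioutil.ReadFile("test-ca.pem") // shared test CA, path hypothetical
	if err != nil {
		log.Fatal(err)
	}
	caPool := x509.NewCertPool()
	caPool.AppendCertsFromPEM(caPEM)

	// As a server: require and verify client certs against the CA.
	serverConf := &tls.Config{
		Certificates: []tls.Certificate{cert},
		ClientAuth:   tls.RequireAndVerifyClientCert,
		ClientCAs:    caPool,
	}
	// As a client: present the same cert, verify the server against the CA.
	clientConf := &tls.Config{
		Certificates: []tls.Certificate{cert},
		RootCAs:      caPool,
	}
	_, _ = serverConf, clientConf
}
```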
This also adds aliases for each of the relevant hostnames in /etc/hosts. There
may be some issues if Docker decides to rewrite /etc/hosts while Boulder is
running, but this seems to work for now.
- Remove spinner from test.js. It made Travis logs hard to read.
- Listen on all interfaces for debugAddr. This makes it possible to check
Prometheus metrics for instances running in a Docker container.
- Standardize DNS timeouts at 1s with 3 retries across all configs. This
ensures DNS completes within the relevant RPC timeouts.
- Remove RA service queue from VA, since VA no longer uses the callback to RA on
completing a challenge.
The ocsp-updater has been switched over to the `config-next/` usage of
gRPC for submitting to the publisher service. This commit removes the
legacy AMQP support for doing so.
This does not remove the `rpc/rpc-wrappers.go` implementation of
`NewPublisherClient` at this point because it appears `boulder-ca` may
still be using it.