boulder

Commit Graph

Author	SHA1	Message	Date
Aaron Gable	dad9e08606	Lay the groundwork for supporting IP identifiers (#7692 ) Clean up how we handle identifiers throughout the Boulder codebase by - moving the Identifier protobuf message definition from sa.proto to core.proto; - adding support for IP identifier to the "identifier" package; - renaming the "identifier" package's exported names to be clearer; and - ensuring we use the identifier package's helper functions everywhere we can. This will make future work to actually respect identifier types (such as in Authorization and Order protobuf messages) simpler and easier to review. Part of https://github.com/letsencrypt/boulder/issues/7311	2024-08-30 11:40:38 -07:00
Jacob Hoffman-Andrews	ac4be89b56	grpc: add NoWaitForReady config field (#6850 ) Currently we set WaitForReady(true), which causes gRPC requests to not fail immediately if no backends are available, but instead wait until the timeout in case a backend does become available. The downside is that this behavior masks true connection errors. We'd like to turn it off. Fixes #6834	2023-05-09 16:16:44 -07:00
Aaron Gable	257136779c	Add interceptor for per-rpc client auth (#6488 ) Add a new gRPC server interceptor (both unary and streaming) which verifies that the mTLS info set on the persistent connection has a client cert which contains a name which is allowlisted for the particular service being called, not just for the overall server. This will allow us to make more services -- particularly the CA and the SA -- more similar to the VA. We will be able to run multiple services on the same port, while still being able to control access to those services on a per-client basis. It will also let us split those services (e.g. into read-only and read-write subsets) much more easily, because a client will be able to switch which service it is calling without also having to be reconfigured to call a different address. And finally, it will allow us to simplify configuration for clients (such as the RA) which maintain connections to multiple different services on the same server, as they'll be able to re-use the same address configuration.	2022-11-07 13:47:47 -08:00
Aaron Gable	0a02cdf7e3	Streamline gRPC client creation (#6472 ) Remove the need for clients to explicitly call bgrpc.NewClientMetrics, by moving that call inside bgrpc.ClientSetup. In case ClientSetup is called multiple times, use the recommended method to gracefully recover from registering duplicate metrics. This makes gRPC client setup much more similar to gRPC server setup after the previous server refactoring change landed.	2022-10-28 08:45:52 -07:00
Aaron Gable	9213bd0993	Streamline gRPC server creation (#6457 ) Collapse most of our boilerplate gRPC creation steps (in particular, creating default metrics, making the server and listener, registering the server, creating and registering the health service, filtering shutdown errors from the output, and gracefully stopping) into a single function in the existing bgrpc package. This allows all but one of our server main functions to drop their calls to NewServer and NewServerMetrics. To enable this, create a new helper type and method in the bgrpc package. Conceptually, this could be just a new function, but it must be attached to a new type so that it can be generic over the type of gRPC server being created. (Unfortunately, the grpc.RegisterFooServer methods do not accept an interface type for their second argument). The only main function which is not updated is the boulder-va, which is a special case because it creates multiple gRPC servers but (unlike the CA) serves them all on the same port with the same server and listener. Part of #6452	2022-10-26 15:45:52 -07:00
Samantha	bdd9ad9941	grpc: Pass data necessary for Retry-After headers in BoulderErrors (#6415 ) - Add a new field, `RetryAfter` to `BoulderError`s - Add logic to wrap/unwrap the value of the `RetryAfter` field to our gRPC error interceptor - Plumb `RetryAfter` for `DuplicateCertificateError` emitted by RA to the WFE client response header Part of #6256	2022-10-03 16:24:58 -07:00
Aaron Gable	927b1622b7	Add gRPC stream interceptors (#6370 ) Create new gRPC interceptors which are capable of working on streaming gRPC methods. Add these new interceptors, as well as the default metrics interceptor provided by grpc-prometheus, to all of our gRPC clients and servers. The new interceptors behave virtually identically to their unary counterparts: they wrap and unwrap our custom errors from the gRPC metadata, they increment and decrement the in-flight RPC metric, and they ensure that the RPCs don't fail-fast and do have enough time left in their deadline to actually finish. Unfortunately, because the interfaces for unary and streaming RPCs are so divergent, it's not feasible to share code between the two kinds of interceptors. While much of the new code is copy-pasted from the old interceptors, there are subtle differences (such as not immediately deferring the local context's cancel() function). Fixes #6356	2022-09-12 09:28:12 -07:00
Aaron Gable	c706609e79	Update grpc from v1.36.1 to v1.49.0 (#6336 ) Changelog: https://github.com/grpc/grpc-go/compare/v1.36.1...v1.49.0 The biggest change for us is that grpc.WithBalancerName has transitioned from deprecated to fully removed. The fix is to replace it with a JSON-formatted "default config" object, as demonstrated in https://github.com/grpc/grpc-go/pull/5232#issuecomment-1106921140. This should unblock updating other dependencies which want to transitively update gRPC as well.	2022-09-01 13:29:06 -07:00
Aaron Gable	4ad66729d2	Tests: use reflect.IsNil() to avoid boxed nil issues (#6305 ) Add a new `test.AssertNil()` helper to facilitate asserting that a given unit test result is a non-boxed nil. Update `test.AssertNotNil()` to use the reflect package's `.IsNil()` method to catch boxed nils. In Go, variables whose type is constrained to be an interface type (e.g. a function parameter which takes an interface, or the return value of a function which returns `error`, itself an interface type) should actually be thought of as a (T, V) tuple, where T is their underlying concrete type and V is their underlying value. Thus, there are two ways for such a variable to be nil-like: it can be truly nil where T=nil and V is uninitialized, or it can be a "boxed nil" where T is a nillable type such as a pointer or a slice and V=nil. Unfortunately, only the former of these is == nil. The latter is the cause of frequent bugs, programmer frustration, a whole entry in the Go FAQ, and considerable design effort to remove from Go 2. Therefore these two test helpers both call `t.Fatal()` when passed a boxed nil. We want to avoid passing around boxed nils whenever possible, and having our tests fail whenever we do is a good way to enforce good nil hygiene. Fixes #3279	2022-08-19 14:47:34 -07:00
Aaron Gable	ab79f96d7b	Fixup staticcheck and stylecheck, and violations thereof (#5897 ) Add `stylecheck` to our list of lints, since it got separated out from `staticcheck`. Fix the way we configure both to be clearer and not rely on regexes. Additionally fix a number of easy-to-change `staticcheck` and `stylecheck` violations, allowing us to reduce our number of ignored checks. Part of #5681	2022-01-20 16:22:30 -08:00
Aaron Gable	6629b49376	Fix grpc test proto generation (#5452 ) The //grpc/test_proto/generate.go file was not generating the protos in its own directory, it was regenerating the VA protos. Therefore the generated files were out of date, and were relying on an old version of the go proto library, which we can now remove from our direct deps. Part of #5443 Part of #5453	2021-06-02 16:19:25 -07:00
Jacob Hoffman-Andrews	bef02e782a	Fix nits found by staticcheck (#4726 ) Part of #4700	2020-03-30 10:20:20 -07:00
Roland Bracewell Shoemaker	5b2f11e07e	Switch away from old style statsd metrics wrappers (#4606 ) In a handful of places I've nuked old stats which are not used in any alerts or dashboards as they either duplicate other stats or don't provide much insight/have never actually been used. If we feel like we need them again in the future it's trivial to add them back. There aren't many dashboards that rely on old statsd style metrics, but a few will need to be updated when this change is deployed. There are also a few cases where prometheus labels have been changed from camel to snake case, dashboards that use these will also need to be updated. As far as I can tell no alerts are impacted by this change. Fixes #4591.	2019-12-18 11:08:25 -05:00
Daniel McCarney	5a1c18dd9f	gRPC: support wrap/unwrap of berrors with suberrors. (#4278 ) If a berror with suberrors is being wrapped then we must marshal the suberrors as JSON and include this data in the RPC metadata trailer that also carries the berror type. When unwrapping metadata with JSON suberrors they should be unmarshalled into the returned berror's suberrors.	2019-06-20 16:36:13 -04:00
Roland Bracewell Shoemaker	6f93942a04	Consistently used stdlib context package (#4229 )	2019-05-28 14:36:16 -04:00
Daniel McCarney	4f9ee00510	gRPC: publish in-flight RPC gauge in client interceptor. (#3672 ) This PR updates the Boulder gRPC clientInterceptor to update a Prometheus gauge stat for each in-flight RPC it dispatches, sliced by service and method. A unit test is included that uses a custom ChillerServer that lets the test block up a bunch of RPCs, check the in-flight gauge value is increased, unblock the RPCs, and recheck that the in-flight gauge is reduced. To check the gauge value for a specific set of labels a new test-tools.go function GaugeValueWithLabels is added. Updates #3635	2018-04-27 15:53:54 -07:00
Daniel McCarney	aa810a3142	gRPC: publish RPC latency stat in server interceptor. (#3665 ) We may see RPCs that are dispatched by a client but do not arrive at the server for some time afterwards. To have insight into potential request latency at this layer we want to publish the time delta between when a client sent an RPC and when the server received it. This PR updates the gRPC client interceptor to add the current time to the gRPC request metadata context when it dispatches an RPC. The server side interceptor is updated to pull the client request time out of the gRPC request metadata. Using this timestamp it can calculate the latency and publish it as an observation on a Prometheus histogram. Accomplishing the above required wiring a clock through to each of the client interceptors. This caused a small diff across each of the gRPC aware boulder commands. A small unit test is included in this PR that checks that a latency stat is published to the histogram after an RPC to a test ChillerServer is made. It's difficult to do more in-depth testing because using fake clocks makes the latency 0 and using real clocks requires finding a way to queue/delay requests inside of the gRPC mechanisms not exposed to Boulder. Updates https://github.com/letsencrypt/boulder/issues/3635 - Still TODO: Explicitly logging latency in the VA, tracking outstanding RPCs as a gauge.	2018-04-25 15:37:22 -07:00
Jacob Hoffman-Andrews	68d5cc3331	Restore gRPC metrics (#3265 ) The go-grpc-prometheus package by default registers its metrics with Prometheus' global registry. In #3167, when we stopped using the global registry, we accidentally lost our gRPC metrics. This change adds them back. Specifically, it adds two convenience functions, one for clients and one for servers, that makes the necessary metrics object and registers it. We run these in the main function of each server. I considered adding these as part of StatsAndLogging, but the corresponding ClientMetrics and ServerMetrics objects (defined by go-grpc-prometheus) need to be subsequently made available during construction of the gRPC clients and servers. We could add them as fields on Scope, but this seemed like a little too much tight coupling. Also, update go-grpc-prometheus to get the necessary methods. ``` $ go test github.com/grpc-ecosystem/go-grpc-prometheus/... ok github.com/grpc-ecosystem/go-grpc-prometheus 0.069s ? github.com/grpc-ecosystem/go-grpc-prometheus/examples/testproto [no test files] ```	2017-12-07 15:44:55 -08:00
Roland Bracewell Shoemaker	e91349217e	Switch to using go 1.9 (#3047 ) * Switch to using go 1.9 * Regenerate with 1.9 * Manually fix import path... * Upgrade mockgen and regenerate * Update github.com/golang/mock	2017-09-06 16:30:13 -04:00
Roland Bracewell Shoemaker	f193137405	Remove superfluous gRPC error encodings (#3048 ) Follow up from #3041 Fixes #2589	2017-09-06 12:38:10 -07:00
Jacob Hoffman-Andrews	18f15b2b3d	Remove unused error types (#3041 ) * Remove all of the errors under core. Their purpose is now served by errors, and they were almost entirely unused. The remaining uses were switched to errors. * Remove errors.NotSupportedError. It was used in only one place (ca.go), and that usage is more appropriately a ServerInternal error.	2017-09-05 16:51:32 -07:00
Jacob Hoffman-Andrews	d542960a35	Remove statsd version of RPC stats (#2693 ) * Remove statsd-style RPC stats. * Remove tests for old code.	2017-04-25 10:10:35 -04:00
Roland Bracewell Shoemaker	e2b2511898	Overhaul internal error usage (#2583 ) This patch removes all usages of the `core.XXXError` and almost all usages of `probs` outside of the WFE and VA and replaces them with a unified internal error type. Since the VA uses `probs.ProblemDetails` quite extensively in challenges, and currently stores them in the DB I've saved this change for another change (it'll also require a migration). Since `ProblemDetails` should only ever be exposed to end-users all of its related logic should be moved into the `WFE` but since it still needs to be exposed to the VA and SA I've left it in place for now. The new internal `errors` package offers the same convenience functions as `probs` does as well as a new simpler type testing method. A few small changes have also been made to error messages, mainly adding the library and function name to internal server errors for easier debugging (i.e. where a number of functions return the exact same errors and there is no other way to distinguish which method threw the error). Also adds proper encoding of internal errors transferred over gRPC (the current encoding scheme is kept for `core` and `probs` errors since it'll be ideally be removed after we deploy this and follow-up changes) using `grpc/metadata` instead of the gRPC status codes. Fixes #2507. Updates #2254 and #2505.	2017-03-22 23:27:31 -07:00
Roland Bracewell Shoemaker	7d7adabe44	Allow probs.ProblemDetails to be passed across gRPC layer (#2506 ) Currently services will pass both `core.XXXError` and `probs.XXX` type errors across the gRPC layer. In the future (#2505) we intend to stop passing `probs.XXX` type errors across this layer but for now we need to support them until that change is landed. This patch takes the easiest path to allow this by encoding the `probs.ProblemDetails` to JSON and storing it in the gRPC error body so that it can be passed around. Fixes #2497.	2017-01-19 14:59:44 -08:00

24 Commits