boulder

Commit Graph

Author	SHA1	Message	Date
Aaron Gable	18389c9024	Remove dead code (#5893 ) Running an older version (v0.0.1-2020.1.4) of `staticcheck` in whole-program mode (`staticcheck --unused.whole-program=true -- ./...`) finds various instances of unused code which don't normally show up as CI issues. I've used this to find and remove a large chunk of the unused code, to pave the way for additional large deletions accompanying the WFE1 removal. Part of #5681	2022-01-19 12:23:06 -08:00
Aaron Gable	00af222568	Remove cmd.FailOnError's raw stderr output (#5844 ) Currently, `cmd.FailOnError` both audit-logs the message and error, and prints it directly to stderr. It does both because originally it only printed to stderr, and the audit logging capability was added later. Since audit logs are printed to stderr anyway, and since printing to stderr without going through the logger produces lines of output that violate the log-validator's expected checksums, remove the now-redundant print. Fixes #5790	2021-12-09 14:24:41 -08:00
Jacob Hoffman-Andrews	d3c027c93d	Fix log filtering for grpc errors. (#5712 ) We had in place some filtering for grpc errors that we consider spurious, but that filtering was broken. This change ensures the filtering gets called regardless of which of the various error/warning methods grpc calls. This removes a lot of unnecessary red from our integration test output.	2021-10-14 16:45:16 -07:00
Aaron Gable	e0c3e2c1df	Reject unrecognized config keys (#5649 ) Instead of using the default `json.Unmarshal`, explicitly construct and use a `json.Decoder` so that we can set the `DisallowUnknownFields` flag on the decoder. This causes any unrecognized config keys to result in errors at boulder startup time. Fixes #5643	2021-09-24 10:13:44 -07:00
Aaron Gable	bfd3f83717	Remove CFSSL issuance path (#5347 ) Make the `NonCFSSLSigner` code path the only code path through the CA. Remove all code related to the old, CFSSL-based code path. Update tests to supply (or mock) issuers of the new kind. Remove or simplify a few tests that were testing for behavior only exhibited by the old code path, such as incrementing certain metrics. Remove code from `//cmd/` for initializing the CFSSL library. Finally, mark the `NonCFSSLSigner` feature flag itself as deprecated. Delete the portions of the vendored CFSSL code which were only used by these deleted code paths. This does not remove the CFSSL library entirely, the rest of the cleanup will follow shortly. Part of #5115	2021-03-18 16:39:52 -07:00
Aaron Gable	ebba443cad	Remove cmd.LoadCert in favor of core.LoadCert (#5165 ) Having both of these very similar methods sitting around only serves to increase confusion. This removes the last few places which use `cmd.LoadCert` and replaces them with `core.LoadCert`, and deletes the method itself. Fixes #5163	2020-11-10 13:00:46 -08:00
Jacob Hoffman-Andrews	5a3daf448c	cmd: use Fprintln instead of Fprint for Fail. (#5069 )	2020-09-01 17:42:11 -07:00
Jacob Hoffman-Andrews	cb06fe8e13	log: Remove trailing newlines and escape internal newlines. (#4925 ) Fixes #4914.	2020-07-06 14:17:23 -07:00
Jacob Hoffman-Andrews	485f1ee0ba	Add periodic timestamp logging. (#4858 ) Fixes #4827	2020-06-10 11:24:37 -07:00
Jacob Hoffman-Andrews	324aaa0571	Intercept stdlib logger (try 2). (#4796 ) This builds on #4665 and #4781. The problem we had previously was that we were relying on a goroutine to consume bytes from a pipe in a non-blocking manner, which meant that log.Fatal would cause us to exit before writing out the data. This version implements an io.Writer so we can make sure the log line gets written in a blocking manner.	2020-04-27 11:21:43 -07:00
Jacob Hoffman-Andrews	91aa272354	Revert #4665 : "Capture output from stdlib `log` library" (#4781 ) The problem with this approach is that there is no way to guarantee the output is copied to syslog / stdout before shutdown. This is particularly evident when `log.Fatal` is used, because that calls `os.Exit` immediately after `l.Output`, creating a race condition where the log line might or might not get printed before the program exits. Reverting this change means that in case some component does call `log.Fatal` we'll still get the output from stdout. This also changes one instance in cmd/shell.go where we call `log.Fatal` to use `logger.Errf`.	2020-04-16 20:00:47 -07:00
Jacob Hoffman-Andrews	2d7337dcd0	Remove newlines from log messages. (#4777 ) Since Boulder's log system adds checksums to lines, but log-validator processes entries on a per-line basis, including newlines in log messages can cause a validation failure.	2020-04-16 16:49:08 -07:00
Jacob Hoffman-Andrews	bef02e782a	Fix nits found by staticcheck (#4726 ) Part of #4700	2020-03-30 10:20:20 -07:00
Jacob Hoffman-Andrews	13a0bb32f1	Capture output from stdlib `log` library. (#4665 ) Some components, particularly net/http, occasionally output log lines via log.Print. We'd like to capture these and send them to rsyslog so all our log data goes to the same place, and so that we can attach log line checksums to them. This uses log.SetOutput to change the log output to an io.Pipe, then consumes that buffer line-by-line in a goroutine and sends it to our rsyslog logger. This seems to tickle an unrelated race condition in test/ocsp/helper.go, so I fixed that too. Also filters out a noisy and unimportant error from the grpcLog handler. Fixes #4664 Fixes #4628	2020-02-05 09:28:38 -08:00
Roland Bracewell Shoemaker	5b2f11e07e	Switch away from old style statsd metrics wrappers (#4606 ) In a handful of places I've nuked old stats which are not used in any alerts or dashboards as they either duplicate other stats or don't provide much insight/have never actually been used. If we feel like we need them again in the future it's trivial to add them back. There aren't many dashboards that rely on old statsd style metrics, but a few will need to be updated when this change is deployed. There are also a few cases where prometheus labels have been changed from camel to snake case, dashboards that use these will also need to be updated. As far as I can tell no alerts are impacted by this change. Fixes #4591.	2019-12-18 11:08:25 -05:00
Daniel McCarney	e9e15c9a83	deps: update to prometheus/client_golang 1.2.1 (#4601 ) * cmd: update prometheus.NewProcessCollector args. There's a new struct `prometheus.ProcessCollectorOpts` that is expected to be used as the sole argument to `prometheus.NewProcessCollector`. We don't need to specify `os.Getpid` as the `PidFn` of the struct because the default is to assume `os.Getpid`. Similarly we don't need to set the namespace to `""` explicitly, it is the default. * SA: reimplement db metrics as custom collector. The modern Prometheus golang API supports translating between legacy metric sources on the fly with a custom collector. We can use this approach to collect the metrics from `gorp.DbMap`'s via the `sql.DB` type's `Stats` function and the returned `sql.DbStats` struct. This is a cleaner solution overall (we can lose the DB metrics updating go routine) and it avoids the need to use the now-removed `Set` method of the `prometheus.Counter` type. * test: Update CountHistogramSamples. The `With` function of `prometheus.HistogramVec` types we tend to use as the argument to `test.CountHistogramSamples` changed to return a `prometheus.Observer`. Since we only use this function in test contexts, and only with things that cast back to a `prometheus.Histogram` we take that approach to fix the problem without updating call-sites.	2019-12-06 16:14:50 -05:00
Daniel McCarney	117df57e8c	cmd: remove stale package comment. (#4488 ) The idea expressed in this comment isn't representative of the Boulder cmds. E.g. There's no top level "App Shell" in use and the `NewAppShell`, `Action` and `Run` functions ref'd do not exist.	2019-10-17 13:40:32 -04:00
Jacob Hoffman-Andrews	e20eb6271d	Suppress "transport is closing" errors. (#4394 ) These errors show up in the Publisher at shutdown during integration test runs, because the Publisher is trying to write responses from RPCs that were slow due to the ct-test-srv's LatencySchedule. This specifically happens only for the optional submission of "final" certificates.	2019-08-07 13:39:53 -07:00
Roland Bracewell Shoemaker	751e3b1704	cmd: Set CFSSL log level to debug (#4393 )	2019-08-07 14:30:42 -04:00
Jacob Hoffman-Andrews	ba5a5a5ac9	cmd: Log less from gRPC, no INFO level. (#4367 ) The gRPC INFO log lines clutter up integration test output, and we've never had a use for them in production (they are mostly about details of connection status).	2019-07-26 10:02:34 -04:00
Roland Bracewell Shoemaker	876c727b6f	Update gRPC (#3817 ) Fixes #3474.	2018-08-20 10:55:42 -04:00
Joel Sing	8ebdfc60b6	Provide formatting logger functions. (#3699 ) A very large number of the logger calls are of the form log.Function(fmt.Sprintf(...)). Rather than sprinkling fmt.Sprintf at every logger call site, provide formatting versions of the logger functions and call these directly with the format and arguments. While here remove some unnecessary trailing newlines and calls to String/Error.	2018-05-10 11:06:29 -07:00
Daniel McCarney	299e53b237	RA,CA: Refuse to start with MaxNames == 0. (#3634 ) This commit updates the `boulder-ra` and `boulder-ca` commands to refuse to start if their configured `MaxNames` is 0 (the default value). This should always be set to a positive number. This commit also updates `csr/csr.go` to always apply the max names check since it will never be 0 after the change above. Also refactor `FailOnError` to pull out a separate `Fail` function. Related to https://github.com/letsencrypt/boulder/issues/3632	2018-04-10 10:53:23 -07:00
Jacob Hoffman-Andrews	5c4f5e346a	Fix pprof handlers. (#3533 ) Some of the pprof handlers have to be accessed through pprof.Handler("string"), while some have to be accessed through an exported var in pprof. We weren't doing the latter before, which meant some key handlers like Profile weren't available.	2018-03-08 18:18:13 +00:00
Jacob Hoffman-Andrews	3d9b3d4d20	Restore expvar handler. (#3209 ) In #3167 I removed expvar, thinking it was unused, but it turns out the RA exports the last issuance time, and core/util.go has a function to export BuildID, both of which are used in monitoring. This wasn't caught at compile time because the global expvar package was happy to register the exports even though there was no handler to serve them.	2017-11-02 07:05:54 -07:00
Jacob Hoffman-Andrews	6cd777bd8d	Fix up stats after #3167 (#3185 ) There were two bugs in #3167: All process-level stats got prefixed with "boulder", which broke dashboards. All request_time stats got dropped, because measured_http was using the prometheus DefaultRegisterer. To fix, this PR plumbs through a scope object to measured_http, and uses an empty prefix when calling NewProcessCollector().	2017-10-18 11:14:59 -07:00
Jacob Hoffman-Andrews	f366e45756	Remove global state from metrics gathering (#3167 ) Previously, we used prometheus.DefaultRegisterer to register our stats, which uses global state to export its HTTP stats. We also used net/http/pprof's behavior of registering to the default global HTTP ServeMux, via DebugServer, which starts an HTTP server that uses that global ServeMux. In this change, I merge DebugServer's functions into StatsAndLogging. StatsAndLogging now takes an address parameter and fires off an HTTP server in a goroutine. That HTTP server is newly defined, and doesn't use DefaultServeMux. On it is registered the Prometheus stats handler, and handlers for the various pprof traces. In the process I split StatsAndLogging internally into two functions: makeStats and MakeLogger. I didn't port across the expvar variable exporting, which serves a similar function to Prometheus stats but which we never use. One nice immediate effect of this change: Since StatsAndLogging now requires an address, I noticed a bunch of commands that called StatsAndLogging, and passed around the resulting Scope, but never made use of it because they didn't run a DebugServer. Under the old StatsD world, these commands still could have exported their stats by pushing, but since we moved to Prometheus their stats stopped being collected. We haven't used any of these stats, so instead of adding debug ports to all short-lived commands, or setting up a push gateway, I simply removed them and switched those commands to initialize only a Logger, no stats.	2017-10-13 11:58:01 -07:00
Jacob Hoffman-Andrews	0a72f768a7	Remove ProfileCmd. (#3166 ) These stats are now all collected by Prometheus.	2017-10-13 10:02:04 -04:00
Jacob Hoffman-Andrews	4128e0d95a	Add time-dependent integration testing (#3060 ) Fixes #3020. In order to write integration tests for some features, especially related to rate limiting, rechecking of CAA, and expiration of authzs, orders, and certs, we need to be able to fake the passage of time in integration tests. To do so, this change switches out all clock.Default() instances for cmd.Clock(), which can be set manually with the FAKECLOCK environment variable. integration-test.py now starts up all servers once before the main body of tests, with FAKECLOCK set to a date 70 days ago, and does some initial setup for a new integration test case. That test case tries to fetch a 70-day-old authz URL, and expects it to 404. In order to make this work, I also had to change a number of our test binaries to shut down cleanly in response to SIGTERM. Without that change, stopping the servers between the setup phase and the main tests caused startservers.check() to fail, because some processes exited with nonzero status. Note: This is an initial stab at things, to prove out the technique. Long-term, I think we will want to use an idiom where test cases are classes that have a number of optional setup phases that may be run at e.g. 70 days prior and 5 days prior. This could help us avoid a proliferation of global state as we add more time-dependent test cases.	2017-09-13 12:34:14 -07:00
Jacob Hoffman-Andrews	20ec1e3e4e	Filter spurious shutdown errors. (#3052 ) Previously, we would produce an error and a nonzero status code on shutdown, because gRPC's GracefulStop would cause s.Serve() to return an error. Now we filter that specific error and treat it as success. This also allows us to kill processes with SIGTERM instead of SIGKILL in integration tests. Fixes #2410.	2017-09-07 13:45:32 -07:00
Jacob Hoffman-Andrews	63a25bf913	Remove clientName everywhere. (#2862 ) This used to be used for AMQP queue names. Now that AMQP is gone, these consts were only used when printing a version string at startup. This changes VersionString to just use the name of the current program, and removes `const clientName = ` from many of our main.go's.	2017-07-12 10:28:54 -07:00
Jacob Hoffman-Andrews	b17b5c72a6	Remove statsd from Boulder (#2752 ) This removes the config and code to output to statsd. - Change `cmd.StatsAndLogging` to output a `Scope`, not a `Statter`. - Remove the prefixing of component name (e.g. "VA") in front of stats; this was stripped by `autoProm` but now no longer needs to be. - Delete vendored statsd client. - Delete `MockStatter` (generated by gomock) and `mocks.Statter` (hand generated) in favor of mocking `metrics.Scope`, which is the interface we now use everywhere. - Remove a few unused methods on `metrics.Scope`, and update its generated mock. - Refactor `autoProm` and add `autoRegisterer`, which can be included in a `metrics.Scope`, avoiding global state. `autoProm` now registers everything with the `prometheus.Registerer` it is given. - Change va_test.go's `setup()` to not return a stats object; instead the individual tests that care about stats override `va.stats` directly. Fixes #2639, #2733.	2017-05-15 10:19:54 -04:00
Jacob Hoffman-Andrews	d9b53cd103	Set gRPC logs to go through syslog. (#2403 ) StatsAndLogging is called early enough in each program that it precedes any gRPC setup code that might need SetLogger already to have been set. Fixes #2383	2016-12-08 15:25:31 -08:00
Daniel	c96c8a648f	Removes `cmd.Version()`. The `Version()` function is a less useful alternative to `VersionString()` and isn't necessary. It used a fixed "0.1.0" prefix that doesn't match a release tag, it also doesn't print Go & host information that `VersionString()` does. Less code = less bugs!	2016-11-26 16:59:45 -05:00
Roland Bracewell Shoemaker	595204b23f	Implement improved signal catching in services that already use it (#2333 ) Implements a less RPC focused signal catch/shutdown method. Certain things that probably could also use this (i.e. `ocsp-updater`) haven't been given it as they would require rather substantial changes to allow for a graceful shutdown approach. Fixes #2298.	2016-11-18 21:05:04 -05:00
Jacob Hoffman-Andrews	9b8b877e42	Add prometheus client. (#2293 ) This vendors the Prometheus client code, and exports metrics on the debug port, under `/metrics`. This will currently export just the default metrics, like `go_goroutines`, `process_cpu_seconds_total`, `process_open_fds`, and `process_resident_memory_bytes`. Later work will start exporting Boulder-specific metrics, but this will allow Ops to start configuring scraping of Prometheus metrics in production. Tests pass: ``` $ git diff master Godeps/ | sed -ne 's/^+.*ImportPath": "//p' | tr -d '",' | xargs go test ok github.com/beorn7/perks/quantile 0.562s ok github.com/matttproud/golang_protobuf_extensions/pbutil 0.003s ok github.com/prometheus/client_golang/prometheus 34.418s ok github.com/prometheus/client_golang/prometheus/promhttp 0.003s ? github.com/prometheus/client_model/go [no test files] ok github.com/prometheus/common/expfmt 0.019s ok github.com/prometheus/common/internal/bitbucket.org/ww/goautoneg 0.002s ok github.com/prometheus/common/model 0.003s ok github.com/prometheus/procfs 0.008s ``` Part of #2284	2016-10-28 16:13:41 -07:00
Roland Bracewell Shoemaker	239bf9ae0a	Very basic feature flag impl (#1705 ) Updates #1699. Adds a new package, `features`, which exposes methods to set and check if various internal features are enabled. The implementation uses global state to store the features so that services embedded in another service do not each require their own features map in order to check if something is enabled. Requires a `boulder-tools` image update to include `golang.org/x/tools/cmd/stringer`.	2016-09-20 16:29:01 -07:00
Roland Bracewell Shoemaker	c8f1fb3e2f	Remove direct usages of go-statsd-client in favor of using metrics.Scope (#2136 ) Fixes #2118, fixes #2082.	2016-09-07 19:35:13 -04:00
Blake Griffith	344a312905	Remove audit comments -- closes #2129 (#2139 ) Closes #2129 * Remove audit comments. * Nuke doc/requirements/*	2016-08-25 18:23:42 -07:00
Ben Irving	1a4f099899	Split up boulder-config.json (Expiration Mailer) (#2036 ) Part of #1962.	2016-07-12 15:55:52 -07:00
Jacob Hoffman-Andrews	c8723c4baa	Add back version flag for all binaries. (#2034 ) Fixes #2020	2016-07-11 09:28:01 -07:00
Ben Irving	0e2ef748b4	Split up boulder-config.json (OCSP Responder) (#2017 )	2016-07-07 14:52:08 -04:00
Ben Irving	653cc004d0	Split Boulder Config (OCSP Updater) (#2013 )	2016-07-06 10:00:52 -04:00
Ben Irving	cb45bdea67	Split up boulder-config.json (Publisher) (#2008 )	2016-07-05 13:31:30 -07:00
Ben Irving	bea8e57536	Split up boulder-config.json (VA) (#1979 )	2016-07-01 13:06:50 -04:00
Ben Irving	21e0b3bdc7	Split up boulder-config.json (CA) (#1978 )	2016-07-01 10:24:19 -04:00
Ben Irving	6162533c00	Split up boulder-config.json (SA) (#1975 ) Depends on #1973 https://github.com/letsencrypt/boulder/pull/1975	2016-06-29 15:01:49 -07:00
Ben Irving	c4f7fb580d	Split up boulder-config.json (RA) (#1974 ) Part of #1962	2016-06-29 13:43:55 -07:00
Ben Irving	6007df8f3c	Split up boulder-config.json (WFE) (#1973 ) Moves the WFE to its own config file. Each config will now belong in `test/config` and `test/config-next` analogous to `boulder-config` and `boulder-config-next`.	2016-06-28 10:40:16 -07:00
Jacob Hoffman-Andrews	4283fb5dd4	Improve syslog defaults. (#1932 ) Under the new defaults, if the syslog section is missing, we'll use the default config that we use in prod: no logs to stdout, INFO and below to syslog. This allows us to remove the syslog section from prod configs, and potentially move it to individual service configs in the future. * Improve syslog defaults. * Add stdout logging for purger test. * Use plain int for sysloglevel. * Fix JSON syntax * Fix syslog config for expired-authz-purger. https://github.com/letsencrypt/boulder/pull/1932	2016-06-17 11:26:11 -07:00
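
Commit e0c3e2c1df ("Reject unrecognized config keys") describes replacing the default `json.Unmarshal` with an explicitly constructed `json.Decoder` that has `DisallowUnknownFields` set, so that a misspelled or stale config key fails at startup. The following is a minimal sketch of that technique, not Boulder's actual code: the `exampleConfig` struct, its field names, and the `decodeStrict` helper are hypothetical and exist only for illustration.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// exampleConfig is a hypothetical config struct used only for this sketch;
// it is not one of Boulder's real config types.
type exampleConfig struct {
	MaxNames  int    `json:"maxNames"`
	DebugAddr string `json:"debugAddr"`
}

// decodeStrict parses data into an exampleConfig and returns an error if
// the JSON contains any key that does not map to a struct field.
func decodeStrict(data string) (exampleConfig, error) {
	var c exampleConfig
	dec := json.NewDecoder(strings.NewReader(data))
	// Without this call, unknown keys are silently ignored.
	dec.DisallowUnknownFields()
	err := dec.Decode(&c)
	return c, err
}

func main() {
	// "debugAdr" is a deliberate typo for "debugAddr".
	_, err := decodeStrict(`{"maxNames": 100, "debugAdr": ":8000"}`)
	if err != nil {
		fmt.Println("config rejected:", err)
	}
}
```

With plain `json.Unmarshal`, the misspelled key would be dropped without complaint and `DebugAddr` would keep its zero value; the strict decoder surfaces the mistake as an error instead.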


255 Commits
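
Commits 13a0bb32f1, 91aa272354, and 324aaa0571 trace how output from the stdlib `log` package was captured: first through an `io.Pipe` drained by a goroutine, which could lose the final line when `log.Fatal` exited the process, and then through an `io.Writer` that handles each line in a blocking manner. The sketch below illustrates the second approach under stated assumptions: `lineSink` and `captureStdlib` are hypothetical names, and the sink simply stores lines in memory where Boulder would forward them to its own logger.

```go
package main

import (
	"fmt"
	"log"
	"strings"
)

// lineSink is a hypothetical stand-in for a real logging backend.
// It records each message it is handed.
type lineSink struct {
	lines []string
}

// Write implements io.Writer. The stdlib logger calls Write synchronously
// once per message, so the line is recorded before the logging call
// returns (and therefore before log.Fatal would go on to call os.Exit).
func (s *lineSink) Write(p []byte) (int, error) {
	s.lines = append(s.lines, strings.TrimRight(string(p), "\n"))
	return len(p), nil
}

// captureStdlib sends msg through a stdlib *log.Logger whose output is a
// lineSink, and returns the lines the sink received.
func captureStdlib(msg string) []string {
	sink := &lineSink{}
	logger := log.New(sink, "", 0) // no prefix, no timestamp flags
	logger.Print(msg)
	return sink.lines
}

func main() {
	fmt.Println(captureStdlib("message from a library using stdlib log"))
}
```

To intercept the process-wide default logger rather than a private one, the same sink could be passed to `log.SetOutput`; the sketch uses `log.New` only to avoid mutating global state.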