boulder

Commit Graph

Author	SHA1	Message	Date
Jacob Hoffman-Andrews	824417f6c0	sa: refactor db initialization (#6930 ) Previously, we had three chained calls initializing a database: - InitWrappedDb calls NewDbMap - NewDbMap calls NewDbMapFromConfig Since all three are exporetd, this left me wondering when to call one vs the others. It turns out that NewDbMap is only called from tests, so I renamed it to DBMapForTest to make that clear. NewDbMapFromConfig is only called internally to the SA, so I made it unexported it as newDbMapFromMysqlConfig. Also, I copied the ParseDSN call into InitWrappedDb, so it doesn't need to call DBMapForTest. Now InitWrappedDb and DBMapForTest both independently call newDbMapFromMysqlConfig. I also noticed that InitDBMetrics was only called internally so I unexported it.	2023-06-13 10:15:40 -07:00
Aaron Gable	27de4befb9	Don't panic on duplicate db metrics (#6247 ) Use `prometheus.Register` instead of `.MustRegister` when setting up database metrics. This allows us to capture and return up the call-chain any errors encountered during metric initialization. While this does not actually help mitigate or prevent any future duplicate-metrics errors, it will make them easier to debug by surfacing them as well-formed errors rather than simply stack-traces. Fixes #6150	2022-07-23 11:11:15 -07:00
Samantha	99502b1ffb	oscp-updater: use rows.Scan() to get query results (#5656 ) - Replace `gorp.DbMap` with calls that use `sql.DB` directly - Use `rows.Scan()` and `rows.Next()` to get query results (which opens the door to streaming the results) - Export function `CertStatusMetadataFields` from `SA` - Add new function `ScanCertStatusRow` to `SA` - Add new function `NewDbSettingsFromDBConfig` to `SA` Fixes #5642 Part Of #5715	2021-10-18 10:33:09 -07:00
J.C. Jones	7b31bdb30a	Add read-only dbConns to SQLStorageAuthority and OCSPUpdater (#5555 ) This changeset adds a second DB connect string for the SA for use in read-only queries that are not themselves dependencies for read-write queries. In other words, this is attempting to only catch things like rate-limit `SELECT`s and other coarse-counting, so we can potentially move those read queries off the read-write primary database. It also adds a second DB connect string to the OCSP Updater. This is a little trickier, as the subsequent `UPDATE`s _are_ dependent on the output of the `SELECT`, but in this case it's operating on data batches, and a few seconds' replication latency are several orders of magnitude below the threshold for update frequency, so any certificates that aren't caught on run `n` can be caught on run `n+1`. Since we export DB metrics to Prometheus, this also refactors `InitDBMetrics` to take a DB Address (host:port tuple) and User out of the DB connection DSN and include those as labels in the metrics. Fixes #5550 Fixes #4985	2021-08-02 11:21:34 -07:00
Samantha	e0510056cc	Enhancements to SQL driver tuning via JSON config (#5235 ) Historically the only database/sql driver setting exposed via JSON config was maxDBConns. This change adds support for maxIdleConns, connMaxLifetime, connMaxIdleTime, and renames maxDBConns to maxOpenConns. The addition of these settings will give our SRE team a convenient method for tuning the reuse/closure of database connections. A new struct, DBSettings, has been added to SA. The struct, and each of it's fields has been commented. All new fields have been plumbed through to the relevant Boulder components and exported as Prometheus metrics. Tests have been added/modified to ensure that the fields are being set. There should be no loss in coverage Deployability concerns for the migration from maxDBConns to maxOpenConns have been addressed with the temporary addition of the helper method cmd.DBConfig.GetMaxOpenConns(). This method can be removed once test/config is defaulted to using maxOpenConns. Relevant sections of the code have TODOs added that link back to an newly opened issue. Fixes #5199	2021-01-25 15:34:55 -08:00
Roland Bracewell Shoemaker	5b2f11e07e	Switch away from old style statsd metrics wrappers (#4606 ) In a handful of places I've nuked old stats which are not used in any alerts or dashboards as they either duplicate other stats or don't provide much insight/have never actually been used. If we feel like we need them again in the future it's trivial to add them back. There aren't many dashboards that rely on old statsd style metrics, but a few will need to be updated when this change is deployed. There are also a few cases where prometheus labels have been changed from camel to snake case, dashboards that use these will also need to be updated. As far as I can tell no alerts are impacted by this change. Fixes #4591.	2019-12-18 11:08:25 -05:00
Daniel McCarney	e9e15c9a83	deps: update to prometheus/client_golang 1.2.1 (#4601 ) * cmd: update prometheus.NewProcessCollector args. There's a new struct `prometheus.ProcessCollectorOpts` that is expected to be used as the sole argument to `prometheus.NewProcessCollector`. We don't need to specify `os.Getpid` as the `PidFn` of the struct because the default is to assume `os.Getpid`. Similarly we don't need to set the namespace to `""` explicitly, it is the default. * SA: reimplement db metrics as custom collector. The modern Prometheus golang API supports translating between legacy metric sources on the fly with a custom collector. We can use this approach to collect the metrics from `gorp.DbMap`'s via the `sql.DB` type's `Stats` function and the returned `sql.DbStats` struct. This is a cleaner solution overall (we can lose the DB metrics updating go routine) and it avoids the need to use the now-removed `Set` method of the `prometheus.Counter` type. * test: Update CountHistogramSamples. The `With` function of `prometheus.HistogramVec` types we tend to use as the argument to `test.CountHistogramSamples` changed to return a `prometheus.Observer`. Since we only use this function in test contexts, and only with things that cast back to a `prometheus.Histogram` we take that approach to fix the problem without updating call-sites.	2019-12-06 16:14:50 -05:00
Daniel McCarney	1c9ece3f44	SA: use wrapped database maps/transactions. (#4585 ) New types and related infrastructure are added to the `db` package to allow wrapping gorp DbMaps and Transactions. The wrapped versions return a special `db.ErrDatabaseOp` error type when errors occur. The new error type includes additional information such as the operation that failed and the related table. Where possible we determine the table based on the types of the gorp function arguments. Where that isn't possible (e.g. with raw SQL queries) we try to use a simple regexp approach to find the table name. This isn't great for general SQL but works well enough for Boulder's existing SQL queries. To get additional confidence my regexps work for all of Boulder's queries I temporarily changed the `db` package's `tableFromQuery` function to panic if the table couldn't be determined. I re-ran the full unit and integration test suites with this configuration and saw no panics. Resolves https://github.com/letsencrypt/boulder/issues/4559	2019-12-04 13:03:09 -05:00
Daniel McCarney	0ecdf80709	SA: refactor DB stat collection & collect more stats. (#4096 ) Go 1.11+ updated the `sql.DBStats` struct with new fields that are of interest to us. This PR routes these stats to Prometheus by replacing the existing autoprom stats code with new first-class Prometheus metrics. Resolves https://github.com/letsencrypt/boulder/issues/4095 The `max_db_connections` stat from the SA is removed because the Go 1.11+ `sql.DBStats.MaxOpenConnections` field will give us a better view of the same information. The autoprom "reused_authz" stat that was being incremented in `SA.GetPendingAuthorization` was also removed. It wasn't doing what it says it was (counting reused authorizations) and was instead counting the number of times `GetPendingAuthorization` returned an authz.	2019-03-06 17:08:53 -08:00

9 Commits