grpc.io/content/en/docs/guides/custom-backend-metrics.md

71 lines
2.8 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: Custom Backend Metrics
description : >-
A mechanism in the gRPC library that allows users to inject custom metrics at
the gRPC server and consume at gRPC clients to make your custom load balancing
algorithms.
---
### Overview
Simple load balancing decisions can be made by taking into account local or
global knowledge of a backends load, for example CPU. More sophisticated
load balancing decisions are possible with application specific knowledge, e.g.
queue depth, or by combining multiple metrics.
The custom backend metrics feature exposes APIs to allow users to implement
the metrics feedback in their LB policies.
### Use Cases
The feature is mainly for advanced use cases where a custom LB policy is used
to route traffic more intelligently to a list of backend servers to improve the
routing performance, e.g. a weighted round robin LB policy.
gRPC traditionally allows users to plug in their own load balancing policies,
see [guide][custom load balancing guide]. For xDS users, [custom load balancer]
can be configured to select the custom LB policy.
### Metrics Reporting
Open Request Cost Aggregation ([ORCA]) is an open standard for conveying backend
metrics information. gRPC uses ORCA service and metrics standards
and supports two metrics reporting mechanisms:
* Per-query metrics reporting: the backend server attaches the injected custom
metrics in the trailing metadata when the corresponding RPC finishes. This is
typically useful for short RPCs like unary calls.
* Out-of-band metrics reporting: the backend server periodically pushes metrics
data, e.g. cpu and memory utilization, to the client. This is useful for all
situations: unary calls, long RPCs in streaming calls, or no RPCs. However,
out-of-band metrics reporting does not send query cost metrics. The metrics
emission frequency is user-configurable, and this configuration resides in the
custom load balancing policy.
The diagram shows the architecture where a user creates their own LB policy
that implements backend metrics feedback.
![gRPC backend metrics diagram](/img/backend_metrics.svg)
### Implementation
For more details, please see gRPC [proposal A51].
### Language Support
| Language | Example |
|----------|------------------|
| Java | [Java example] |
| Go | [Go example] |
| C++ | Example upcoming |
[proposal A51]: https://github.com/grpc/proposal/blob/master/A51-custom-backend-metrics.md
[ORCA]: https://github.com/cncf/xds/blob/main/xds/data/orca/v3/orca_load_report.proto
[Java example]: https://github.com/grpc/grpc-java/tree/master/examples/example-orca
[Go example]: https://github.com/grpc/grpc-go/tree/master/examples/features/orca
[custom load balancing guide]: /docs/guides/custom-load-balancing/
[custom load balancer]: https://github.com/grpc/proposal/blob/master/A52-xds-custom-lb-policies.md