moved SLO metrics to the recommender field and added metric name

2023-04-05 13:40:35 -05:00 · 2023-04-05 13:40:35 -05:00 · 9fd214d681
parent b65e03db66
commit 9fd214d681
1 changed files with 11 additions and 6 deletions
--- a/multidimensional-pod-autoscaler/AEP.md
+++ b/multidimensional-pod-autoscaler/AEP.md
@ -217,8 +217,6 @@ spec:
  constraints:
    minReplicas: min-num-replicas
    maxReplicas: max-num-replicas
-    applicationSLO: slo-value  # customizable SLO for application metrics such as latency or throughput
-    percentageSLO: percentage  # a percentage used in SLO definitions such as 99% requests <= slo-value
  resourcePolicy:
    containerControlledResources: [ memory, cpu ]  # Added cpu here as well
    container:
@ -233,6 +231,13 @@ spec:
  # Define the recommender to use here
  recommenders:
  - name: my-recommender
+  - customMetrics:  # optional, user-defined SLO for application metrics such as latency or throughput
+    - type: Pods
+      pods:
+        metric:
+          name: myapp-custom-metric-name
+        targetSLO: slo-value
+        percentile: slo-percentile  # a percentile used in SLO definitions such as p99 latency <= slo-value
 # MultidimPodAutoscalerStatus
 status:
  lastScaleTime: timestamp
@ -255,10 +260,10 @@ status:
    value: metric-value
 ```

-Note that `applicationSLO` field is **optional** and SLO is defined to be the quality of service target that an application must meet (regarding latency, throughput, and so on).
-For example, if the latency SLO is in use, then it could be 99% of the requests finish within 100ms (i.e., `applicationSLO = 100` and `percentageSLO = 0.99`). Accordingly, the replica set can be horizontally scaled when the measured latency is greater than 100ms, i.e., violating the SLO value. Similarly, throughput SLOs can be defined as throughput is greater than 100/s in 90% of the time, i.e., `applicationSLO = 100` and `percentageSLO = 0.9`.
-The default MPA recommender implemented in this AEP will not use the `applicationSLO` field and this field will be used by users who want to implement their own recommender. For example, an RL/ML-based recommender can have `applicationSLO` as part of the reward/loss function and thus they can have extra constraints in addition to min/max replicas.
-The `applicationSLO` field is a floating point number (most application metrics like latency and throughput are floating point numbers). The `percentageSLO` is a floating point number between 0 and 1.
+Note that `customMetrics` field under `recommenders` is **optional** and SLO is defined to be the quality of service target that an application must meet (regarding latency, throughput, and so on).
+For example, if the latency SLO is in use for metric name `myapp-custom-metric-name`, then it could be 99% of the requests finish within 100ms (i.e., `targetSLO = 100` and `percentile = 99`). Accordingly, the replica set can be horizontally scaled when the measured latency is greater than 100ms, i.e., violating the SLO value. Similarly, throughput SLOs can be defined as throughput is greater than 100/s in 90% of the time, i.e., `targetSLO = 100` and `percentile = 90`.
+The default MPA recommender implemented in this AEP will not use the `customMetrics` field and this field will be used by users who want to implement their own recommender. For example, an RL/ML-based recommender can have `customMetrics` as part of the reward/loss function and thus they can have extra constraints in addition to min/max replicas.
+The `targetSLO` field is a floating point number (most application metrics like latency and throughput are floating point numbers) while the `percentile` is an integer between 0 and 100.

 ### Test Plan