vllm/examples/online_serving/disaggregated_serving

Disaggregated Serving

This example contains scripts that demonstrate the disaggregated serving features of vLLM.

Files

  • disagg_proxy_demo.py - Proxy demo for XpYd serving (X prefill instances, Y decode instances).
  • kv_events.sh - Demonstrates KV cache event publishing.
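
To make the XpYd notation concrete, here is a minimal sketch of how a proxy might pair each incoming request with one prefill instance and one decode instance. It is an illustration only and is not taken from disagg_proxy_demo.py: the class name XpYdRouter, the localhost ports, and the round-robin policy are all assumptions, and the real demo may schedule requests differently.

```python
from itertools import cycle


class XpYdRouter:
    """Toy round-robin router over X prefill and Y decode endpoints.

    Hypothetical helper for illustration; not part of vLLM.
    """

    def __init__(self, prefill_urls, decode_urls):
        if not prefill_urls or not decode_urls:
            raise ValueError("need at least one prefill and one decode endpoint")
        # cycle() yields the endpoints in order, wrapping around forever.
        self._prefill = cycle(prefill_urls)
        self._decode = cycle(decode_urls)

    def route(self):
        """Return the (prefill, decode) endpoint pair for the next request."""
        return next(self._prefill), next(self._decode)


# A 2P1D layout: two prefill instances, one decode instance (assumed ports).
router = XpYdRouter(
    prefill_urls=["http://localhost:8100", "http://localhost:8101"],
    decode_urls=["http://localhost:8200"],
)
pairs = [router.route() for _ in range(3)]
for prefill, decode in pairs:
    print(f"prefill on {prefill}, decode on {decode}")
```

In this sketch the prefill and decode pools rotate independently, so X and Y can be scaled separately. That is the point of the XpYd naming.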