minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
parent c96ab351cc
commit a143f260b1
@@ -66,7 +66,7 @@ vLLM V1 introduces a clean and efficient architecture for tensor-parallel infere

 <p align="center">
 <picture>
-<img src="/assets/figures/v1/persistent_batch.png" width="80%">
+<img src="/assets/figures/v1/persistent_batch.png" width="50%">
 </picture>
 </p>
