remove the in the author
This commit is contained in:
parent
31cb4a5733
commit
73405ded17
|
@ -1,7 +1,7 @@
|
||||||
---
|
---
|
||||||
layout: post
|
layout: post
|
||||||
title: "vLLM v0.6.0: 2.7x Throughput Improvement and 5x Latency Reduction"
|
title: "vLLM v0.6.0: 2.7x Throughput Improvement and 5x Latency Reduction"
|
||||||
author: "The vLLM Team"
|
author: "vLLM Team"
|
||||||
---
|
---
|
||||||
|
|
||||||
**TL;DR:** vLLM achieves 2.7x higher throughput and 5x less TPOT (time per output token) on Llama 8B model, and 1.8x higher throughput and 2x less TPOT on Llama 70B model.
|
**TL;DR:** vLLM achieves 2.7x higher throughput and 5x less TPOT (time per output token) on Llama 8B model, and 1.8x higher throughput and 2x less TPOT on Llama 70B model.
|
||||||
|
|
Loading…
Reference in New Issue