Add snowflake to llama3.1 post
parent bd0a2e74c1
commit d39c04f6f2
@@ -80,4 +80,4 @@ To learn more about distributed inference using vLLM please refer to [this doc](
### Acknowledgements
-We would like to thank Meta for the pre-release partnership and letting us test the model. Independently from the release, we thank the following vLLM contributors for the features mentioned in this blogpost: [Neural Magic](https://neuralmagic.com/) for FP8 quantization; [CentML](https://centml.ai/) for pipeline parallelism; [Anyscale](https://www.anyscale.com/) for the chunked prefill feature. The evaluation runs on [Lambda’s 1-Click Clusters](https://lambdalabs.com/service/gpu-cloud/1-click-clusters) with InfiniBand, and we thank Lambda for the resource and the smooth cluster setup experience.
+We would like to thank Meta for the pre-release partnership and letting us test the model. Independently from the release, we thank the following vLLM contributors for the features mentioned in this blogpost: [Neural Magic](https://neuralmagic.com/) for FP8 quantization; [CentML](https://centml.ai/) and Snowflake for pipeline parallelism; [Anyscale](https://www.anyscale.com/) for the chunked prefill feature. The evaluation runs on [Lambda’s 1-Click Clusters](https://lambdalabs.com/service/gpu-cloud/1-click-clusters) with InfiniBand, and we thank Lambda for the resource and the smooth cluster setup experience.