Move image to assets

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
This commit is contained in:
Yuan Tang 2025-01-25 11:15:15 -05:00
parent 5ea205d8d8
commit 4b3bc6dc25
No known key found for this signature in database
2 changed files with 4 additions and 2 deletions

View File

@ -2,14 +2,16 @@
layout: post layout: post
title: "Introducing vLLM Inference Provider in Llama Stack" title: "Introducing vLLM Inference Provider in Llama Stack"
author: "Yuan Tang (Red Hat) and Ashwin Bharambe (Meta)" author: "Yuan Tang (Red Hat) and Ashwin Bharambe (Meta)"
image: /assets/logos/vllm-logo-only-light.png image: /assets/figures/llama-stack/llama-stack.png
--- ---
We are excited to announce that vLLM inference provider is now available in [Llama Stack](https://github.com/meta-llama/llama-stack) through the collaboration between the Red Hat AI Engineering team and the Llama Stack team from Meta. This article provides an introduction to this integration and a tutorial to help you get started using it locally or deploying it in a Kubernetes cluster. We are excited to announce that vLLM inference provider is now available in [Llama Stack](https://github.com/meta-llama/llama-stack) through the collaboration between the Red Hat AI Engineering team and the Llama Stack team from Meta. This article provides an introduction to this integration and a tutorial to help you get started using it locally or deploying it in a Kubernetes cluster.
# What is Llama Stack? # What is Llama Stack?
<img align="right" src="https://llama-stack.readthedocs.io/en/latest/_images/llama-stack.png" alt="llama-stack-diagram" width="50%" height="50%"> <div align="center">
<img src="/assets/figures/llama-stack/llama-stack.png" alt="Icon" style="width: 60%; vertical-align:middle;">
</div>
Llama Stack defines and standardizes the set of core building blocks needed to bring generative AI applications to market. These building blocks are presented in the form of interoperable APIs with a broad set of Service Providers providing their implementations. Llama Stack defines and standardizes the set of core building blocks needed to bring generative AI applications to market. These building blocks are presented in the form of interoperable APIs with a broad set of Service Providers providing their implementations.

Binary file not shown.

After

Width:  |  Height:  |  Size: 118 KiB