mirror of https://github.com/vllm-project/vllm.git
9 lines
376 B
Markdown
9 lines
376 B
Markdown
---
|
|
title: llmaz
|
|
---
|
|
[](){ #deployment-llmaz }
|
|
|
|
[llmaz](https://github.com/InftyAI/llmaz) is an easy-to-use and advanced inference platform for large language models on Kubernetes, aimed for production use. It uses vLLM as the default model serving backend.
|
|
|
|
Please refer to the [Quick Start](https://github.com/InftyAI/llmaz?tab=readme-ov-file#quick-start) for more details.
|