A high-throughput and memory-efficient inference and serving engine for LLMs
Community-maintained hardware plugin for vLLM on Ascend
🤖 Discover how to apply your LLM app skills on Kubernetes!