Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Updated 2025-08-26 01:48:08 +08:00
Example directory of Kubernetes YAML and Quadlets tested with Podman
Updated 2025-08-12 15:31:35 +08:00
Nacos MCP wrapper Python SDK
Updated 2025-08-08 11:00:09 +08:00
Examples for building and running LLM services and applications locally with Podman
Updated 2025-06-19 16:22:59 +08:00
🤖 Discover how to apply your LLM app skills on Kubernetes!
Updated 2024-03-09 05:47:39 +08:00