A Transformers-compatible library for applying a range of compression algorithms to LLMs, optimizing them for deployment with vLLM
Updated 2025-08-26 01:48:08 +08:00
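This entry appears to be vllm-project/llm-compressor. Below is a minimal one-shot compression sketch under that assumption; the model, dataset, and recipe values are illustrative placeholders, and import paths can vary between library versions:

```python
# Sketch of one-shot post-training compression, assuming this entry is
# vllm-project/llm-compressor; model, dataset, and recipe values are
# illustrative, and import paths may differ across versions.
from llmcompressor import oneshot
from llmcompressor.modifiers.quantization import GPTQModifier
from llmcompressor.modifiers.smoothquant import SmoothQuantModifier

# Recipe: smooth activation outliers, then apply GPTQ INT8 quantization
# to the Linear layers, leaving the output head in full precision.
recipe = [
    SmoothQuantModifier(smoothing_strength=0.8),
    GPTQModifier(scheme="W8A8", targets="Linear", ignore=["lm_head"]),
]

# One-shot calibration pass; the saved directory can then be served by vLLM.
oneshot(
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
    dataset="open_platypus",
    recipe=recipe,
    output_dir="TinyLlama-1.1B-Chat-v1.0-INT8",
    max_seq_length=2048,
    num_calibration_samples=512,
)
```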
Work with LLMs in a local environment using containers
Updated 2025-08-25 23:46:14 +08:00
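The entry above describes a container-based local LLM runner (possibly RamaLama). A hedged end-to-end sketch under that assumption; the command name, flags, port, model reference, and endpoint below are all assumptions rather than a documented interface:

```python
# Hedged sketch: start a containerized local model server and query it.
# Assumes a CLI named `ramalama` whose `serve` subcommand exposes an
# OpenAI-compatible HTTP endpoint; command, flags, port, and model name
# are placeholders, not confirmed by this listing.
import json
import subprocess
import time
import urllib.error
import urllib.request

server = subprocess.Popen(["ramalama", "serve", "--port", "8080", "tinyllama"])
try:
    payload = json.dumps(
        {"messages": [{"role": "user", "content": "Say hello."}]}
    ).encode()
    req = urllib.request.Request(
        "http://localhost:8080/v1/chat/completions",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    # Poll until the container and model are up (the first pull can be slow).
    for _ in range(60):
        try:
            with urllib.request.urlopen(req) as resp:
                print(json.load(resp)["choices"][0]["message"]["content"])
                break
        except (urllib.error.URLError, ConnectionError):
            time.sleep(2)
finally:
    server.terminate()
```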
A high-throughput and memory-efficient inference and serving engine for LLMs (a minimal usage sketch follows the topic list below)
Topics: amd, cuda, deepseek, gpt, hpu, inference, inferentia, llama, llm, llm-serving, llmops, mlops, model-serving, pytorch, qwen, rocm, tpu, trainium, transformer, xpu
Updated 2025-08-01 01:35:07 +08:00
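Given the description and topic tags, this entry is almost certainly vllm-project/vllm. A minimal offline-inference sketch using its Python API; the checkpoint name and sampling values are placeholders:

```python
from vllm import LLM, SamplingParams

# Small placeholder checkpoint; any Hugging Face-compatible causal LM works.
llm = LLM(model="facebook/opt-125m")

# Standard sampling knobs; the values here are illustrative only.
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# vLLM batches and schedules prompts internally (continuous batching over
# paged KV-cache memory), which is where its throughput advantage comes from.
outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```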