Distributed ML Training and Fine-Tuning on Kubernetes
kubernetes
huggingface
ai
llm
gpu
jax
kubeflow
distributed
xgboost
machine-learning
mlops
python
pytorch
tensorflow
fine-tuning
Updated 2025-08-30 00:22:24 +08:00
Automated Machine Learning on Kubernetes
kubernetes
machine-learning
kubeflow
ai
tensorflow
huggingface
llm
mlops
jax
pytorch
hyperparameter-tuning
neural-architecture-search
automl
scikit-learn
Updated 2025-08-14 22:47:26 +08:00
Example directory of Kubernetes YAML and Quadlets tested with Podman
Updated 2025-08-12 15:31:35 +08:00
Machine Learning Pipelines for Kubeflow
Updated 2025-08-07 01:15:18 +08:00
A high-throughput and memory-efficient inference and serving engine for LLMs
llm
mlops
pytorch
cuda
inference
llama
llm-serving
llmops
model-serving
qwen
rocm
tpu
trainium
transformer
amd
xpu
deepseek
gpt
hpu
inferentia
Updated 2025-08-01 01:35:07 +08:00
Community maintained hardware plugin for vLLM on Ascend
Updated 2025-07-20 16:30:47 +08:00
OCI Artifact for ML model & metadata
Updated 2025-07-20 14:08:39 +08:00
a script to run docker-compose.yml using podman
Updated 2025-07-07 22:30:09 +08:00