Distributed ML Training and Fine-Tuning on Kubernetes
kubernetes
huggingface
ai
llm
gpu
jax
kubeflow
distributed
xgboost
machine-learning
mlops
python
pytorch
tensorflow
fine-tuning
Updated 2025-08-30 00:22:24 +08:00
Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. It fills a gap between model experimentation and production activities. It provides a central interface for all stakeholders in the MLOps lifecycle to collaborate on ML models.
Updated 2025-08-19 23:41:05 +08:00
Automated Machine Learning on Kubernetes
kubernetes
machine-learning
kubeflow
ai
tensorflow
huggingface
llm
mlops
jax
pytorch
hyperparameter-tuning
neural-architecture-search
automl
scikit-learn
Updated 2025-08-14 22:47:26 +08:00
Machine Learning Pipelines for Kubeflow
Updated 2025-08-07 01:15:18 +08:00
A high-throughput and memory-efficient inference and serving engine for LLMs
llm
mlops
pytorch
cuda
inference
llama
llm-serving
llmops
model-serving
qwen
rocm
tpu
trainium
transformer
amd
xpu
deepseek
gpt
hpu
inferentia
Updated 2025-08-01 01:35:07 +08:00
Community maintained hardware plugin for vLLM on Ascend
Updated 2025-07-20 16:30:47 +08:00
Workflow Engine for Kubernetes
kubernetes
cncf
hacktoberfest
cloud-native
gitops
machine-learning
knative
k8s
mlops
argo
pipelines
batch-processing
dag
data-engineering
airflow
workflow
argo-workflows
workflow-engine
Updated 2025-07-18 09:03:10 +08:00
Kubeflow Pipelines on Tekton
Updated 2024-11-19 20:23:50 +08:00