Distributed ML Training and Fine-Tuning on Kubernetes
kubernetes
huggingface
ai
llm
gpu
jax
kubeflow
distributed
xgboost
machine-learning
mlops
python
pytorch
tensorflow
fine-tuning
Updated 2025-08-30 00:22:24 +08:00
A CLI for Kubeflow.
Updated 2025-08-29 20:03:12 +08:00
Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. It fills a gap between model experimentation and production activities. It provides a central interface for all stakeholders in the MLOps lifecycle to collaborate on ML models.
Updated 2025-08-19 23:41:05 +08:00
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
kubernetes
kubernetes-operator
spark
apache-spark
kubernetes-crd
google-cloud-dataproc
kubernetes-controller
Updated 2025-08-15 01:18:23 +08:00
Automated Machine Learning on Kubernetes
kubernetes
machine-learning
kubeflow
ai
tensorflow
huggingface
llm
mlops
jax
pytorch
hyperparameter-tuning
neural-architecture-search
automl
scikit-learn
Updated 2025-08-14 22:47:26 +08:00
Machine Learning Toolkit for Kubernetes
kubernetes
machine-learning
kubeflow
tensorflow
ml
notebook
google-kubernetes-engine
jupyter
minikube
Updated 2025-08-13 04:28:12 +08:00
Kubeflow Central Dashboard is the web interface for Kubeflow
Updated 2025-08-07 01:44:25 +08:00
Machine Learning Pipelines for Kubeflow
Updated 2025-08-07 01:15:18 +08:00
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
Updated 2025-07-26 01:27:49 +08:00
Repository used to main group ACLs used by Kubeflow developers
Updated 2025-07-12 11:38:03 +08:00
Kubeflow Pipelines on Tekton
Updated 2024-11-19 20:23:50 +08:00