Data Working Group
The Data Working Group (WG Data) works to improve support for data- and metadata-related tasks within Kubeflow, with a specific focus on the Spark Operator and Model Registry. The group aims to simplify and improve data processing between stages of the ML lifecycle, for example from data preparation to model training and fine-tuning. It also aims to facilitate metadata management for ML models while ensuring seamless integration with other Kubeflow components.
The goal of the Spark on Kubernetes Operator is to simplify running Apache Spark on Kubernetes. It automates deployment and simplifies lifecycle management of Spark jobs on Kubernetes.
The goal of Model Registry is to gather, analyze, and develop the model registry requirements of Kubeflow community users.
The charter defines the scope and governance of the Data Working Group.
Meetings
- KF Model Registry community meeting (US/EMEA): Mondays at 7:00PM-8:00PM Europe/Madrid (biweekly - every other Monday). Convert to your timezone.
- Kubeflow Spark Operator Meeting: Fridays at 8:00AM-9:00AM (biweekly - every other Friday). Convert to your timezone.
Organizers
- Yi Chen (@ChenYi015), Alibaba Cloud
- Andrey Velichkevich (@andreyvelich), Apple
- Ramesh Reddy (@rareddy), Red Hat
Contact
- Slack: https://www.kubeflow.org/docs/about/community/#slack-channels
- Mailing list
- Open Community Issues/PRs
- GitHub Teams:
  - @kubeflow/wg-data-leads - Team of Data Working Group leads