A repository to host extended examples and tutorials
Go to file
Jeremy Lewi 712c29a18e Fix issues with the xgboost_synthetic example and deploying the model. (#682)
* Fix issues with the xgboost_synthetic example and deploying the model.

* install newer version of fairing
* modify preprocessor to use custom dockerfile
* use newer 0.7 base image.
* Fix endpoint.

Related to:

kubeflow/examples#673 model doesn't deploy its crash looping
Related to kubeflow/examples#655 update example to work with 0.7

* Add some comments to the notebook.
2019-11-25 14:55:10 -08:00
.github Updating issue bot configs (#630) 2019-09-03 17:32:58 -07:00
code_search Minor fixes to the notebook. (#427) 2019-01-15 08:33:19 -08:00
codelab-image Update Ksonnet version, Add Python2 pip (#216) 2018-08-07 22:58:20 -07:00
demos Fix broken url for example file in simple_pipeline demos (#660) 2019-10-16 00:59:39 -07:00
financial_time_series Update Financial Time Series example to v0.4.0 and add Kubeflow pipelines (#535) 2019-06-14 16:30:10 -07:00
github_issue_summarization notebook cleanup (#679) 2019-11-11 16:42:06 -08:00
mnist Allow extra arguments (#625) 2019-08-21 19:34:31 -07:00
named_entity_recognition added named entity recognition example (#590) 2019-09-18 16:41:00 -07:00
object_detection object_detection: fix typo error in tf-serving.libsonnet (#618) 2019-08-14 18:12:34 -07:00
pipelines Modify readme to match the actual source code (#659) 2019-10-14 18:41:51 -07:00
py Fix miscellaneous bugs with the xgboost_synthetic test (#676) 2019-11-07 19:46:19 -08:00
pytorch_mnist Lint fixes mnist (#581) 2019-07-24 19:23:52 -07:00
test Update xgboost_synthetic test infra; preliminary updates to work with 0.7.0 (#666) 2019-10-24 19:53:38 -07:00
videos Add video show notes (#582) 2019-06-28 18:01:24 -07:00
xgboost_ames_housing Add end2end test for Xgboost housing example (#493) 2019-02-12 06:37:05 -08:00
xgboost_synthetic Fix issues with the xgboost_synthetic example and deploying the model. (#682) 2019-11-25 14:55:10 -08:00
.gitignore Add .cache dir to gitignore (#573) 2019-06-15 06:52:10 -07:00
.pylintrc Update xgboost_synthetic test infra; preliminary updates to work with 0.7.0 (#666) 2019-10-24 19:53:38 -07:00
CONTRIBUTING.md Enable periodic tests for mnist & GH issue examples. (#486) 2019-01-22 16:10:17 -08:00
LICENSE Initial commit 2018-02-01 13:13:10 -08:00
OWNERS update_owner (#550) 2019-05-08 21:04:47 -07:00
README.md added named entity recognition example (#590) 2019-09-18 16:41:00 -07:00
prow_config.yaml Fix miscellaneous bugs with the xgboost_synthetic test (#676) 2019-11-07 19:46:19 -08:00

README.md

kubeflow-examples

A repository to share extended Kubeflow examples and tutorials to demonstrate machine learning concepts, data science workflows, and Kubeflow deployments. The examples illustrate the happy path, acting as a starting point for new users and a reference guide for experienced users.

This repository is home to the following types of examples and demos:

End-to-end

Named Entity Recognition

Author: Sascha Heyer

This example covers the following concepts:

  1. Build reusable pipeline components
  2. Run Kubeflow Pipelines with Jupyter notebooks
  3. Train a Named Entity Recognition model on a Kubernetes cluster
  4. Deploy a Keras model to AI Platform
  5. Use Kubeflow metrics
  6. Use Kubeflow visualizations

GitHub issue summarization

Author: Hamel Husain

This example covers the following concepts:

  1. Natural Language Processing (NLP) with Keras and Tensorflow
  2. Connecting to Jupyterhub
  3. Shared persistent storage
  4. Training a Tensorflow model
    1. CPU
    2. GPU
  5. Serving with Seldon Core
  6. Flask front-end

Pachyderm Example - GitHub issue summarization

Author: Nick Harvey & Daniel Whitenack

This example covers the following concepts:

  1. A production pipeline for pre-processing, training, and model export
  2. CI/CD for model binaries, building and deploying a docker image for serving in Seldon
  3. Full tracking of what data produced which model, and what model is being used for inference
  4. Automatic updates of models based on changes to training data or code
  5. Training with single node Tensorflow and distributed TF-jobs

Pytorch MNIST

Author: David Sabater

This example covers the following concepts:

  1. Distributed Data Parallel (DDP) training with Pytorch on CPU and GPU
  2. Shared persistent storage
  3. Training a Pytorch model
    1. CPU
    2. GPU
  4. Serving with Seldon Core
  5. Flask front-end

MNIST

Author: Elson Rodriguez

This example covers the following concepts:

  1. Image recognition of handwritten digits
  2. S3 storage
  3. Training automation with Argo
  4. Monitoring with Argo UI and Tensorboard
  5. Serving with Tensorflow

Distributed Object Detection

Author: Daniel Castellanos

This example covers the following concepts:

  1. Gathering and preparing the data for model training using K8s jobs
  2. Using Kubeflow tf-job and tf-operator to launch a distributed object training job
  3. Serving the model through Kubeflow's tf-serving

Financial Time Series

Author: Sven Degroote

This example covers the following concepts:

  1. Deploying Kubeflow to a GKE cluster
  2. Exploration via JupyterHub (prospect data, preprocess data, develop ML model)
  3. Training several tensorflow models at scale with TF-jobs
  4. Deploy and serve with TF-serving
  5. Iterate training and serving
  6. Training on GPU
  7. Using Kubeflow Pipelines to automate ML workflow

Pipelines

Simple notebook pipeline

Author: Zane Durante

This example covers the following concepts:

  1. How to create pipeline components from python functions in jupyter notebook
  2. How to compile and run a pipeline from jupyter notebook

MNIST Pipelines

Author: Dan Sanche and Jin Chi He

This example covers the following concepts:

  1. Run MNIST Pipelines sample on a Google Cloud Platform (GCP).
  2. Run MNIST Pipelines sample for on premises cluster.

Component-focused

XGBoost - Ames housing price prediction

Author: Puneith Kaul

This example covers the following concepts:

  1. Training an XGBoost model
  2. Shared persistent storage
  3. GCS and GKE
  4. Serving with Seldon Core

Demos

Demos are for showing Kubeflow or one of its components publicly, with the intent of highlighting product vision, not necessarily teaching. In contrast, the goal of the examples is to provide a self-guided walkthrough of Kubeflow or one of its components, for the purpose of teaching you how to install and use the product.

In an example, all commands should be embedded in the process and explained. In a demo, most details should be done behind the scenes, to optimize for on-stage rhythm and limited timing.

You can find the demos in the /demos directory.

Third-party hosted

Source Example Description

Get Involved

In the interest of fostering an open and welcoming environment, we as contributors and maintainers pledge to making participation in our project and our community a harassment-free experience for everyone, regardless of age, body size, disability, ethnicity, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation.

The Kubeflow community is guided by our Code of Conduct, which we encourage everybody to read before participating.