# End-to-End Kubeflow tutorial using a Sequence-to-Sequence model
This example demonstrates how you can use Kubeflow end-to-end to train and serve a Sequence-to-Sequence model on an existing Kubernetes cluster. The tutorial is based on @hamelsmu's article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models".
## Goals
There are two primary goals for this tutorial:
- Demonstrate an end-to-end Kubeflow example
- Present an end-to-end Sequence-to-Sequence model
By the end of this tutorial, you should know how to:
- Set up a Kubeflow cluster on an existing Kubernetes deployment
- Spawn a Jupyter Notebook on the cluster
- Spawn shared persistent storage across the cluster to store large datasets
- Train a Sequence-to-Sequence model using TensorFlow and GPUs on the cluster
- Serve the model using Seldon Core
- Query the model from a simple front-end application
## Steps

- [Setup a Kubeflow cluster](01_setup_a_kubeflow_cluster.md)
- Training the model, using either of the following methods:
  - [Training with a Jupyter Notebook](02_training_the_model.md)
  - [Training with TFJob](02_training_the_model_tfjob.md)
- [Serving the model](03_serving_the_model.md)
- [Querying the model](04_querying_the_model.md) (a hedged sketch of a query appears after this list)
- [Teardown](05_teardown.md)
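To make the querying step concrete, here is a hedged sketch of what calling the served model from Python can look like. The host, deployment name, endpoint path, and payload shape are all illustrative assumptions rather than the tutorial's actual values; consult the serving and querying steps above for the real endpoint exposed by your Seldon deployment.

```python
# Hedged sketch of querying the served model; every concrete value below is an assumption.
import requests

# Hypothetical endpoint: substitute the address and deployment name from your own cluster.
URL = "http://<cluster-ip>/seldon/issue-summarizer/api/v0.1/predictions"

# Hypothetical request schema: a single text input wrapped in Seldon's ndarray payload.
payload = {"data": {"ndarray": [["fix crash when saving a file with a unicode name"]]}}

resp = requests.post(URL, json=payload, timeout=30)
resp.raise_for_status()
print(resp.json())  # expected to contain the model's predicted issue title
```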