examples/github_issue_summarization
Ankush Agarwal b24152cf06 Github Issue Summarization - Train using TFJob (#55)
* Github Issue Summarization - Train using TFJob

* Create a Dockerfile to build the image for tf-job
* Create a manifest to deploy the tf-job
* Create instructions on how to do all of this

Fixes https://github.com/kubeflow/examples/issues/43

* Address comments

* Add gcloud commands
* Add ks app
* Update Dockerfile base image
* Python train.py fixes

* Remove tfjob.yaml as it is replaced by ksonnet app

* Remove plot_model_history as it is not required for tfjob training

* Don't change WORKDIR

* Address reviewer comments

* Fix links

* Fix lint issues using yapf

* Sort imports
2018-03-29 13:37:04 -07:00
..
docker Add .pylintrc (#61) 2018-03-29 08:25:02 -07:00
notebooks Github Issue Summarization - Train using TFJob (#55) 2018-03-29 13:37:04 -07:00
sql Remove third_party folder & MIT license file 2018-02-27 13:17:42 -05:00
workflow Add .pylintrc (#61) 2018-03-29 08:25:02 -07:00
README.md Github Issue Summarization - Train using TFJob (#55) 2018-03-29 13:37:04 -07:00
querying_the_model.md [GitHub Issue Summarization] (very) simple front-end web app (#53) 2018-03-21 15:22:04 -07:00
requirements.txt Remove third_party folder & MIT license file 2018-02-27 13:17:42 -05:00
serving_the_model.md [GitHub Issue Summarization] (very) simple front-end web app (#53) 2018-03-21 15:22:04 -07:00
setup_a_kubeflow_cluster.md Add file copy instructions after training (#47) 2018-03-14 19:14:21 -07:00
teardown.md Move new files into renamed directory 2018-03-01 13:44:07 -05:00
training_the_model.md Add file copy instructions after training (#47) 2018-03-14 19:14:21 -07:00
training_the_model_tfjob.md Github Issue Summarization - Train using TFJob (#55) 2018-03-29 13:37:04 -07:00

README.md

[WIP] End-to-End kubeflow tutorial using a Sequence-to-Sequence model

This example demonstrates how you can use kubeflow end-to-end to train and serve a Sequence-to-Sequence model on an existing kubernetes cluster. This tutorial is based upon @hamelsmu's article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models".

Goals

There are two primary goals for this tutorial:

  • End-to-End kubeflow example
  • End-to-End Sequence-to-Sequence model

By the end of this tutorial, you should learn how to:

  • Setup a Kubeflow cluster on an existing Kubernetes deployment
  • Spawn up a Jupyter Notebook on the cluster
  • Spawn up a shared-persistent storage across the cluster to store large datasets
  • Train a Sequence-to-Sequence model using TensorFlow on the cluster using GPUs
  • Serve the model using Seldon Core
  • Query the model from a simple front-end application

Steps:

  1. Setup a Kubeflow cluster
  2. Training the model. You can train the model either using Jupyter Notebook or using TFJob.
    1. Training the model using a Jupyter Notebook
    2. Training the model using TFJob
  3. Serving the model
  4. Querying the model
  5. Teardown