Commit Graph

429 Commits

Author SHA1 Message Date
Jeremy Lewi 7bde5b484d
Update fairing in mnist to 1.0.1 (#807)
This addresses kubeflow/kfserving#806. Fairing 1.0.0 isn't compatible with kfserving 0.3.2
2020-07-07 17:37:07 -07:00
Jeremy Lewi c880fdaa80
Delete the notebook tests because they are outdated. (#808)
* Delete the notebook tests because they are outdated.

* We have rewritten the test infra for notebooks to use Tekton.
  see:
    https://github.com/kubeflow/testing/blob/master/tekton/templates/pipelines/notebook-test-pipeline.yaml
    https://github.com/kubeflow/examples/tree/master/py/kubeflow/examples/notebook_tests

* We are also no longer regularly deploying the v1 clusters; we are now using
  blueprints, which is why the tests can no longer get credentials

* Add the mnist notebook test as a postsubmit and periodic test.

* Fix.
2020-07-07 01:23:58 -07:00
Yash Jakhotiya 10b34b8dc8
Add link to the 'Deploying Kubeflow on GCP' doc (#779)
This is important as this is an E2E tutorial. Moreover, those docs mention the catch that the GCP Free Tier and the 12-month trial period with $300 credit do not offer enough resources to run the default GCP installation of Kubeflow.
2020-06-29 14:18:41 -07:00
Ian Coffey 2bc4709bac
bentoml, fairing, kserving and prophet example (#801)
* add notebook around using bentoml, fairing and kserving together

* Update bentoml example to use KFServing custom inference service
2020-06-27 22:56:14 -07:00
Jamie Poole 951ca8f7f6
Update step-1-setup.md (#796) 2020-06-23 07:22:32 -07:00
Jeremy Lewi 197abc9daa
Some improvements to utilities for testing notebooks (#803)
* Changes pulled in from kubeflow/examples#764

* Notebook tests should print a link to the stackdriver logs for
  the actual notebook job.

* Related to kubeflow/testing#613

Co-authored-by: Gabriel Wen <gabrielwen@google.com>
2020-06-14 20:21:56 -07:00
Jeremy Lewi c9a10863c3
Cleanup the owners file (#804)
* Remove folks that are no longer active.
2020-06-12 18:35:55 -07:00
Ian Coffey b8b7179fc2
Bump fairing commit to latest on master to fix examples (#795)
* fixes kubeflow/examples#792 bump fairing commit to latest on master to fix periodic tests

* Skip xgboost synthetic test until solution is found
2020-05-20 13:56:19 -07:00
Eugene Fedorenko 0609ad4738
Fixes. End-to-End Pipeline Example on Azure (#788)
* Fixes

* Fixes
2020-04-30 14:07:38 -07:00
Bernd Verst b8aa657a84
Add OWNERS file for Azure Pipelines example (#791) 2020-04-20 17:32:55 -07:00
Olivier Martin 1f9e7a5fce
fixed pipeline, removed a step (#768) 2020-04-08 11:41:43 -07:00
Hung-Ting Wen c337d90e87
freeze papermill version #783 (#784) 2020-03-31 19:23:27 -07:00
JohanWork 0b11a78bd9
update simple notebook (#1) (#771)
* update simple notebook
2020-03-16 13:10:31 -07:00
Christian Kadner b7e54a83c3
Update invitation link to Kubeflow Slack channel (#769) 2020-03-12 16:28:39 -07:00
Tommy Li e95c20112b
Update IBM Cloud instructions to use persistent storage. (#766) 2020-03-10 19:43:35 -07:00
Kubernetes Prow Robot c5222ddde8
add a general notebook test script (#763)
* add a general notebook test script

* fix join

* fix typo

* infer notebook name with path

* replace

* fix name

* add a log

* update comment
2020-03-04 14:54:39 -08:00
Derrick Miller 95db89ad74
Update Dockerfile ENTRYPOINT to use /usr/local/bin/python (#744)
The base image `FROM tensorflow/tensorflow:1.15.2-py3` uses Python 3, so the Python binary is located at `/usr/bin/python3`. However, the [tensorflow base image creates a symlink](e5bf8de410/tensorflow/tools/dockerfiles/dockerfiles/cpu.Dockerfile (L45)) to the current Python binary at `/usr/local/bin/python`, regardless of whether that is Python 2 or Python 3. That location should therefore be used in the *ENTRYPOINT* of `Dockerfile.model` instead of `/usr/bin/python`, which is customary for Python 2.x installations.
2020-03-03 08:41:38 -08:00
Sarah Maddox 7589c004d7
Tech writing updates for GCP MNIST notebook (#756)
* Tech writing updates for GCP MNIST notebook.

* Changed logic for defining webapp endpoints.
2020-03-02 13:23:38 -08:00
Bernd Verst 2b827ea139
Adds MNIST E2E Example for Azure. (#759)
* Adds MNIST E2E Example for Azure.

* Remove auto-generated ToC

* Remove incompatible script to retrieve Ingress URL

* Remove orphaned ToC entry
2020-02-28 18:15:53 -08:00
Zhenghui Wang 984713300b
Bump notebook image version for xgboost-test (#760)
* bump notebook image version

* use 1.0.0 tag
2020-02-28 09:16:38 -08:00
Zhenghui Wang 2d9a1db3aa
update xgboost_synthetic with metadata sdk 0.3.1 (#758) 2020-02-26 16:31:19 -08:00
Tommy Li 222715031a
add ibm mnist example (#746)
remove cell outputs

update cos section

update missing typos
2020-02-26 15:27:19 -08:00
Amy fd7a98fed1
GIS KFP example: kfp-related syntax changes; use gcs client libs instead of gsutil (#749)
* some mods to accommodate (perhaps temporary) changes in how the kfp sdk works

* Use gcs client libs rather than gsutil for a gcs copy; required due to changes in node service account permissions.

* more mods to address kfp syntax changes
2020-02-24 08:10:48 -08:00
Jeremy Lewi b218d2b23c
Fix the mnist_gcp_test.py (#741)
* Fix the mnist_gcp_test.py

* The job spec was invalid; we were missing container name

* There were a bunch of other issues as well.

* Pull in the changes from xgboost_synthetic to upload an HTML version
  of the notebook output to GCS.

* Add exceptoin

* Revert "Add exceptoin"

This reverts commit 44f34d9d74.
2020-02-21 15:58:48 -08:00
Jeremy Lewi 5b4b0c6c94
Remove kustomize from mnist example. (#745)
* Remove kustomize from mnist example.

* The mnist E2E guide has been updated to use notebooks and get rid
  of kustomize

* We have notebooks for AWS, GCP, and Vanilla K8s.

* As such we no longer need the old, outdated kustomization files or
  Docker containers anymore

  * The notebooks handle parameterizing the K8s resources using Python
    f style string.

* Update the README to remove the old instructions.

* Cleanup more references.
2020-02-21 14:14:47 -08:00
Jiaxin Shan 4c4f1c0f88
Create a notebook for mnist E2E on AWS (#740)
* Add method to get ALB hostname for aws users

* Revoke setup based on the platform

* Add AWS notebook for mnist e2e example

* Remove legacy kustomize manifests for mnist example

* Address feedback from reviewers
2020-02-20 18:32:32 -08:00
Adhita Selvaraj 40f6ec8fe7
Mnist vanilla k8s (#737)
* adds mnist example for vanilla k8s

* typo fix

* address review comments; get minio endpoint from k8s client;
2020-02-20 06:51:04 -08:00
Amy 443f4bd2a3
deprecating gis e2e example until it is fixed. (#736) 2020-02-18 20:58:25 -08:00
Niklas Hansson d4f78455a4
Github action deploy to kubeflow (#730)
* Updated the azurepipeline example. 

I believe there is a small bug in the script; use a tmp variable to work around the issue.

* updated with github actions example

* Update README.md

Updated the readme further.

* Update README.md

* Update README.md

* Update data.py

* specifying version of ubuntu and updating text

* fixing spelling mistake

* update the linting

* updated yaml

* Update data.py

Co-authored-by: JohanWork <39947546+JohanWork@users.noreply.github.com>
2020-02-18 02:33:30 -08:00
Jeremy Lewi cc93a80420
Create a notebook for mnist E2E on GCP (#723)
* A notebook to run the mnist E2E example on GCP.

This fixes a number of issues with the example
* Use ISTIO instead of Ambassador to add reverse proxy routes
* The training job needs to be updated to run in a profile created namespace in order to have the required service accounts
     * See kubeflow/examples#713
     * Running inside a notebook running on Kubeflow should ensure user
       is running inside an appropriately setup namespace
* With ISTIO the default RBAC rules prevent the web UI from sending requests to the model server
     * A short term fix was to not include the ISTIO side car
     * In the future we can add an appropriate ISTIO rbac policy

* Using a notebook allows us to eliminate the use of kustomize
  * This resolves kubeflow/examples#713 which required people to use
    an old version of kustomize

  * Rather than using kustomize we can use python f style strings to
    write the YAML specs and then easily substitute in user specific values

  * This should be more informative; it avoids introducing kustomize and
    users can see the resource specs.

* I've opted to make the notebook GCP specific. I think it's less confusing
  to users to have separate notebooks focused on specific platforms rather
  than having one notebook with a lot of caveats about what to do under
  different conditions

* I've deleted the kustomize overlays for GCS since we don't want users to
  use them anymore

* I used fairing and kaniko to eliminate the use of docker to build the images
  so that everything can run from a notebook running inside the cluster.

* k8s_utils.py has some reusable functions to hide some details from users
  (e.g. low level calls to K8s APIs).

* Change the mnist test to just run the notebook
  * Copy the notebook test infra for xgboost_synthetic to py/kubeflow/examples/notebook_test to make it more reusable

* Fix lint.

* Update for lint.

* A notebook to run the mnist E2E example.

Related to: kubeflow/website#1553

* 1. Use fairing to build the model. 2. Construct the YAML spec directly in the notebook. 3. Use the TFJob python SDK.

* Fix the ISTIO rule.

* Fix UI and serving; need to update TF serving to match version trained on.

* Get the IAP endpoint.

* Start writing some helper python functions for K8s.

* Commit before switching from replace to delete.

* Create a library to bulk create objects.

* Cleanup.

* Add back k8s_util.py

* Delete train.yaml; this shouldn't have been added.

* update the notebook image.

* Refactor code into k8s_util; print out links.

* Clean up the notebook. Should be working E2E.

* Added section to get logs from stackdriver.

* Add comment about profile.

* Latest.

* Override mnist_gcp.ipynb with mnist.ipynb

I accidentally put my latest changes in mnist.ipynb even though that file
was deleted.

* More fixes.

* Resolve some conflicts from the rebase; override with changes on remote branch.
2020-02-16 19:15:28 -08:00
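The f-string approach described in #723 (writing the YAML spec in Python and substituting user-specific values) can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the function name `render_tfjob_spec`, the replica layout, the script path, and the `--tf-train-steps` flag are assumptions made for the example.

```python
import textwrap


def render_tfjob_spec(name, namespace, image, train_steps):
    """Return a TFJob YAML spec with user-specific values substituted in.

    Hypothetical helper: the field values below are illustrative only.
    """
    # The f-string fills in the placeholders; dedent strips the
    # indentation that comes from nesting the literal in this function.
    return textwrap.dedent(f"""\
    apiVersion: kubeflow.org/v1
    kind: TFJob
    metadata:
      name: {name}
      namespace: {namespace}
    spec:
      tfReplicaSpecs:
        Chief:
          replicas: 1
          template:
            spec:
              containers:
              - name: tensorflow
                image: {image}
                command:
                - python
                - /opt/model.py
                - --tf-train-steps={train_steps}
    """)


# Example values are placeholders, not real project settings.
spec = render_tfjob_spec(
    name="mnist-train",
    namespace="kubeflow-user",
    image="gcr.io/example-project/mnist-train:latest",
    train_steps=200,
)
print(spec)
```

Because the spec is an ordinary string, users can read the full resource definition in the notebook before it is applied, which is the benefit the commit message points to over kustomize overlays.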
Hung-Ting Wen b9a7719f29
Write xgboost_synthetic test output to html (#735)
* use nbconvert to write output as html

* write local file

* change dir

* write to gcs

* add kubeflow/testing

* update to env and checkout_repos

* format gcs path

* fix syntax

* fix

* add option notebook_artifacts_dir

* download to artifacts

* fix

* shorten name

* fix

* fix

* mkdirs

* fix

* fix

* log error

* use notebook_artifacts_path
2020-02-14 16:19:27 -08:00
Jeremy Lewi 3ac521d09f
Label bot should alias feature to kind/feature 2020-02-13 06:22:23 -08:00
Jeremy Lewi e91e9c0df3
Remove the lint tests because they are using python2 (#728)
* Lint is failing because we are still running python2 for lint

* kubeflow/testing#560 is related to building an updated image with python3.8
  compatible version of lint so we can support f style strings.

* However, the unittests for kubeflow examples are still written in
  ksonnet. It's not worth trying to update that so we just
  remove that test for now. The test was just running lint

* We should really see about using Tekton to write the workflows

  see kubeflow/testing#425
2020-02-11 18:16:08 -08:00
Amy a467af7d71
add note to indicate that mnist pipeline example does not currently work (#726)
* add note to indicate that mnist pipeline example does not currently work
2020-02-11 09:56:14 -08:00
Amy 0c8d2fdfc1
mnist example namespace fix (#720)
* update mnist tutorial to use the user profile namespace for the tfjob

* add namespace arg to some kubectl commands
2020-02-10 10:11:54 -08:00
Hung-Ting Wen 06f9b3f880
kubeflow-kf-ci-v1-user (#719) 2020-02-06 20:09:42 -08:00
Hung-Ting Wen 188ba8f091
xgboost test for v1 (#718)
* add param for cluster pattern

* add new entry to prow-config

* add info to error

* fix prow-config

* match prefix instead of exact test target name matching

* update prow-config

* remove master suffix for 63 char limit

* fix lint
2020-02-04 16:49:55 -08:00
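Two of the changes in #718, matching test targets by prefix and dropping the master suffix to stay within the 63-character limit, might look roughly like the sketch below. The function names and the `-master` suffix format are hypothetical; the commit message does not show the actual implementation. The 63-character figure is the Kubernetes limit on label values and DNS label names.

```python
# Kubernetes limits label values and DNS label names to 63 characters.
MAX_K8S_NAME_LENGTH = 63


def matches_target(test_target_name, prefix):
    """Return True if the test target name starts with the given prefix.

    Hypothetical helper: prefix matching lets one config entry cover
    variants of a target rather than requiring an exact name match.
    """
    return test_target_name.startswith(prefix)


def shorten_name(name, suffix="-master"):
    """Drop a trailing suffix and truncate so the name fits the limit.

    Hypothetical helper; the "-master" suffix format is an assumption.
    """
    if name.endswith(suffix):
        name = name[: -len(suffix)]
    # Truncate, then strip a trailing "-" that truncation may leave behind.
    return name[:MAX_K8S_NAME_LENGTH].rstrip("-")


print(matches_target("xgboost_synthetic-v1", "xgboost_synthetic"))
print(shorten_name("kubeflow-examples-presubmit-master"))
```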
Zhenghui Wang 941686e4be
xgboost pin metadata==0.2.0 (#717) 2020-01-31 16:41:21 -08:00
Amy 68f172c2ee
pin the web-ui version of TF to 1.7-- same as training (#658) 2020-01-24 08:33:10 -08:00
Amy c4f1c46298
'simple pipeline' notebook update to reflect SDK changes and KF notebook image changes (#705)
* 'simple pipeline' notebook update to reflect SDK changes and KF notebook images changes

* added example of 'create_run_from_pipeline_func'
2020-01-21 17:21:24 -08:00
Svendegroote91 d925823716 Add kubeflow namespace for kubectl commands in Readme for financial time series example (#706) 2019-12-20 11:07:33 -08:00
Svendegroote91 d93c18f66e Financial example v0.7 (#693)
* Update financial time series example to Kubeflow v0.7

* Move from GRPC to HTTP for serving client request for financial time series example

* Update Tensorflow version to 1.15 on financial time series example

* Update KFP pipeline to show accuracy metric and remove deprecated dsl.PipelineParam

* Split train and deploy step and add conditional step to deploy in KFP

* Clean up readme and add visuals for financial time series example
2019-12-17 18:59:57 -08:00
dependabot[bot] c20eafc4fc Bump nltk from 3.2.5 to 3.4.5 in /github_issue_summarization (#698)
Bumps [nltk](https://github.com/nltk/nltk) from 3.2.5 to 3.4.5.
- [Release notes](https://github.com/nltk/nltk/releases)
- [Changelog](https://github.com/nltk/nltk/blob/develop/ChangeLog)
- [Commits](https://github.com/nltk/nltk/compare/3.2.5...3.4.5)

Signed-off-by: dependabot[bot] <support@github.com>
2019-12-17 15:54:03 -08:00
dependabot[bot] f6a7adb2fc Bump tensorflow-gpu from 1.3.0 to 1.15.0 in /github_issue_summarization (#697)
Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.3.0 to 1.15.0.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v1.3.0...v1.15.0)

Signed-off-by: dependabot[bot] <support@github.com>
2019-12-16 12:59:39 -08:00
Kunming Qu 0d49548b3a Mpi example (#690)
* mpi horovod example on kubeflow

* add readme
2019-12-09 17:49:29 -08:00
Jin Chi He 1e385247b0 update ci tests for mnist example (#684) 2019-12-06 16:55:54 -08:00
Jeremy Lewi 341deccc34 Add a GitHub action to auto add/remove issues to needs triage kanban board. (#683)
Related to: kubeflow/community#278
2019-11-26 13:11:20 -08:00
Jeremy Lewi 712c29a18e Fix issues with the xgboost_synthetic example and deploying the model. (#682)
* Fix issues with the xgboost_synthetic example and deploying the model.

* install newer version of fairing
* modify preprocessor to use custom dockerfile
* use newer 0.7 base image.
* Fix endpoint.

Related to:

kubeflow/examples#673 model doesn't deploy; it's crash looping
Related to kubeflow/examples#655 update example to work with 0.7

* Add some comments to the notebook.
2019-11-25 14:55:10 -08:00
Amy 91374e6d27 notebook cleanup (#679) 2019-11-11 16:42:06 -08:00
Jeremy Lewi 7a2977ef11 Fix miscellaneous bugs with the xgboost_synthetic test (#676)
* namespace where test runs should correspond to the namespace of a Kubeflow
  profile

* There was a bug in the logging format string

* There was a bug in the print statement for the job
2019-11-07 19:46:19 -08:00