Commit Graph

429 Commits

Author SHA1 Message Date
Amy 67041ec4d5 updates to reflect changed node credentials, minor cleanup, update component URLs, (#675)
update serving-only pipeline as well
2019-11-05 18:29:01 -08:00
Jeremy Lewi e2198ce1e8 Fix the xgboost_synthetic test so it actually runs and produces signal (#674)
* Fix the xgboost_synthetic test so it actually runs and produces signal

* The test wasn't actually running because we were passing arguments that
  were unknown to pytest

* Remove the old role.yaml; we don't use it anymore

* Wait for the Job to finish and properly report status; kubeflow/testing#514
  contains the new routine

* The test still isn't passing because of kubeflow/examples#673

* In addition we need to fix the auto deployments kubeflow/testing#444

Related to kubeflow/examples#665

* Fix lint.
2019-11-04 21:56:38 -08:00
Amy 452aa428b6 Updates to the pipelines GH summarization lab to demonstrate component input/output (#669)
* copy and training step params, remove unused args,
use google-samples images

* update notebook to reflect new pipeline

* type definition change

* fix typo, use kfp.dsl.RUN_ID_PLACEHOLDER

* change 'serve' step to use gcp secret- req'd for 0.7
2019-10-27 04:55:14 -07:00
Jeremy Lewi 7e28cd6b23 Update xgboost_synthetic test infra; preliminary updates to work with 0.7.0 (#666)
* Update xgboost_synthetic test infra to use pytest and pyfunc.

* Related to #655 update xgboost_synthetic to use workload identity

* Related to #665 no signal about xgboost_synthetic

* We need to update the xgboost_synthetic example to work with 0.7.0;
  e.g. workload identity

* This PR focuses on updating the test infra and some preliminary
  updates to the notebook

* More fixes to the test and the notebook are probably needed in order
  to get it to actually pass

* Update job spec for 0.7; remove the secret and set the default service
  account.

  * This is to make it work with workload identity

* Instead of using kustomize to define the job to run the notebook we can just modify the YAML spec using python.
* Use the python API for K8s to create the job rather than shelling out.

* Notebook should do a 0.7 compatible check for credentials

  * We don't want to assume GOOGLE_APPLICATION_CREDENTIALS is set
    because we will be using workload identity.

* Take in repos as an argument akin to what checkout_repos.sh requires

* Convert xgboost_test.py to a pytest.

  * This allows us to mark it as expected to fail so we can start to get
    signal without blocking

  * We also need to emit junit files to show up in test grid.

* Convert the jsonnet workflow for the E2E test to a python function to
  define the workflow.

  * Remove the old jsonnet workflow.

* Address comments.

* Fix issues with the notebook
* Install pip packages in user space
  * 0.7.0 images are based on TF images and they have different permissions
* Install a newer version of fairing sdk that works with workload identity

* Split pip installing dependencies out of util.py and into notebook_setup.py

  * That's because util.py could depend on the packages being installed by
    notebook_setup.py

* After pip installing the modules into user space, we need to add the local
  path for pip packages to the Python path; otherwise we get import not found
  errors.
2019-10-24 19:53:38 -07:00
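
The last bullet of the commit above (adding the user-space pip directory to the Python path) can be sketched roughly as follows. This is a minimal illustration, not the code from notebook_setup.py; the function name add_user_site_to_path and its parameters are hypothetical.

```python
import site
import sys


def add_user_site_to_path(path=None, user_site=None):
    """Append the per-user site-packages directory to a module search path.

    Hypothetical helper: `path` defaults to sys.path and `user_site`
    defaults to the directory that `pip install --user` writes to.
    """
    path = sys.path if path is None else path
    user_site = site.getusersitepackages() if user_site is None else user_site
    # Only add the directory once so repeated notebook runs stay idempotent.
    if user_site not in path:
        path.append(user_site)
    return user_site
```

Calling add_user_site_to_path() with no arguments after a `pip install --user` step is one way to make the freshly installed modules importable without restarting the kernel.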
Jane Man 6b37a40293 Fix broken url for example file in simple_pipeline demos (#660)
* Fix broken url for example file in simple_pipeline demos

* fix TOC from Visual Studio Code
2019-10-16 00:59:39 -07:00
Jane Man 25a4619827 Modify readme to match the actual source code (#659) 2019-10-14 18:41:51 -07:00
Amy ad55b0a246 change gpu limit (#651) 2019-10-03 09:12:09 -07:00
Niklas Hansson 351aaf81c7 Updated the azurepipeline example. (#647)
I believe there is a small bug in the script; use a tmp variable to solve the issue.
2019-09-27 09:03:36 -07:00
Jin Chi He cfe166f73f update to kubeflow-metadata in examples (#646) 2019-09-26 16:13:34 -07:00
Jin Chi He 628babc66a update kubeflow-fairing commit sha to use job in clusterBuild (#643) 2019-09-20 08:07:00 -07:00
Amy c20ebb5c0f fix component URLs in pipeline now that primary PR is in (#642) 2019-09-19 14:46:58 -07:00
Amy b5349df27d Update to KFP pipelines codelab code (GH summarization) (#638)
* checkpointing

* checkpointing

* refactored pipeline that uses pre-emptible VMs

* checkpointing. istio routing for the webapp.

* checkpointing

* - temp testing components
- initial v of metadata logging 'component'
- new dirs; file rename

* public md log image; add md server connect retry

* update pipeline to include md logging steps

* - file rename, notebook updates
- update compiled pipeline; fix component name typo

- change DAG to allow md logging concurrently; update pre-emptible VMS PL

* pylint cleanup, readme/tutorial update/deprecation, minor tweaks

* file cleanup

* update the tfjob api version for an (unrelated) test to address presubmit issues

* try annotating test_train in github_issue_summarization/testing/tfjob_test.py with @unittest.expectedFailure

* try commenting out a (likely) problematic unittest unrelated to the code changes in this PR

* try adding @test_util.expectedFailure annotation instead of commenting out test

* update the codelab shortlink; revert to commenting out a problematic unit test
2019-09-19 08:47:00 -07:00
Sascha Heyer 1ff3cf50d3 added named entity recognition example (#590)
* added named entity recognition example

https://github.com/kubeflow/website/issues/853

* added previous and next steps

* changed all absolute links to relative links

* changed headline for better understanding

* moved dataset description section to top

* fixed style

* added missing Jupyter notebook

* changed headline

* added link to documentation

* fixed meaning of images and components

* adapted documentation to https://www.kubeflow.org/docs/about/style-guide/#address-the-audience-directly

* added link to ai platform models

* make it clear these are optional extensions

* changed summary and goals

* added kubeflow version

* fixed s/an/a/ also checked the rest of the documentation

* added #!/bin/sh

* added environment variables for build scripts and adapted documentation

* changed PROJECT TO PROJECT_ID

* added link to kaggle dataset and removed not required copy script (due to direct public location in gs://). Adapted Jupyter notebook input data path

* added hint to make clear no further steps are required

* fixed s/Run/RUN/

* grammar fix

* optimized text

* added prev link to index

* removed model description due to lack of information

* added significance and congrats =)

* added example

* guided the user's attention to specific screens/metrics/graphs

* explanation of pieces

* updated main readme

* updated parts

* fixed typo

* adapted dataset path

* made scripts executable

chmod +x

* Update step-1-setup.md

swapped sections and added env variables to gsutil command

* added information regarding public access

* fixed lint error

* fixed lint issues

* fixed lint issues

* figured kubeflow examples are using 2 rather than 4 spaces (due to tensorflow standards)

* lint fixes

* reverted changes

* removed unused import

* removed object inherit

* fixed lint issues

* added kwargs to ignored-argument-name (due to best practice in Google custom prediction routine)

* fix lint issues

* set pylintrc back to default and removed unused argument
2019-09-18 16:41:00 -07:00
Jin Chi He 78a79e72dc update example to kubeflow-fairing (#637) 2019-09-17 06:36:24 -07:00
Jin Chi He 4f8cf87d4f add testing for xgboost_synthetic (#633) 2019-09-16 15:28:24 -07:00
Michelle Casbon e37a9d7acd Merge pull request #599 from rem20806/master
Added azure pipeline example for Kubeflow
2019-09-03 18:05:04 -07:00
Rishabh Budhiraja 230bb1bf1b Updating issue bot configs (#630) 2019-09-03 17:32:58 -07:00
Rebecca McFadden 43f6475f1e Update data.py 2019-09-03 17:44:06 -05:00
Rebecca McFadden f86fd5f0fc fix exception 2019-09-03 17:34:37 -05:00
Rebecca McFadden c3bb4b99f4 lint fixes 2019-09-03 16:00:14 -05:00
Daniel Sanche ec9020e851 Allow extra arguments (#625) 2019-08-21 19:34:31 -07:00
Mike Mainguy 0b33b536b7 Applied changes to README and Kustomize files to handle training, monitoring, and serving the mnist model in S3 using Kustomize (#543) 2019-08-19 17:41:33 -07:00
Jeremy Lewi 5b3016fae9 Fix a bunch of issues with the xgboost_synthetic example (#621)
* Need to add kfmd to requirements.txt because the training code now uses
  kfmd to log data.

* The Dockerfile didn't build with kaniko; it looks like a permission problem
  trying to install python files into the conda directory. The problem appears
  to be fixed by not switching to user root.

* Update the base docker image to 1.13.

* Remove some references in the notebook to namespace because the fairing
  code should now detect namespace automatically and the notebook will no longer
  be running in namespace kubeflow

* When running training in a K8s job, the code will now try to contact the
  metadata server but this can fail if the ISTIO side car hasn't started yet.
  So we need to wait for ISTIO to start; we do this by trying to contact
  the metadata server for up to 3 minutes.

* Add a lot more explanation in the notebook to explain what is happening.

* Related to #619
2019-08-19 16:05:32 -07:00
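
The wait described in the commit above (retrying the metadata server for up to 3 minutes while the ISTIO side car starts) could look roughly like the sketch below. It is an assumption-laden illustration, not the code from the example: wait_until_ready, probe, and the 5 second interval are hypothetical, and only the 3 minute limit comes from the commit message.

```python
import time


def wait_until_ready(probe, timeout_s=180.0, interval_s=5.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Call `probe` until it succeeds or `timeout_s` seconds have passed.

    `probe` is any callable that raises while the service is unreachable,
    for example a function that opens a connection to the metadata server.
    """
    deadline = clock() + timeout_s
    while True:
        try:
            return probe()
        except Exception as err:  # the side car may simply not be up yet
            last_error = err
        # Give up if another full wait would run past the deadline.
        if clock() + interval_s > deadline:
            raise TimeoutError("service not ready in time") from last_error
        sleep(interval_s)
```

Passing `sleep` and `clock` as parameters keeps the loop easy to exercise in a test without actually waiting.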
Rebecca McFadden fdea8cd59e fixed parentheses syntax errors 2019-08-16 01:22:46 -04:00
Rebecca McFadden d81d83512a fixed some lint errors 2019-08-16 01:04:15 -04:00
Rebecca McFadden 153d9d87aa Merge branch 'master' of https://github.com/rem20806/examples 2019-08-15 13:42:12 -04:00
Rebecca McFadden eef0c2dcd9 Merge pull request #1 from kubeflow/master
update fork
2019-08-15 13:37:19 -04:00
Rebecca McFadden 01a4dbbd3b Merge remote-tracking branch 'upstream/master' 2019-08-15 13:33:51 -04:00
MrXinWang 2acf34f916 object_detection: fix typo error in tf-serving.libsonnet (#618)
modified tf-serving.libsonnet in object_detection example to fix the error of
"FileSystemStoragePathSource encountered a file-system access error:
Could not find base path /models/model for servable model"

Change-Id: I946a0a7fbb6c80992d66fe003ca90b1c21c67cfc
Signed-off-by: Henry Wang <henry.wang@arm.com>
2019-08-14 18:12:34 -07:00
Simon Rey ef9484595f Add tensorboard support for local mnist example (#616)
* Add files via upload

* Update kustomization.yaml

* Update README.md

* Update README.md

* Update README.md
2019-08-12 16:03:38 -07:00
Zhenghui Wang 22de8cf7c1 Add metadata logging to xgboost-synthetic example (#610)
* meta logging

* lint

* pip install fairing

* update predict() function
2019-08-05 20:45:54 -07:00
Rebecca McFadden 13f4cb1217 changed file permissions to 755 2019-08-02 16:06:40 -07:00
Rebecca McFadden 10f3a1fc32 fix indentation 2019-08-02 15:18:30 -07:00
Rebecca McFadden 733923410c update non-pipeline run bash scripts 2019-08-01 21:13:22 -07:00
Rebecca McFadden e2d1c94447 Update inference.sh 2019-08-01 21:06:35 -07:00
Rebecca McFadden 93ef78b091 remove generated files 2019-08-01 21:00:55 -07:00
Rebecca McFadden 9ab9e05b12 edit build.sh for #!/bin/bash 2019-08-01 21:00:08 -07:00
Rebecca McFadden 74d7bcdba0 Update README.md 2019-07-29 20:54:03 -07:00
Rebecca McFadden 7b166d211f Remove deprecated services and unnecessary folders 2019-07-29 20:50:22 -07:00
Xiao Kou 607533311e Fix mnist readme service name and deployments name typo (#611) 2019-07-29 20:12:51 -07:00
David Sabater Dinter a9c6e69f0e Lint fixes mnist (#581)
* Remove modules from .pylintrc

* Add lint inline exceptions

* Add lint inline exceptions as all as the specific exception is not available for Pylint 1.8

* Fix string formatting logging message and remove unnecessary Pylint exception

* Update app.yaml with correct environment details
2019-07-24 19:23:52 -07:00
Chun-Hsiang Wang 6e5ba488e2 Update readme for xgboost-synthetic and remove outdated yaml file (#605)
* Update readme for xgboost-synthetic and remove outdated yaml file.

* Update the class name to be more general.

* Update readme.

* Set google_application_credentials in the notebook.

* Install fairing from master branch.

* Do not set credentials again.

* Update readme.
2019-07-22 18:20:54 -07:00
Rebecca McFadden 4dd4d1069a fixed tags 2019-07-19 09:05:34 -07:00
Rebecca McFadden 1d425a1931 bug fixes 2019-07-19 08:56:18 -07:00
Chun-Hsiang Wang fb6cd69def Install pip dependencies and build base image with kaniko (#603)
* Install required pip packages not included in the base package.

* Use Kaniko builder to build the base image first.

* Directly install packages from requirements.txt to be more flexible.
2019-07-18 22:35:12 -07:00
Rebecca McFadden fcd1e4bce1 fixed small errors 2019-07-18 11:44:25 -07:00
Chun-Hsiang Wang cda6efed27 Include newly trained model in the newly built docker image (#601) (#602) 2019-07-17 19:50:11 -07:00
Rebecca McFadden 3b85d55d8c removed incorrect file 2019-07-17 15:08:10 -07:00
Rebecca McFadden 17ddeb49fb added azure pipeline example
Co-Authored-By: Mohona Ahmed <t-moahme@microsoft.com>
Co-Authored-By: Maanav Dalal <maanavdalal@gmail.com>
Co-Authored-By: sethjuarez <me@sethjuarez.com>
2019-07-17 15:04:04 -07:00
Chun-Hsiang Wang ac9f2f1238 Add kubecon demo to xgboost_ames_housing directory (#589)
* Add xgboost-ames-housing demo from Kubecon EU 2019.

* fix links in the .ipynb in the xgboost-ames-housing demo

* update to the xgboost demo example from kubecon
- move example to its own directory
- remove unnecessary files
- modify util and update notebook

* change the names related to kubecon and update readme

* use fairing instead of own fairing_util in the notebook

* remove fairing_util and move the remaining to util instead

* update synthetic data example as comments
- generalize yaml
- remove updating github procedures
- update readme
- rename files

* fix pylint.

* fix pylint.
2019-07-16 10:33:25 -07:00