Commit Graph

414 Commits

Author SHA1 Message Date
Yuan (Bob) Gong 988f5b02e4
chore(release): bump version to 1.0.0 on master branch (#4249) 2020-07-20 02:04:51 -07:00
Yuan (Bob) Gong 45a91f6699
feat(deployment): GCP managed storage - detailed instructions to set up workload identity bindings before deployment (#4232)
* feat: allow creating workload identity bindings before deploying KFP

* more instructions

* fix formatting

* fixes

* Update doc ref

* fix storage role

* fix storage role

* add viewer KSA to standalone manifest

* fix missing configmap

* update documentation
2020-07-16 23:13:00 -07:00
Renmin c615e3796c
fix(deployment): fix database name prefix description (#4198)
* fix wording

* Update schema.yaml

* Update schema.yaml

Co-authored-by: Yuan (Bob) Gong <gongyuan94@gmail.com>
2020-07-10 15:22:16 +08:00
Yuan (Bob) Gong c5c9739a00
fix(deployment): gcp managed storage - add service account to minio and cloudsql proxy deployments for workload identity (#4188) 2020-07-09 18:24:54 +08:00
Joe Liedtke b8e210af25
docs(deployment): Suggested wording changes for schema.yaml. (#4151)
* Suggested wording changes to schema.yaml.

* Update schema.yaml

Co-authored-by: Yuan (Bob) Gong <gongyuan94@gmail.com>
2020-07-07 02:33:58 -07:00
Yuan (Bob) Gong d4a8329134
feat(deployment): marketplace - allow specifying gcs bucket directly (#4112)
* feat(deployment): marketplace - allow specifying gcs bucket directly

* Switch tfx default bucket to user specified one

* Update schema description

* Update version to 0.5.1 to match marketplace expectation

* Fix gcsBucketName var

* Remove gcp secret credentials
2020-07-01 20:18:00 -07:00
Yuan (Bob) Gong 79e0ee2b49
chore: remove inactive reviewers (#4111)
* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS
2020-06-30 19:10:06 -07:00
frozeNinK 8a2d11c96a
feat(backend): Make number of persistence worker goroutine configurable (#3904)
* Make number of persistence worker configurable

* address comments

* address comments

* address comments
2020-06-29 21:37:58 -07:00
Yuan (Bob) Gong c80a8a212c
refactor(deployment): separate metadata-writer and metadata-grpc folders (4071)
* refactor(deployment): separate metadata-writer and metadata-grpc folders

* fix release script
2020-06-28 01:14:14 -07:00
Yuan (Bob) Gong 9101ded02e
refactor(deployment): move kustomization.yaml images to the lowest level package (4072)
* refactor(deployment): separate metadata-writer and metadata-grpc folders

* refactor(deployment): move kustomization.yaml images to the lowest level package

* format
2020-06-27 21:18:14 -07:00
Yuan (Bob) Gong 1bbd82c25e
refactor(deployment): use minio artifact secret as single source of truth (4065)
* refactor(manifests): move minio artifact secret to minio package

* let api server and ui use minio artifact secret instead of default value

* Update kustomization.yaml

* fix name
2020-06-26 02:03:31 -07:00
dushyanthsc bc3c59aea1
MetadataStore: Update to release metadata-envoy in each release (#4026) 2020-06-23 19:07:17 -07:00
Yuan (Bob) Gong 002ea63279
chore(marketplace): fix version in release note (#4015) 2020-06-18 17:23:10 -07:00
jingzhang36 8553497c3c
Reduce ttl of persisted final workflow to 1 day (#4005)
* reduce ttl of pesisted final workflow to 1 day

* add comment

* enable pagination when expanding experiment in both the home page and the archive page

* Revert "enable pagination when expanding experiment in both the home page and the archive page"

This reverts commit 5b672739dd.

* Address comments
2020-06-18 00:22:06 -07:00
dushyanthsc 3de9987b56
Metadatastore: Updating metadata grpc server image to 0.22.1 (#3982) 2020-06-17 20:34:06 -07:00
Yuan (Bob) Gong c0074463ee
chore(manifests): add pod get permission to ui back 2020-06-16 15:24:03 +08:00
Yuan (Bob) Gong ecb6b1fe25
chore(manifests): add missing port name and other cleanups (#3987) 2020-06-15 22:38:03 -07:00
Yuan (Bob) Gong c0124cb747
[Manifest] Apply kustomize best practices to standalone manifest (#3978)
* Use configMapKeyRef for env vars

* Allow easy customization of cluster-scoped resources namespace

* clean up

* Clean up

* Simplify var replacement with direct configmap value ref

* clean up params.env
2020-06-15 19:09:57 -07:00
Yuan (Bob) Gong 3cf36cdef3
[Format] Format yaml files that will be automated (#3958)
* [Format] Format yaml files that will be automated

* Fix formatting

* Add a top level script

* Fix formatting
2020-06-10 20:28:58 -07:00
Yuan (Bob) Gong b12023ef3b
[Release] Various fixes to release script (#3918)
* Update kustomize/base/params.env

* fix

* kfp sdk requirement updated to kfp_api_server<2.0.0

* Fix major minor parsing

* Also fix cloudbuild.yaml
2020-06-05 13:32:21 +08:00
Amy 0324b2cac8
create cluster using 'cloud-platform' scope (#3892) 2020-06-04 12:14:38 -07:00
Yuan (Bob) Gong c517fc579d
[Release] Automate release script for all the changes (#3777)
* Refactor components/release.sh to provide a new components/release-branch.sh that updates release branch directly

* Release components as version tag instead of commit SHA

* Publish component images in release.cloudbuild.yaml

* Include script that updates version tag for component sdk

* [Manifest] Use kustomize native image transformer to override image

* Revert unintended changes

* Fix kustomization.yaml location

* Fix inverse proxy image

* Add release script for kustomize manifest

* Add release scripts for marketplace manifest and sdk

* Add global release.sh

* Fix sdk release script

* Clean up release scripts

* Fix release script

* Fix release scripts

* fix

* fix

* Fix cannot use uppercase vars in cloudbuild.yaml

* Add old components release script back

* Add a RELEASE.md doc
2020-06-03 08:44:18 -07:00
Yuan (Bob) Gong 53d35ddc87
[ScheduledWorkflow] Fix events permission missing (#3785) 2020-05-19 21:10:18 -07:00
Yuan (Bob) Gong 39805acc9e
[Manifest] Use kustomize native image transformer to override image (#3776)
* [Manifest] Use kustomize native image transformer to override image

* Revert unintended changes

* Fix kustomization.yaml location

* Fix inverse proxy image
2020-05-18 21:23:36 -07:00
Niklas Hansson 3d2c6ea84c
manual merge as the change it self is correct
but MKP mpdev:latest has an issue block our tests
2020-05-15 15:19:34 +08:00
Yuan (Bob) Gong 61960efb87
Add probes to metadata grpc service (#3765)
* Add probes to metadata grpc service

* Fix port name length limit

* Update README.md
2020-05-14 13:50:59 -07:00
Yuan (Bob) Gong f0c8effb3a
[Servers] Add liveness and readiness probes (#3757)
* probes for ml-pipeline-ui

* clean up comments

* Use wget instead of curl, because wget is included in alpine

* Also update marketplace manifest

* Add readiness/liveness probe for api server

* Add probes for python vis server
2020-05-13 23:46:22 -07:00
Alexey Volkov e83bd7d78f
Upgraded Argo to v2.7.5 (#3537)
* Upgraded Argo to v2.7.4

* Downgraded the Argo CLI version to 2.4.3

See https://github.com/argoproj/argo/issues/2793

* Removed the argo cli arg that had been removed

* Updated to Argo 2.7.5

* Added workflowtemplates and cronworkflows to the Role

* Added the new Argo CRDs
2020-05-11 23:52:21 -07:00
Renmin 9c16e12ba0
update version (#3694) 2020-05-06 19:37:09 -07:00
Renmin a6eb65013f
Move namespace to cluster-scoped (#3662)
* move namespace to cluster-scoped-resource

* fix doc
2020-04-30 00:30:17 -07:00
Renmin a94597f8cb
fix issue of creating default bucket (#3626) 2020-04-26 23:05:28 -07:00
hongye-sun 3dbbb16903
update version to 0.5.0 (#3566) 2020-04-22 14:00:50 -07:00
Renmin fcbb654563
Enable cache-deployer as fixed the root cause in other PR (#3574)
* default to kubeflow

* done

* include cache as we found root cause is namespace

* fix

* change the default to kubeflow, more for manual upgrade
2020-04-22 13:59:45 -07:00
Yuan (Bob) Gong 65e3db9443
Update ml-pipeline-ui-deployment.yaml (#3586) 2020-04-21 21:23:52 -07:00
Renmin 9b7689e971
Clusterrolebinding is using namespace which not parametrized (#3573)
submit quick to make sure others won't got confused
2020-04-21 22:23:10 +08:00
Renmin db8042a846
Fix test which uses Kustomize edit image but can't work with valueRef (#3572)
pass upgrade / installation test. submitting. now.

The e2e test fails but not due to this PR. Submit this PR to unlock KIR side
2020-04-21 20:10:41 +08:00
Renmin ad9bd5648d
use better sample name (#3558) 2020-04-20 18:57:35 +08:00
Renmin 91c3239004
OSS 1.0 Kustomize part-2 parameterize & fix CloudSQL (#3540)
submit without wait for fix for following as no dependency
https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/kubeflow_pipelines/3540/kubeflow-pipeline-e2e-test/1252173721301422090
2020-04-20 18:46:35 +08:00
Yuan (Bob) Gong 2742a3ed95
[SDK] Make service account configurable for build_image_from_working_dir (#3419)
* Add kfp-container-builder sa

* Allow service account to be configurable

* Fix tests

* Fix test

* Use documentation for service account to introduce compatibility with different types of installation

* updated doc

* clean up

* Update container_builder_test.py

* Update _build_image_api.py

* Update kustomization.yaml

* Add executable permission for presubmit tests mkp.sh
2020-04-15 00:06:02 -07:00
Yuan (Bob) Gong 2586e18990
Update metadata-envoy-deployment.yaml (#3502) 2020-04-14 03:13:12 -07:00
Renmin 75a92a740c
pass projectID from env/configmap without user input (#3458) 2020-04-13 09:05:48 -07:00
Renmin 9953e45619
one project share one default bucket (#3478) 2020-04-13 08:19:49 -07:00
Yuan (Bob) Gong e395276ea8
[Deployment] Move crds to cluster-scoped kustomize folders (#3498)
* [Deployment] Move crds to cluster-scoped kustomize folders

* Fix naming

* Rename folder

* Add STRUCTURE.md, fix bug

* fix
2020-04-13 01:09:48 -07:00
Jiaxiao Zheng cfcd824ed1
update version (#3467) 2020-04-07 14:13:44 -07:00
Yuan (Bob) Gong 163d49814b
Change cache-deployer to strategy recreate (#3456) 2020-04-06 19:55:43 -07:00
Rui Fang 85257a06ea
[Manifest] Cache - MKP deployment (#3430)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Add cache manifests for mkp deployment

* revert go.sum

* Add helm on delete policy for cache deployer job

* Change cache deployer job to statefulset

* remove unnecessary cluster role

* seperate clusterrole and role

* add role and rolebinding to mkp

* change secret role to clusterrole

* Add cloudsql support to cache

* fix comma

* Change cache secret clusterrole to role

* Adjust sequences of resources

* Update values and schema

* remove extra tab

* Change statefulset to job

* Add pod delete permission to cache deployer role

* Test changing cache deployer job to deployment

* remove extra permission

* remove statefulset check
2020-04-06 16:53:19 -07:00
Renmin 7fbb45d381
enable CloudSQL+GCSObjStore without default credential (#3378)
* enable CloudSQL+GCSObjStore without default credential

* refresh document

* fix schema

* minio project ID is required

* fix several

* self throtting Github requests to let build be stable

* can work now

* upsize and lowercase for bucket name

Co-authored-by: Renmin Gu <renming@google.com>
2020-04-03 17:22:19 +08:00
Niklas Hansson 8091752980
updated version (#3421) 2020-04-03 00:51:46 -07:00
Rui Fang 8e137a1ba6
[Manifest] Cache - Enable cache and cache deployer in base kustomization file (#3376)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Change cache deployer job to stateful set

* Delete cache deployer job

* Delete cache deployer job after it completes

* minor fix

* fix indention

* Change cache deployer job to statefulset

* Remove extra cluster role for cache deployer

* remove cache in base kustomize file for upgrade test

* minor fix

* Enable cache and cache-deployer in base kustomization file

* fix

* fix

* test

* test

* test

* Refactor cluster scope resources

* refactor

* Add namespace for sa

* Fix

* Add crds folder to cluster kustomization yaml

* namespace change

* fix

* fix

* fix

* update test

* Rename cluster to cluster-scoped-resource

* test adding namespace in kustomization file

* revert namespace for clusterrolebinding

* fix

* Add db_name in cache_deployment manifest

* rename

* change secret cluster role to role
2020-04-02 14:37:04 -07:00
Renmin bfb02f72c9
revert kfp-cache from Hosted/MKP (#3416)
Co-authored-by: Renmin Gu <renming@google.com>
2020-04-02 01:36:25 -07:00
Rui Fang c2a4a7376e
[Manifest]Fix - Cache mkp deployment (#3414)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Add cache manifests for mkp deployment

* revert go.sum

* Add helm on delete policy for cache deployer job

* Change cache deployer job to statefulset

* remove unnecessary cluster role

* seperate clusterrole and role

* add role and rolebinding to mkp

* change secret role to clusterrole

* Add cloudsql support to cache

* fix comma
2020-04-02 10:45:07 +08:00
Rui Fang b7d71b8f62
[Manifest]Cache - mkp deployment (#3343)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Add cache manifests for mkp deployment

* revert go.sum

* Add helm on delete policy for cache deployer job

* Change cache deployer job to statefulset

* remove unnecessary cluster role

* seperate clusterrole and role

* add role and rolebinding to mkp

* change secret role to clusterrole

* Add cloudsql support to cache
2020-03-31 21:03:28 -07:00
Ilias Katsakioris 14a56ba6ba
Manifests: Rename metadata gRPC server's resources to metadata-grpc-* (#3108)
* Manifests: Rename metadata gRPC server's resources to metadata-grpc-*

The metadata service deployed is a gRPC server.

Proper KF installation deploys both an HTTP server, naming the required
resources as 'metadata-deployment' and 'metadata-service', as well as a
gRPC server, naming the corresponding resources
'metadata-grpc-deployment' and 'metadata-grpc-service'.

KFP standalone installation manifests deploy solely the gRPC server, but
use naming identical to the KF's HTTP server one.
Applying them on top of an existing KF cluster breaks Metadata service.

In this PR we change the naming making it not diverge from a proper KF
installation. We also make MetadataWriter aware of that change.

Closes #2889.

Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com>

* Fix ConfigMaps' label

* metadata-configmap
* metadata-mysql-configmap

* README: Link to KF installation & reference KFP version
2020-03-26 22:22:23 -07:00
Rui Fang ad2c450c98
[Manifest] Cache - Fix upgrade manifest (#3338)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Change cache deployer job to stateful set

* Delete cache deployer job

* Delete cache deployer job after it completes

* minor fix

* fix indention

* Change cache deployer job to statefulset

* Remove extra cluster role for cache deployer

* remove cache in base kustomize file for upgrade test

* minor fix
2020-03-25 15:08:25 -07:00
Yuan (Bob) Gong 9873d24900
Add hostNetwork for marketplace proxy-agent manifest (#3330) 2020-03-24 00:46:47 -07:00
Ajay Gopinathan 2ff6d7bc03
Fix README formatting. (#3348)
* Fix README formatting.

* more fixes
2020-03-23 22:30:54 -07:00
Ajay Gopinathan d6abac5e8d
Bump version to 0.3.0 (#3329)
* Bump version to 0.3.0

* Fix formatting

* More formatting fixes

* More formatting fixes

* update requirements.txt

* update version
2020-03-22 19:44:45 -07:00
Rui Fang be497983cd
[Backend]Cache - Cache logic with db interaction (#3266)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* Add initial server logic

* Add const

* Change folder name

* Change execution key name

* Fix unit test

* Add Dockerfile and OWNERS file

This commit adds Dockerfile for building source code and OWNERS file for
easy review. This commit also renames some functions.

* fix go.sum

This PR fixes changes on go.sum

* Add local deployment scripts

This commit adds local deployment scripts which can deploy cache service
to an existing cluster with KFP installed.

* refactor src code

* Add standalone deployment scripts and yamls

This commit adds execution cache deployment scripts and yaml files in
KFP standalone deployment. Including a deployer which will generate the
certification and mutatingwebhookconfiguration and execution cache
deployment.

* Minor fix

* Add execution cache image build in test folder

* fix test cloudbuild

* Fix cloudbuild

* Add execution cache deployer image to test folder

* Add copyright

* Fix deployer build

* Add license for execution cache and cloudbuild for execution cache
images

This commit adds licenses for execution cache source code. Also adds
cloud build step for building cache image and cache deployer image.
Change the manifest name based on changed image.

* Refactor license intermediate data

* Fix execution cache image manifest

* Typo fix for cache and cache deployer images

* Add arguments in ca generation scripts and change deployer base image to google/cloud

* minor fix

* fix arg

* Mirror source code with MPL in execution_cache image

* Minor fix

* minor refactor on error handling

* Refactor cache source code, Docker image and manifest

* Fix variable names

* Add images in .release.cloudbuild.yaml

* Change execution_cache to generic name

* revice readme

* Move deployer job out of upgrade script

* fix tests

* fix tests

* Seperate cache service and cache deployer job

* mysql set up

* wip

* WIP

* WIP

* work mysql connection

* initial cache logic

* watcher

* WIP pod watching with mysql

* worked crud

* Add sql unit test

* fix manifest

* Add copyright

* Add watcher check and update cache key generation logic

* test replace container images

* work cache service

* Add configmap for cache service

* refactor

* fix manifest

* Add unit tests

* Remove delete table

* Fix sql dialect

* Add cached step log

* Add metadata execution id

* minor fix

* revert go.mod and go.sum

* revert go.sum and go.mod

* revert go.sum and go.mod

* revert go.mod and go.sum
2020-03-20 11:28:07 -07:00
Yuan (Bob) Gong f882f361cc
[UI] Show step pod yaml and events in RunDetails page (#3304)
* [UI Server] Pod info handler

* [UI] Pod info tab in run details page

* Change pod info preview to use yaml editor

* Fix namespace

* Adds error handling for PodInfo

* Adjust to warning message

* [UI] Pod events in RunDetails page

* Adjust error message

* Refactor k8s helper to get rid of in cluster limit

* Tests for pod info handler

* Tests for pod event list handler

* Move pod yaml viewer related components to separate file.

* Unit tests for PodYaml component

* Fix react unit tests

* Fix error message

* Address CR comments

* Add permission to ui role
2020-03-20 02:42:38 -07:00
Alexey Volkov 43d54e4768
Enable archiving logs (#3301)
Improve log persistence especially in on-prem scenarios.
2020-03-18 21:32:43 -07:00
Yuan (Bob) Gong f2beb96c01
[Deployment] Add secure=false explicitly in manifests for better observability (#3217)
* Revert "Revert "minio: Set secure=true to enable TLS by default (#3168)" (#3192)"

This reverts commit 743746b96e.

* Fix managed storage specific manifest

* Update pipeline.yaml

* Update client_manager.go
2020-03-12 00:44:37 -07:00
Jiaxiao Zheng 13fc87bf41
[Manifest] fix version for MLMD store (#3237)
* fix version

* update
2020-03-09 09:14:37 -07:00
Rui Fang ccdb885519
[Backend]Initial execution cache (#3036)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* Add initial server logic

* Add const

* Change folder name

* Change execution key name

* Fix unit test

* Add Dockerfile and OWNERS file

This commit adds Dockerfile for building source code and OWNERS file for
easy review. This commit also renames some functions.

* fix go.sum

This PR fixes changes on go.sum

* Add local deployment scripts

This commit adds local deployment scripts which can deploy cache service
to an existing cluster with KFP installed.

* refactor src code

* Add standalone deployment scripts and yamls

This commit adds execution cache deployment scripts and yaml files in
KFP standalone deployment. Including a deployer which will generate the
certification and mutatingwebhookconfiguration and execution cache
deployment.

* Minor fix

* Add execution cache image build in test folder

* fix test cloudbuild

* Fix cloudbuild

* Add execution cache deployer image to test folder

* Add copyright

* Fix deployer build

* Add license for execution cache and cloudbuild for execution cache
images

This commit adds licenses for execution cache source code. Also adds
cloud build step for building cache image and cache deployer image.
Change the manifest name based on changed image.

* Refactor license intermediate data

* Fix execution cache image manifest

* Typo fix for cache and cache deployer images

* Add arguments in ca generation scripts and change deployer base image to google/cloud

* minor fix

* fix arg

* Mirror source code with MPL in execution_cache image

* Minor fix

* minor refactor on error handling

* Refactor cache source code, Docker image and manifest

* Fix variable names

* Add images in .release.cloudbuild.yaml

* Change execution_cache to generic name

* revice readme

* Move deployer job out of upgrade script

* fix tests

* fix tests

* Seperate cache service and cache deployer job

* mysql set up

* Delete cache service in manifest, only test in presubmit tests

* fix

* fix presubmit tests

* fix

* fix

* revert unnecessary change

* fix cache image tag

* change image gcr to ml-pipeline-test

* Remove namespace in standalone manifest and add to test manifest
2020-03-03 16:13:47 -08:00
Renmin 4cb81ea047
revert #3168 as it doesn't work, put myself to reviewer (#3191)
Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-28 05:58:38 -08:00
Yuan (Bob) Gong 743746b96e
Revert "minio: Set secure=true to enable TLS by default (#3168)" (#3192)
This reverts commit 5cb158d313.
2020-02-28 20:51:23 +08:00
Johannes 'fish' Ziemke 5cb158d313
minio: Set secure=true to enable TLS by default (#3168)
* minio: Set secure=true to enable TLS by default

Not using TLS is a security concern, especially if using cloud storage
like S3. This should be set to secure to avoid people unknowingly not
using TLS.

To make the bundled minio still work, I've submitted
https://github.com/kubeflow/manifests/pull/950 to set secure=false in
this case explicitly.

* minio: secure=false in GCP & standalone manifests
2020-02-27 12:02:07 -08:00
Yuan (Bob) Gong 34ef1f1250
Update schema.yaml 2020-02-27 18:18:15 +08:00
Yuan (Bob) Gong 688aa08d42
Release 0.2.5 - e66dcb1860 (#3179)
* Updated component images to version e66dcb1860

* Updated components to version 3f4b80127f

* Update setup.py

* Fix swagger codegen download url

* Update versions
2020-02-26 23:59:19 -08:00
Gonzalo Gasca Meza 726682d8f9
Add BigQuery option (#3097) 2020-02-26 23:03:20 -08:00
Renmin 23e9d3786d
bump version to 0.2.4 and tiny BTW doc fix (#3115)
* update guide for cluster size

* bump version to 0.2.4

* remove ALPHA

* update sample source link

* fix snapshot

* done

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-19 03:54:25 -08:00
jingzhang36 881cde2be1
update version from 0.2.2 to 0.2.3 for kfp (#3072)
* update version from 0.2.2 to 0.2.3 for kfp

* 0.2.2 -> 0.2.3
2020-02-13 03:54:34 -08:00
Renmin 1a8225f32c
done (#3027)
Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-12 05:54:51 -08:00
Renmin 6dcfcc21d9
update changelog and document (#2990)
* bump version

* less resource request as MKP side anyway will request more

* done

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-05 03:19:54 -08:00
Renmin 9ffd11ac85
better default bucket name, less resource request (#2989)
* bump version

* less resource request as MKP side anyway will request more

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-05 00:37:54 -08:00
Renmin ae3ae79dc3
bump version (#2988)
Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-04 23:13:55 -08:00
Renmin 545de31eec
fix for Init Action (#2980)
* should can work

* didn't delete configmap for mpdev verify

* done

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-02-04 05:51:28 -08:00
Ajay Gopinathan fdcbdaf4b6
Bump version for release 0.2.1 of Pipelines (#2961) 2020-02-02 19:33:20 -08:00
Jiaxiao Zheng 3c6ee19393
[Backend] Patch default bucket name and project ID (#2938)
* update branch

* add env var

* update api-server

* populate default value

* lint

* bump build rule

* bump build rule

* bump travis bazel version

* fix

* Revert "bump travis bazel version"

This reverts commit 92db3842

* Revert "bump build rule"

This reverts commit be2bd7b4

* Revert "bump build rule"

This reverts commit 78926e3c

* Revert "fix"

This reverts commit 963e64b8

* fix

* patch all the samples

* unittest

* fix tests

* minor fix

* style change

* clean up
2020-01-31 15:25:21 -08:00
Yuan (Bob) Gong 4709c6b42f
[UI] Getting started page for hosted pipelines (#2935)
* Implement getting started page.

* Add feature flag to only show getting started page on hosted pipelines

* Add tests

* Fix format

* Implement requested layout in getting started page

* Minor adjust layout

* Fix tests

* Fix snapshots

* Update page title
2020-01-31 12:35:20 -08:00
dushyanthsc 553ee7ca90
Metadata: Update Metadata server version to v0.21.1 (#2931)
Change to bump the metadata version to v0.21.1 to  get the latest updates in MLMD.
2020-01-30 12:32:20 -08:00
Jiaxiao Zheng d9fb85dc51
[MKP] Add initializaion action for MKP deployer (#2932)
* update docker

* add gcloud sdk

* add gcs creation and configmap

* fix lint

* update docker

* rename/move old deploy.sh

* add permission

* typo

* fix random id

* namespace typo

* add project id

* fix typo
2020-01-30 10:42:19 -08:00
dushyanthsc 469e7196cf Metadata: Updating metadata server version from 0.14.0 to v0.21.0 (#2901)
Change to bump the metadata version to get the latest updates in MLMD.
2020-01-27 09:51:03 -08:00
jingzhang36 b20b85cda8 Have custom visualization on by default (#2891)
* Have custom visualization on by default

* Add flag to gcp market
2020-01-21 23:20:35 -08:00
Ilias Katsakioris 492cb1ce4e Manifests/Kustomize: Add metadata-writer to images (#2882)
Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com>
2020-01-20 18:55:37 -08:00
Renmin ca58b2227f fix the metadata writer manifest (#2878)
* fix the metadata writer manifest

* fix

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-20 01:25:35 -08:00
Renmin 232dd393f2 from 0.1.40 to 0.2.0 (#2872)
* done

* all done

* bump version to 0.2.1

* correct to 0.2.0

* done

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-18 23:05:35 -08:00
Renmin 1aabc74156 enable full scope for testing version (#2871)
* done

* all done

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-18 22:17:35 -08:00
IronPan a0a39a5eda Install application CRD and add pipeline application CR to pipeline standalone (#2585)
* install application CRD and add pipeline application CR

* add labels and let application manager to set ownerref

* fix

* address comments

* update test

* update test

* update test

* update readme

* fix test

* update

* update

* update

* Update application-crd.yaml

* fix

* fix

* Update .release.cloudbuild.yaml

* update tests

* Update kustomization.yaml

* Update deploy-pipeline-lite.sh

* Update ml-pipeline-viewer-crd-sa.yaml

* update tests

* update tests

* update tests
2020-01-16 09:20:25 -08:00
IronPan 403139511a Opt out proxy agent configmap (#2849)
Fix https://github.com/kubeflow/pipelines/issues/2848
2020-01-15 12:10:22 -08:00
Alexey Volkov dc34a3568d Service - Metadata writer (#2674)
* Metadata writer

* Added sleeper-based metadata writer

* Sleeper

* First working draft

* Added properties to Executions Artifacts and Contexts

Also added attributions.
context_id is now stored as label.

* Prefix the execution type names

* Ignoring TFX pods

* Fixed the deployment container spec

* Cleaned up the file and added deployment spec

* Added the Kubernetes deployment

* Added startup logging

* Made python output unbuffered

* Fixed None exception

* Formatting exceptions

* Prefixing the log message

* Improved handling non-S3 artifacts

* Logging input artifacts

* Extracted code to the link_execution_to_input_artifact function

* Setting execution's pipeline_name to workflow name

* Adding annotation with input artifact IDs

* Running infinitely

* Added component version to execution type name

* Marking metadata as written even for failed pods

* Cleaned up some comments

* Do not fail when upstream artifact is missing

* Change the completion detection logic

Waiting for Argo's "completed=true" instead of Kubernetes' "phase: Completed" introduced delays that lead to problems with missing input artifacts.
This changes allows us to log the outpuyt artifacts earlier.

* Added Dockerfile

* Added release deployment manifest

* Added OWNERS

* Switching to using MLMD service instead of direct DB access

* Adding licenses to the image

* Pinned Python's minor version

* Moved code to /backend/metadata_writer

Moved manifest to /manifests

* Added image building to CloudBuild

* Added Metadata Writer to release CloudBuild

* Added Metadata Writer to test scripts

* Finished the kustomization manifests

* Added Metadata Writer to marketplace manifests

* Added ServiceAccount, Role and RoleBinding for MW

* Fixed merge conflict

* Removed the debug deployment

* Forgot to add the chart templates for the SA and roles

* Specified the service account

* Switched to watching a single namespace

* Resolved feedback

Removed dev deployment comment from python code.
Added license.
Fixed the range of kubernetes package versions.

* More review fixes

* Extracted the metadata helper functions

* Improved the error message when context type is unexpected

* Fixed the import

* Checking the connection to MLMD

The latest tests started to have connection problems - "failed to connect to all addresses" and "Failed to pick subchannel".

* Improved the MLMD connection error logging

* Try creating MLMD client on each retry and using a different request

* Changed the MLMD connection check request

All get requests fail when the DB is empty, so we have to use a put request.
See https://github.com/google/ml-metadata/issues/28

* Using unbuffered IO to improve the logging latency

* Changed the URI schema for the artifacts

* Cleanup

* Simplified the kubernetes config loading code

* Resolving the feedback
2020-01-14 23:17:32 -08:00
Renmin fe6cf9a1f2 Update deployer base to 0.1.40 (#2696)
* add label and namespace to resource created inside pod

* fix

* done

* update existing configmap for better GC

* update dependency to make sure configmap got created before run script

* self fix

* use latest deployer base

* deployer

* all done

* upgrade deployer to 0.1.37 for bug bash

* all done

* done

* fix issue for MLMD

* done

* done

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-14 21:43:32 -08:00
Renmin b5c54e1ba7 better handle inverse-proxy configmap for better GC (#2391)
* add label and namespace to resource created inside pod

* fix

* done

* update existing configmap for better GC

* update dependency to make sure configmap got created before run script

* self fix

* all done

* reuse existing

* reuse

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-13 19:37:09 -08:00
jingzhang36 a8bd3ae836
visualization server wants kubernetes serivce account too (#2822)
* visualization server wants kubernetes serivce account too

* add ksa for visualization server and use this ksa for standalone and
hosted deployment of visualization server

* Use server name its ksa name

* add sa to pipeline.yaml
2020-01-12 17:46:49 +08:00
Renmin 10a4f55e3a Release bae654dc5c v0.1.40 (#2818)
* Updated component images to version bae654dc5c

* Updated components to version ff116b6f1a

* 0.1.40

* append old items

* fix line

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-09 01:33:46 -08:00
Jiaxiao Zheng 76489dc31d [Manifest/Doc] Another version bumping (#2797) 2020-01-07 13:34:18 -08:00
IronPan 2873420257 Delete namespaced-install.yaml (#2603)
This file is no longer needed as we store the materialized version in GCS
2020-01-07 06:40:19 -08:00
Ning dd207b1bcf
release 0.1.39 (#2799) 2020-01-06 13:15:17 -08:00
Jiaxiao Zheng 0d29d213f0 fix (#2778) 2020-01-02 00:59:40 -08:00
Jiaxiao Zheng a73d8f1db8 bump v (#2785) 2019-12-30 17:55:39 -08:00
Yuan (Bob) Gong 840979c6a9 Fix GCP cloudsql manifest (#2772) 2019-12-24 23:29:30 -08:00
Ning b3171f0e6c update manifest (#2762) 2019-12-19 15:41:32 -08:00
Sarah Maddox ee88c48d83 Added alpha notice to READMEs for KFP Marketplace (#2749)
* Added alpha notice to READMEs for KFP Marketplace.

* Added link to issue tracker.
2019-12-17 23:11:57 -08:00
Yuan (Bob) Gong 4a8d262abb Migrate standalone deployment to workload identity on GCP (#2619)
* Script to set up workload identity for standalone deployment

* Migrate tests to run on standalone + workload identity

* Fix test script

* Switch to static GSAs for testing, because they have name length limit

* Add workload identity binding for argo

* Fix argo workload identity bindings

* Remove user-gcp-sa from tests

* Remove use_gcp_secret from xgboost sample

* Allow debugging tests locally

* Wait for policies to take effect

* Update deploy-pipeline-lite.sh

* Update deploy-pipeline-lite.sh

* [WIP] test gcloud auth list with test-runner sa

* Add namespace

* test again

* Use new image builder

* test again

* Remove debug code

* Remove usages of use_gcp_secret

* Fix unit test and tensorboard pod template

* Add debug code again to test

* Try waiting until workload identity bindings are ready

* Fix some other samples

* Fix parameterized tfx oss sample

* Add retry to image building

* Try fixing tfx oss sample

* Fix compiled tfx oss sample

* Update all google/cloud-sdk to latest

* Try fixing parameterized tfx oss sample again

* Also verify pipeline-runner ksa is working

* Fix parameterized_tfx_oss sample

* Update gcp-workload-identity-setup.sh

* Revert unneeded change

* Pin to new google/cloud-sdk

* Remove wrongly commited binaries
2019-12-16 22:05:58 -08:00
dushyanthsc 979e3bdedb Metadata: Adding metadata-grpc config map (#2723)
This change adds the necessary config-map related to gRPC MLMD server.

To make the names more clear, this change also modifies the existing
'metadata-configmap' which provides mysql configurations to
'metadata-mysql-configmap'
2019-12-11 20:40:32 -08:00
Eterna2 5a0c2f423d [manifest] Added manifest for deploying on aws using s3 (#2633)
* Added manifest for deploying on aws using s3

* Revert "Added manifest for deploying on aws using s3"

This reverts commit 6a9c498c2c.

* Added readme and link to kubeflow-aws on how to deploy lightweight pipeline on AWS

* updated readme on how to deploy on aws

* Update README.md

* Update README.md
2019-12-11 17:52:32 -08:00
hongye-sun 1b04983d96
update server and sdk version to 0.1.37 (#2701)
* update server and sdk version to 0.1.37

* update api package version to latest

* fix test error
2019-12-05 17:05:07 -05:00
Ning 1b85a614bf add namespace to the resourcereference (#2651)
* add namespace to some run APIs

* update only the create run api

* add resourcereference for namespace runs

* add variables in const

* add types to toModel func

* bug fix

* strip the namespace resource reference when mapping to the db model

* add unit tests

* use gofmt

* replace belonging relationshipreference to owner

* put a todo for further investigation of using namespace or uuid

* apply gofmt

* revert minor change

* Update model_converter.go
2019-12-03 22:06:57 -08:00
Jiaxiao Zheng 4ca121d136 Bump version to 0.1.36 (#2661)
* update component sdk version

* bump python SDK and manifest version

* Revert "update component sdk version" to prevent conflict

This reverts commit 1fd6ddc8
2019-11-26 15:59:20 -08:00
Renmin 533ed1d6f9 Mkp better release - SemVersion (#2656)
* commit for test

* swap sequence for "mpdev verify"

* update App CRD version
2019-11-26 02:51:10 -08:00
jingzhang36 f308abeece Add a new field "TensorflowImage" to KFP viewer CRD file template. (#2544)
* Without version bump

* fix the delete caller

* return after delete

* reconciler removes old viewer crd file that misses image specification

* add frontend comment

* remove accidental changes that are irrelevant

* Revise log message

* Add error handling

* add test

* tensorflow image check only applies to viewer tensorboard type and thus put it after the type check.

* Use of default image instead of validation
2019-11-18 10:39:41 -08:00
IronPan e7c7c510b7 Enable inverse proxy to access host network (#2614)
When switching to GKE workload identity, the pods can't access to metadata server anymore by default due to metadata concealment. 
This can be unlocked by explicitly enable hostnetwork for the pod. 
https://cloud.google.com/kubernetes-engine/docs/how-to/protecting-cluster-metadata#concealment

This should be OK as proxy is an optional component. In any case when user feel this not a secure option he/she could opt out it.
2019-11-17 18:57:41 -08:00
Yuan (Bob) Gong ee9599fc93 Add Bobgy to OWNERS of kustomize manifests (#2600) 2019-11-13 10:43:39 -08:00
Yuan (Bob) Gong 12a9fa95d2 Bump version to 0.1.35 (#2598)
* Bump version to 0.1.35

* Update kustomization.yaml

* Update __init__.py
2019-11-13 10:43:31 -08:00
Jiaxiao Zheng 9f258a83fb
[Doc] Minor fix to MKP secret setup guide (#2576)
* fix doc

* apply suggestion

* split line
2019-11-09 07:57:06 -08:00
Eterna2 aa2d2f42b3 [pipeline-ui] Retrieve pod logs from argo archive (#2081)
* Retrieve pod logs from argo archive

* Added aws instance profile iam credential support for minio client. Read workflow status for argo archive location for pod logs.

* fix minor typo, and enforce typing for minio client options

* Update helm chart for pipelines ui role with permission to access secret and workflow crd

* remove unnecessary type cast

* Fix bug: s3client should be a callable, so that iam token is refreshed
2019-11-07 16:58:18 -08:00
jingzhang36 4f8e9fb169 Increase kfp version in kustomize and sdk init (#2567) 2019-11-07 14:00:26 -08:00
IronPan 8112af11aa
Update kustomization.yaml (#2530)
/assign @Ark-kun
2019-11-01 17:01:59 -07:00
IronPan 7356d35663 update location (#2520) 2019-11-01 16:45:41 -07:00
Jiaxiao Zheng a1f708bdd5 [MKP] Reduce the logo image size. (#2519)
* update

* add logo
2019-10-31 10:54:09 -07:00
IronPan 47bb1036f9 remove unnecessary namespace in UI rolebinding spec (#2510)
/assign @Bobgy
2019-10-30 14:07:37 -07:00
IronPan ff4123a248 clean up viewer crd spec (#2511)
/assign @Bobgy
2019-10-30 13:27:38 -07:00
Jiaxiao Zheng fc8e8e0e27 [Doc] Update permission requirement in README (#2422)
* add tfx permission guide

* add xgboost guide

* add a direct link to dataproc API

* Add tip

* Update README.md

* Update and apply suggestions

* Fix

* Update README.md
2019-10-29 10:36:50 -07:00
Renmin fdbc05ec02 update inverse proxy custom permission setting and role setting (#2493)
* update inverse proxy custom permission setting

* update doc

* update per comment

* fix the role command
2019-10-25 05:15:39 -07:00
Mingfei 0b8d2e12d1 clusterrole for pipeline-runner with seldondeployments (#2458) 2019-10-25 02:35:45 -07:00
Jiaxiao Zheng 2ebb1d801b [MKP] Fix metadata DB configmap (#2491)
* title change

* Update

* fix configmap
2019-10-24 23:33:38 -07:00
IronPan 7c1efbc6ee update base image to fix the GKE GC issue for marketplace deployment (#2484) 2019-10-24 18:57:39 -07:00
Renmin bb6066a029 done (#2481) 2019-10-24 09:29:10 -07:00
Renmin a558f5c350 update link so that user can easily jump to CAIP after deployment (#2377)
* update link so that user can easily jump to CAIP after deployment

* update info table
2019-10-24 03:17:07 -07:00
Jiaxiao Zheng 868b135e4b [Doc] Minor doc update (#2394)
* Update doc

* Update per suggestion

* Update guide.md

* Update guide.md

* fix a typo

* Add namespace
2019-10-23 22:31:07 -07:00
IronPan d36e664e3c Add owner files for marketplace deployment (#2436)
/assign @rmgogogo
2019-10-21 11:54:15 -07:00
hongye-sun aab5a50120 Update sdk release version (#2434)
* Update sdk release version

* Update the server image version
2019-10-18 12:19:08 -07:00
Yuan (Bob) Gong 49a50df518 Fix pipeline cannot run bug when using marketplace managed storage (#2341)
* Make gcs bucket name configurable + fix marketplace managed storage does not run pipelines successfully

* gcsBucketName is computed from cloudsqlInstanceConnectionName + avoid code duplication
2019-10-15 20:44:07 -07:00
Renmin 2c73968976 enlarge MKP cluster constraint (#2339)
* fix for MKP

* follow best practice

* enlarge cluster constraint from MKP
2019-10-09 17:47:00 -07:00
Jiaxiao Zheng ca17faa42b [MKP] Remove service account credential from deployment page. (#2308)
* Finalize. Add todo

* Remove
2019-10-09 13:45:49 -07:00
Jiaxiao Zheng 6a8d105194 fix typo (#2342) 2019-10-09 11:47:51 -07:00
Jiaxiao Zheng ed285ce768 [MKP/doc] Update doc for changes of service account credential. (#2309)
* Update README for MKP development. Remove managed SQL part and update verison tag.

* Update guide for MKP deployment.

* Small fix.

* fix one missing thing
2019-10-04 18:13:13 -07:00
Jiaxiao Zheng 88864fc840 Fix gcr paths (#2289) 2019-10-02 15:42:08 -07:00
Renmin bf81e635cf fix for MKP (#2288)
* fix for MKP

* follow best practice
2019-10-02 14:04:08 -07:00
Renmin f649eb57a0 Disable cloudsql and update to v0.1.31 (#2284)
* add owner

* updated owners per comment, keep at least one from SHA for easy co-operation

* limit OWNERS scope first

* refine doc for MKP

* fix James comments for wording

* fix doc and mask pwd

* temp disable managed storage

* also update images to 0.1.31
2019-10-01 18:01:07 -07:00
Yuan (Bob) Gong 51f5a5a9ed Add third party license + source code to argo and minio images to comply with their license (#2201)
* Add concatenated third party license to argo and minio images

* Add MPL dependencies source code in argoexec docker image

* Include source code of MPL dependencies of minio in its image

* Add source code for argo dependencies with MPL in argo images

* Updated workflow to release manually, included cloudbuild config to build each image, also added README for instructions

* docker env naming consistency

* Include release scripts and instructions in third_party/README.md

* Update README.md

* undo cloudbuild.yaml changes, update README

* Change argo and minio image tags in manifests

* Remove unneeded code

* Fix copyright year
2019-09-30 22:39:32 -07:00
Renmin 4b127208e0 Update doc for MKP release (#2242)
* add owner

* updated owners per comment, keep at least one from SHA for easy co-operation

* limit OWNERS scope first

* refine doc for MKP

* fix James comments for wording

* fix doc and mask pwd
2019-09-30 21:27:33 -07:00
Yuan (Bob) Gong 3d7df0c522 Fix pipeline lite README (#2217)
/cc @jingzhang36
2019-09-24 10:27:29 -07:00
IronPan 9b40564dc8 fix release CB and update readme (#2204) 2019-09-23 13:05:25 -07:00
IronPan 44a08a5275 Update pipeline-lite to v0.1.31 (#2189)
* small fixes

* update version to 0.1.31

* update version to 0.1.31
2019-09-22 09:57:23 -07:00
dushyanthsc ab8f5ed043 metadata: deployment update (#2093)
Change to create mysql credential K8 object as part of deployment
2019-09-20 21:43:24 -07:00
dushyanthsc 57d9f7f1cf metadata: Deployment scripts update (#2180)
Change to parameterize database_name, host and port for metadata
deployment using config maps
2019-09-20 16:45:25 -07:00
Jiaxiao Zheng ec1ae92210 Update help info for SA. (#2179) 2019-09-20 15:49:23 -07:00
IronPan a86ee2dfbb Fix more typos in KFP marketplace application (#2158)
* small fixes

* remove trunk for db name

* Delete Makefile

* more fixes

* more fixes

* more fixes
2019-09-19 02:09:01 -07:00
IronPan 0971006bf2
Removing the unnecessary trunk when generating the DB name (#2151)
* small fixes

* remove trunk for db name

* Delete Makefile

* more fixes
2019-09-18 23:55:26 +08:00
IronPan 6e0357d0c2
Small fixes to the kfp marketplace (#2138)
* small fixes

* small fixes
2019-09-17 17:25:10 +08:00
IronPan 6b6b83c9c5
Update documentations for Kubeflow Pipelines marketplace (#2125)
* fix

* update documentations

* update documentations

* update documentations

* small fixes
2019-09-17 17:23:31 +08:00
IronPan 0efdb55a84
fix (#2129) 2019-09-17 16:38:39 +08:00
IronPan f2b1437105 Add documentation for Marketplace CLI deployment (#2128)
* fix

* fix

* fix
2019-09-16 21:36:23 -07:00
IronPan d0bf18d83b
fix (#2122) 2019-09-15 08:46:17 +08:00
Ning f21d3eda0e update kustomize (#2111) 2019-09-13 14:42:29 -07:00
dushyanthsc 22eb915b4c gcp-marketplace: Updating metadata deployment to use gRPC server (#2083)
Change to update gcp marketplace helm charts to use gRPC MLMD metadata
server.
2019-09-12 20:02:31 -07:00
IronPan 97b01b8f4a
Remove broken environment variables in namespace install (#2087)
Likely introduced by a bad merge 
https://github.com/kubeflow/pipelines/pull/1918/files#diff-4f0f86e24382decf794ec0f3d0c4c54cR672
2019-09-12 11:06:38 +08:00
dushyanthsc 635370d256 manifests/metadata: Metadata server invocation change (#2041)
Change to update invocation to use command line parameters as against
using proto text.
2019-09-10 19:28:46 -07:00
IronPan b7f37da23f fix (#2079) 2019-09-10 02:16:55 -07:00
IronPan 715471f113 Store DB password as K8s secret for marketplace deployment (#2067)
* pass in secret

* fix

* use application name by default for database prefix

* bug fixes and bump kfp version

* Update application.yaml

* fix objectstore name

* fix objectstore name

* store db pwd as secret

* fix

* fix

* fix

* fix
2019-09-09 18:06:54 -07:00
IronPan 0d81785466 Chang object store naming for marketplace deployment (#2066)
* pass in secret

* fix

* use application name by default for database prefix

* bug fixes and bump kfp version

* Update application.yaml

* fix objectstore name

* fix objectstore name
2019-09-09 03:59:18 -07:00
IronPan 3edaa38e51 fixes name inconsistency for marketplace manifest and bump kfp version (#2065)
* pass in secret

* fix

* use application name by default for database prefix

* bug fixes and bump kfp version

* Update application.yaml
2019-09-09 01:31:20 -07:00
IronPan be01314ac2 Add parameter for service account credential (#2039)
* pass in secret

* fix

* use application name by default for database prefix
2019-09-08 23:51:17 -07:00
Riley Bauer 8d196d4dec Adds Metadata Artifacts to UI (#2057)
* Working, though the request seems malformed

* Working with grpc-web. trying to push to cluster

* WIP

* With great hax come great success

* Begin moving some metadata UI pages to KFP

* Artifact list and details pages work! A lot of clean up is needed. Look for console.log and TODO

* Clean up

* Fixes filtering of artifact list

* More cleanup

* Revert ui deployment

* Updates tests

* Update envoy deployment
2019-09-06 13:19:19 -07:00
dushyanthsc 1b4919f3fb
manifest/metadata: Switching to use metadata grpc server (#1973)
This change switches pipeline deployments to use metadata-grpc server
instead of kf-mlmd server.
2019-09-03 16:15:02 -07:00
IronPan 04ddb3277f
Add readme and update application parameter (#2021)
* pass pipeline runner service account to api server

* update schema

* fix

* change image

* rename

* fix

* fix

* update readme

* update readme
2019-09-03 12:44:03 -07:00
IronPan 8a87bd3641
Change schema.yaml to v2 (#2009)
* pass pipeline runner service account to api server

* update schema

* fix

* change image

* Update schema.yaml

* Update values.yaml

* Update values.yaml
2019-09-03 11:51:44 -07:00
IronPan 4baf453342
update service account for non managed storage case (#2023) 2019-09-03 11:01:12 -07:00
IronPan 524cbb2a18 pass pipeline runner service account to api server (#1996) 2019-08-29 17:19:13 -07:00
IronPan 180f003947
Update gcp-configurations-patch.yaml
remove information checked in by accident
2019-08-29 14:02:45 -07:00
IronPan 5d9da976a4 remove service account (#1983) 2019-08-29 00:01:58 -07:00
IronPan 77c6c013e3 update application name (#1980) 2019-08-28 17:03:54 -07:00
IronPan 52d3ef5473
Update GCP marketplace deployer dockerfile (#1975)
* Update docker

* Update Dockerfile
2019-08-28 15:40:07 -07:00
IronPan 7bb4c2a013
Add CRD admin permission to deployer SA (#1977)
This is following the instruction here
https://github.com/GoogleCloudPlatform/marketplace-k8s-app-tools/blob/master/docs/schema.md#deployerserviceaccount
2019-08-28 13:48:22 -07:00
IronPan f63c301344 add GCP marketplace application manifest for kubeflow pipelines (#1621)
* move kustomize to subdirectory

* add application cr

* add label application crd can recognize

* Update README.md

* Update README.md

* Update README.md

* add schema

* update

* stage

* add deployer

* sync components

* add components

* fix issues

* add configuration

* fix

* fix

* update
2019-08-27 14:49:11 -07:00
IronPan 336760cfd8 sync namespaced install file (#1932) 2019-08-22 17:47:18 -07:00
hongye-sun 119405deaf
upgrade backend image versions (#1918) 2019-08-22 15:49:29 -07:00
dushyanthsc db6c9b4fd0 pipeline-lite: Introduce metadata component to pipeline-lite (#1840)
This change introduces the metadata component to pipeline-lite
installation. This installation:

  1. Does not include metadata-ui
  2. mysql installation is used instead of metadata-db
  3. Replica count has been reduced 1 instead of 3
2019-08-22 13:11:18 -07:00
IronPan 60de44b61d update namespaced-install.yaml (#1926) 2019-08-22 12:36:34 -07:00
IronPan 06b7ad659f
Cleanup pipeline-lite deployment (#1921)
* restructure

* working example

* working example

* move mysql

* moving minio and mysql out

* add gcp

* add files

* fix test

* extract parameters to single place

* update

* update readme

* update readme

* address pr comment
2019-08-22 10:03:28 -07:00
Kirin Patel 8c3d6fe121 Add visualization-server service to lightweight deployment (#1844)
* Add visualization-server service to lightweight deployment

* Addressed PR suggestions

* Added field to determine if visualization service is active and fixed unit tests for visualization_server.go

* Additional small fixes

* port change from 88888 -> 8888
* version change from 0.1.15 -> 0.1.26
* removed visualization-server from base/kustomization.yaml

* Fixed visualization_server_test.go to reflect new changes

* Changed implementation to be fail fast

* Changed host name to be constant provided by environment

* Added retry and extracted isVisualizationServiceAlive logic to function

* Fixed deployment.yaml file

* Fixed serviceURL configuration issuse

serviceURL is now properly obtained from the environment, the service ip address and port are used rather than service name and namespace

* Added log message to indicate when visualization service is unreachable

* Addressed PR comments

* Removed _HTTP
2019-08-21 18:30:33 -07:00
IronPan 2e7f2d4816
Add cloud sql and gcs connection for pipeline-lite deployment (#1910)
* restructure

* working example

* working example

* move mysql

* moving minio and mysql out

* add gcp

* add files

* fix test
2019-08-21 16:36:35 -07:00
IronPan 7e062cea77
Update README.md 2019-08-21 15:14:31 -07:00
Ajay Gopinathan 0369a4c382 Update manifests to point to 0.26 release. (#1870) 2019-08-16 21:02:07 -07:00
IronPan a9602fbc3f
Add API to rerun the pipeline (#1720)
* add resubmit proto

* add compiled code

* fix

* add resubmit proto

* add

* refactor

* update builder

* refactor

* refactor

* refactor

* refactor

* refactor

* refactor

* add test

* add test

* add test

* add test

* fix test

* fix test

* fix test

* fix test

* fix test

* fix test

* fix test

* address comments

* add comments

* change request body def

* recompile api

* retry instead of resubmit

* update test

* update test

* fix tests

* fix tests

* fix tests

* robust retry

* robust retry

* robust retry

* robust retry

* robust retry

* robust retry

* robust retry

* robust retry

* robust retry

* add error handling

* reorder the call

* remove logic to update the database entry

* add mock

* add tests for rerousrce manager

* update error handling logic

* fix tests

* address comments
2019-08-07 13:59:06 -07:00
IronPan 609c7a6c0d
Move kustomize manifests a dedicate directory (#1690)
* move kustomize to subdirectory

* add application cr

* add label application crd can recognize

* Update README.md

* Update README.md

* Update README.md

* add schema

* update

* stage

* fix

* update
2019-07-30 10:21:16 -07:00
jingzhang36 a16f380637 Increase version in namespaced-install.yaml too (#1684) 2019-07-26 10:13:59 -07:00
jingzhang36 80d16406f6 Increase KFP version in (1) sdk/python/setup.py (2) component_sdk/python/setup.py (3) manifests/base/kustomization.yaml (#1674)
* Increase KFP version in sdk/python/setup.py component_sdk/python/setup.py

* Increase kfp version in manifests/base/kustomization.yaml
2019-07-25 20:03:58 -07:00
IronPan a8a162c6b9
switch third party images to gcr (#1622) 2019-07-16 12:03:21 -07:00
jingzhang36 784c4f12b7 viewer controller is now namespaced so no need for cluster role (#1623)
* viewer controller is now namespaced so no need for cluster role

* our default namespaced install (kubeflow namespace) can also use Role instead of ClusterRole
2019-07-16 09:35:26 -07:00
Ning cce52f4dfb update manifest and sdk version (#1617) 2019-07-12 14:39:04 -07:00
jingzhang36 b957a9872c Viewer CRD controller running under namespace (#1562)
* Viewer CRD controller running under namespace

* Change docker file and add manifest deployment yaml to support the new flag namespace

* Change docker file to support new flag namespace for viewer crd controller

* Modify kustomization.yaml and namespaced-install.yaml

* Change file name from ml-pipeline-viewer-crd-deployment to ml-pipeline-viewer-crd-deployment-patch

* Fix typo

* Remove some duplicate configs in namespaced-install
2019-07-03 11:39:40 -07:00
Ilias Katsakioris f0c8432748 Manifests: Update pipeline-runner permissions (#1570)
* Supplement pipeline-runner permissions according to
  kubeflow/kubeflow#2556
2019-07-02 17:11:13 -07:00
IronPan db9d1ca222 Add PV/PVC admin permission for pipeline runner (#1576) 2019-07-02 12:03:09 -07:00
Riley Bauer 3882c306cd Update manifests for 0.1.23 (#1567)
* Updates KFP manifests for v0.1.23

* Bump version in sdk
2019-06-28 11:32:58 -07:00
hongye-sun eee7834988 increment sdk versions (#1538)
* increment sdk versions

* Update cluster binary versions.
2019-06-20 21:24:36 -07:00
IronPan 808d384994
use cluster role for viewer crd for now since viewer crd doesn't support namespaced deployemnt (#1499) 2019-06-12 14:45:37 -07:00
IronPan f8b06387b8 update manifest/ readme (#1454)
/assign @Ark-kun
2019-06-07 09:51:58 -07:00
IronPan fd1c2a4bf3
Update README.md 2019-06-06 12:41:44 -07:00
IronPan 94344ae6ed
update manifest (#1446) 2019-06-06 10:07:18 -07:00
IronPan 67d40c41da
Update the doc to enable proxy agent by default (#1432)
/assign @paveldournov @Ark-kun
2019-06-04 00:25:00 -07:00
IronPan 18a8be7b5c enable proxy by default (#1431) 2019-06-03 22:54:59 -07:00
Ilias Katsakioris 198b022368 Manifests: Update Argo version to v2.3.0 (#1425)
Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com>
2019-06-03 20:14:59 -07:00
jingzhang36 37cea57984 Update pipeline version in KFP manifest (#1424) 2019-06-03 19:12:58 -07:00
IronPan 7bf95b1327 expose configuration for setting the max number of tensorboard (#1393)
* clean up

* argo

* expose configuration for max number of viewers

* add sample how to configure

* Revert "argo"

This reverts commit 3ff0d07679.

* update namespaced-install.yaml
2019-05-28 13:18:58 -07:00
IronPan ae64ca5516
Update README.md 2019-05-23 17:18:03 -07:00
IronPan 3736a70447
Update README.md 2019-05-23 17:17:28 -07:00
IronPan e8d09825ac update kfp ui rbac permission (#1350)
to reflect change here
https://github.com/kubeflow/kubeflow/pull/3255/files#diff-a843466b38824bfb7c7053bf4d4a6bf7R158
2019-05-17 11:46:18 -07:00
IronPan f5cc93045f
Update README.md 2019-05-17 01:25:39 -07:00
IronPan bdf62702c3
Update README.md 2019-05-14 23:48:49 -07:00
IronPan 7fc7b02c79
Update README.md 2019-05-14 23:40:10 -07:00
Eterna2 4eeeb6e224 [Frontend] minio client in API server can be configured with environment variables (#1324)
* kfp frontend API service can configure minio client params thru env vars

* minio endpoint is composed from host and namespace to support k8s yaml

* Added kustomize patch for pipeline-ui deploy
2019-05-14 17:38:18 -07:00
IronPan 47efd8235f Add proxy agent as optional kustomize component (#1325)
* clean up

* add proxy

* revert ns
2019-05-14 15:24:21 -07:00
IronPan c3235d725e
kustomize kf pipeline (#1305)
* stage minimum

* update

* working example

* move code

* remove code

* update readme

* address comments
2019-05-13 23:56:58 -07:00