Commit Graph

414 Commits

Author SHA1 Message Date
Yuan (Bob) Gong 0c4129c900
feat!: use argo emissary executor by default. Fixes #5718 (#5926) 2021-07-22 05:08:26 -07:00
Yuan (Bob) Gong 04342436a9
feat: use argo v3.1.2-patch.2. Part of #5718 (#6082)
* feat: use argo v3.1.2-patch

* upgrade to argo v3.1.2-patch.2

* update versions to v3.1.2
2021-07-22 03:51:25 -07:00
Yuan Gong efe22ca0a4 test(manifests): temporarily disable tests that use kpt because of v1 breaking changes 2021-07-21 12:37:29 +00:00
Tianchu Zhao 6e738caa79
fix(deployment): workaround fix, 34s timeout bug for argo crd on k8s1.20+ (#6075) 2021-07-17 17:06:14 -07:00
Yuan Gong f8ce345863 chore(release): bumped version to 1.7.0-rc.1 2021-07-15 11:23:40 +00:00
Yuan Gong 27c558e4ea chore(release): bumped version to 1.7.0-rc.0 2021-07-15 09:29:15 +00:00
Yuan (Bob) Gong cb75b4483a
feat: use argo v3.1.1-patch. Part of #5930 (#6027)
* feat: use argo v3.1.1-patch

* chore: also upgrade argo go modules to patch

* add comment

* fix download

* fix licenses

* go mod tidy
2021-07-14 10:41:43 +08:00
XiyueYu e6f498d4b5 chore(release): bumped version to v1.7.0-alpha.3 2021-07-12 16:57:12 -07:00
Yuan Gong bd6350dc27 chore(release): bumped version to 1.7.0-alpha.2 2021-07-03 13:29:11 +00:00
Yuan (Bob) Gong 4451c7d453
feat: upgrade argo to v3.1.1. Part of #5718 (#5971)
* feat: upgrade argo to v3.1.1

* go mod tidy
2021-07-03 05:52:49 -07:00
Yuan Gong a1d513eb41 chore(release): bumped version to 1.7.0-alpha.1 2021-06-28 13:19:50 +00:00
Yuan (Bob) Gong 3973407310
feat: upgrade argo-workflows to v3.1.0. Part of #5718 (#5922) 2021-06-28 04:37:14 -07:00
Yuan (Bob) Gong e958156274
fix(deployment): reduce artifact name collision rate under heavy load (#5894)
* fix(deployment): reduce artifact name collision rate under heavy

* fix

* update snapshots
2021-06-23 22:17:19 -07:00
Andrew Scribner 7607841f6a
feat(manifests): Expose sync.py frontend/visualization image as env var, add tests (#5732)
* Update and add tests to kubeflow-pipelines-profile-controller (sync.py)

- add control of visualization and frontend images through environment variables with backwards compatability to previous
- add a barebones end-to-end pytest suite and dev environment to help demonstrate behaviour and catch mistakes

* fix: remove deprecated comment

* fix: typo in copyright

* Refactor to simplify code, add test case

* Adds test case for passing parameters as arguments to factory (previous version had copy/paste errors that broke this feature, and no tests to catch this)
* fix some copy/paste errors (ex: controller was named improperly in HTTPServer call)

* feat: Add script to run tests for sync.py

* fix: resolve merge conflicts with master
2021-06-15 08:32:37 -07:00
Yuan (Bob) Gong 28431fc3fd
feat: update MLMD to 1.0.0 (#5786)
* feat: update MLMD to 1.0.0

* update MLMD frontend client
2021-06-03 10:34:36 -07:00
Yuan (Bob) Gong adc1951518
feat(deployment): configurable v2 compatible mode default pipeline root. Part of #5680. Fixes #5704 (#5750)
* feat(deployment): configurable v2 compatible mode default pipeline root

* clarify documentation
2021-06-02 08:52:02 -07:00
Yuan (Bob) Gong cf2807c558
fix(deployment): GCP AI Platform Pipelines -- correct default values when some fields left empty. Fixes #5717 (#5777)
* fix(deployment): GCP AI Platform Pipelines -- correct default values when some fields left empty

* backward compatible databaseNamePrefix behavior
2021-06-02 05:25:02 -07:00
Yuan Gong 1c66f93f51 chore(release): bumped version to 1.6.0 2021-05-24 13:36:51 +08:00
juliusvonkohout 87d9dd5c23
Fix pipelines with Kubeflow profile quota (#5695) 2021-05-19 09:56:45 -07:00
Yuan Gong bc329aa816 chore(release): bumped version to 1.6.0-rc.0 2021-05-13 07:17:55 +00:00
Luis Magana d25b001532
Add configurable connection max lifetime for MySQL (#5583)
* adding config for SetConnMaxLifetimeSec




making dbConMaxLifetime configurable


nit add time value


change the actual variable

* standardizing and documenting

* switched to getdurationconfig
2021-05-07 23:39:07 -07:00
capri-xiyue 31939086d6
feat: update MLMD to 0.30.0 (#5592) 2021-05-05 17:03:05 -07:00
Alexey Volkov cc83e1089b
Assigned copyright to the project authors (#5587) 2021-05-05 13:53:22 +08:00
Niklas Hansson 837429c072
feat(cache) : allow user to reset the node selector and affinity for cached steps. Fixes #4777 (#5456)
* initial work on removing pod affinity for cacheing image

* updated env extraction

* check if nodeselector or affinity is set. Also updated the variables needed

* missed to save

* fix test

Signed-off-by: NikeNano <niklas.sven.hansson@gmail.com>

* fixed test

* update manifests

* clean up

* remove withspaces
2021-04-23 17:19:43 -07:00
Yuan (Bob) Gong 3d72a2d9d5
chore(release): bump version to 1.5.0 on master branch (#5508) 2021-04-20 01:51:38 -07:00
Yuan Gong 3b5030aefa chore(release): bumped version to 1.5.0-rc.3 2021-04-09 13:28:26 +00:00
Yuan (Bob) Gong 9cbed6f7e8
refactor(deployment): move metadata-grpc-service istio config to its own folder (#5446)
* refactor(deployment): move metadata-grpc-service istio config to its own folder

* move virtual service too

* fix port
2021-04-09 20:51:38 +08:00
Yuan Gong f870570a79 chore(release): bumped version to 1.5.0-rc.2 2021-04-02 10:06:48 +00:00
Yannis Zarkadas 29676cef53
manifests: Allow traffic to the API-Server if the identity header is not set (#5420)
Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-04-02 18:04:12 +08:00
Yannis Zarkadas 6033718786
fix(manifests): Multi-User manifests fixes for 1.3 (#5416)
* manifests: Fix pipelines-profile-controller configMapRef

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Fix AuthorizationPolicy for multi-user

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Disable istio injection for metadata-envoy-deployment

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* fixup! manifests: Fix AuthorizationPolicy for multi-user

* manifests: Add minio option for Istio

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-04-02 02:43:20 -07:00
Yuan Gong 1cd189567d chore(release): bumped version to 1.5.0-rc.1 2021-04-01 11:59:53 +08:00
Yuan (Bob) Gong 5d0f3a3d32
fix(deployment): fix default resource requests. Fixes #5148 (#5409)
* fix(deployment): fix default resource requests

* fix mkp presubmit for rc version
2021-04-01 11:52:14 +08:00
Yuan (Bob) Gong 29da88b2be
chore(release): fix metadata release script and add presubmit (#5410)
* chore: fix metadata release script

* add presubmit test to verify release script

* fix

* show release script diff
2021-04-01 11:04:37 +08:00
Yuan (Bob) Gong 5ba3054ac9
fix(deployment): pipelines profile controller uses pipelines.kubeflow.org/enabled label (#5407) 2021-04-01 10:03:40 +08:00
Yannis Zarkadas ec2da24bc0
fix(deployment): Manifests fixes for KF 1.3 (#5406)
* manifests: Fix Argo ClusterRoleBinding reference

Closes #5402

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Default userid header

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Add metacontroller to third-party

Move metacontroller to third-party, as a dependency of KFP.

Closes #5400

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Add missing VirtualService for metadata UI

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Include metacontroller in multi-user installs

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-03-31 18:43:18 -07:00
Yuan (Bob) Gong 1b13be77a7
fix(manifests): fix error when kpt pkg get manifests. Part of #5368 (#5390)
* fix(manifests): fix error when kpt pkg get manifests

* fix another case

* add a test to check the problem automatically

* add script to prepare for presubmit test

* fix permission
2021-03-30 23:42:18 -07:00
Niklas Hansson a73a255fed
fix(docs): updated readme wait condition (#5371) 2021-03-26 17:37:47 -07:00
DavidSpek 409dd60c40
chore: Add platform-agnostic-multi-user-pns manifest (#5377) 2021-03-25 06:07:45 -07:00
Yannis Zarkadas d9c019641e
chore(manifests): Move metadata to third-party and add overlay for separate db (#5345)
* manifests: Add overlay for separate db to metadata

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Refactor kustomizations to use metadata from third-party folder

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Add platform-agnostic-multi-user-legacy env

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-03-24 02:20:44 -07:00
Yannis Zarkadas c36efb2ca5
chore(manifests): Add commonLabels to cache kustomizations (#5346)
Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-03-22 21:27:43 -07:00
Yuan Gong d2c66deb93 chore(release): bumped version to 1.5.0-rc.0 2021-03-22 21:25:52 +08:00
Yuan (Bob) Gong f474f934f6
chore(release): fix release script (#5352) 2021-03-22 21:19:54 +08:00
Yannis Zarkadas a2e556806f
fix(manifests): Fix multi-user manifest (#5344)
Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-03-21 18:33:17 -07:00
Yuan (Bob) Gong 8079ccaf8d
feat(deployment): move k8s application manifests to third-party and hide log by default (#5336)
* feat(deployment): move k8s application to third-party

* fixed multi-user manifest, add a quick test script

* hide application controller log by default
2021-03-19 01:01:16 -07:00
Ilias Katsakioris 5db66e0592
feat(backend): Support authentication with ServiceAccountTokens. Part of #5138 (#5286)
* Introduce kubernetes client utils

Introduce common utils for client initialization to factor out common
code.
This is a step towards fulfilling kubeflow/pipelines#4738.

* Use common util to initialize k8s clientset

* Introduce TokenReview client and fake ones

* Extend ResourceManager with a TokenReview client

* Extend FakeClientManager with a fake TokenReview Client

* Introduce authentication utils

* Introduce HTTP header authenticator

* Initialize Kubeflow-UserID header authenticator

* Refactor getUserIdentity() to use auth_util

* Move getting user identity logic to resource manager

Have the resource manager authenticate the request.
In following commits we will be extending the authentication methods to
use, among others, Kubernetes clients. Thus, we move the logic to the
resource manager to benefit from the clients kept its context.

* Introduce constants for the TokenReview authentication

* Introduce TokenReview authenticator

* Extend authenticators with a TokenReview one

Extend the authenticators which the KFP apiserver applies on a request
with a TokenReview authenticator.

This authenticator expects a ServiceAccountToken in a header with the
format: 'Authorization: Bearer <token>'

Part of https://github.com/kubeflow/pipelines/issues/5138

* Add tests for auth_util

* Add tests for HTTPHeaderAuthenticator

* Update server tests based on the new authentication API

* Remove old tests and unused code

* Add tests for TokenReviewAuthenticator

* Add server tests with unauthenticated requests

* manifests: Allow KFP API server to create TokenReviews

* auth: Split 'auth_util.go' into two parts

Split the file into:
* auth.go: contains the main entrance from the outside of the package
* util.go: contains all utility functions used inside

* Change token review audience variable and value

* Allow configuring audience with an environment variable

* Rename IsRequestAuthenticated -> AuthenticateRequest

* Don't use AuthenticateRequest method in tests

Instead of using AuthenticateRequest to retrieve the user from the
request and then use it for the expected values, allocate a variable for
the username in the request and use that in the expected values.
This ensures we don't hide potential errors of AuthenticateRequest.

* Change authenticators order

Have the HTTPHeaderAuthenticator first followed by the
TokenReviewAuthenticator

* Move authenticators to a ResourceManager property

To avoid potential race conditions when initializing the Authenticators
variable, we move authenticators to a ResourceManager property and
initialize it along with the initialization of the manager.
2021-03-18 21:59:16 -07:00
Yuan (Bob) Gong 4e2c45589f
feat(deployment): multi-user mode manifests for subject access review integration. Fixes #3513 (#5314) 2021-03-17 22:47:16 -07:00
Yuan (Bob) Gong 3bcaa522a4
feat(deployment): keep defaulting to argo docker container runtime executor. Part of #5285 (#5312) 2021-03-17 01:33:15 -07:00
Yuan (Bob) Gong 1e240885a1
chore: add missing places we need to upgrade mysql to 5.7 (#5301)
* fix(deployment): [gcp marketplace] fix mysql deployment after upgrade to 5.7

* tag_for_hosted should fail on error

* upgrade mysql to 5.7
2021-03-15 11:12:48 -07:00
juliusvonkohout 3cdb1d9954
feat(deployment): update mysql to 5.7 and support running as nonroot (#5278)
* Create mysql-serviceaccount.yaml

* Update kustomization.yaml

* Update mysql-deployment.yaml

* Update mysql-deployment.yaml

* Update mysql-deployment.yaml

* Update tag_for_hosted.sh

* Update mysql-deployment.yaml
2021-03-14 06:46:47 -07:00
Niklas Hansson 9c10d4f6fc
fix(cache): update to use google/cloud-sdk:alpine instead of alpine for mutating cached steps. Fixes #4099 (#5184)
* set up cluster

* allow to set with env variable

* fix the market place

* updated manifests

* Added default, still need to fix how env is set in test

* test alpine agin

* added test

* updated the image

* deleted

* change image by misstake

* updated after feedback

* deleted

* smaller image

* added to the config.json

* adjust to new updates for config handling

* Updated image to use latest
2021-03-13 02:01:47 -08:00
juliusvonkohout 6ebff17074
fix: runasnonroot for kubeflow-pipelines-controller (#5294)
* Update deployment.yaml

* Update sync.py

* Update service.yaml
2021-03-13 01:06:46 -08:00
Jeongwook Park 67afca4938
fix(deployment): Set default resource requests and limits for pipeline pods. Fixes #5214 (#5293) 2021-03-12 19:26:46 -08:00
Renmin 8174dacf8e
refactor(deployment): GCP marketplace detailed deployer KSA's permissions (#4147)
* update deployer ksa roles so that it can solve b/159616919

* add permission needed

Co-authored-by: Renmin Gu <renming@google.com>
Co-authored-by: Yuan Gong <gongyuan94@gmail.com>
2021-03-12 12:59:40 +08:00
Yuan (Bob) Gong 78db53eed1
feat(deployment): [GCP marketplace] upgrade argo to v2.12.9. Part of #5232 (#5277)
* feat(gcp): upgrade argo to v2.12.9

* add back missing labels

* fixed snapshot test

* add comment

* fix invalid yaml
2021-03-11 14:03:10 +08:00
Yuan (Bob) Gong 09a7236f07
refactor(deployment): refactor argo manifests to be overlay on top of upstream (#5273)
* refactor(deployment): refactor argo manifests to be overlay on top of upstream

* switch to pns executor

* updates

* update

* add cluster scoped resouces to installs/multi-user

* update manifests/kustomize/README.md

* fix image

* fix var replacement

* rm argo readme in manifests
2021-03-10 18:20:24 -08:00
Yuan (Bob) Gong be64b32798
feat: update argo image to v2.12.9 and automate update process. Fixes #5232 (#5266)
* add notices and licenses for argo 2.12

* feat: upgrade argo images to v2.12.9

* update all refs to argo image version

* add NOTICES generation script

* upgrade argo cli to latest

* fix

* fix

* add license_info.csv back

* make release process safer

* add back third_party/license.txt

* refactor(deployment): move argo manifests to third-party, updates for 2.12.9

* update marketplace snapshots

* set up marketplace presubmit test

* add comment
2021-03-10 12:52:42 +08:00
Yannis Zarkadas 6b1eba760b
Pipelines: Move manifests development upstream (#5256)
* manifests: Move base to installs/generic

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Add multi-user install

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Move minio, mysql under third-party

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* manifests: Add platform-agnostic-multi-user env

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2021-03-09 17:43:23 -08:00
Yuan (Bob) Gong 965bc62fd6
deployment: adjust default resource requests (#5237) 2021-03-04 14:03:41 +08:00
Niklas Hansson 19830416e7
fix(backend):Update with to allow eviction of pods with empty dir. Fixes #4530 (#5171)
* initial work on the sync

* removed quotation

* Moved annotation

* moved again

* clean up
2021-03-01 14:05:47 -08:00
Yuan (Bob) Gong 8b3d741c6e
chore(release): after release, also bump master to 1.4.1 (#5196)
* update release instructions

* chore(release): bump version to 1.4.1

* Update setup.py
2021-03-01 12:10:16 -08:00
Niklas Hansson 98f946a4cf
fix(deployment): Updated the manifests with resource requests. Fixes #5148 (#5158) 2021-02-26 16:02:15 -08:00
Luis Magana 5df2801603
feat(backend): Added multi-user pipelines (UI + API); Fixes #4197 (#4835)
* Added multi-user pipelines backend


corrected typo


updating code based on review


fixes for pipelines server


reverting this back

* removing unnecessary info logging
2021-02-26 06:10:15 -08:00
Rui Fang 2b95c002b7 chore(release): bumped version to 1.4.0 2021-02-12 09:31:46 +00:00
Chen Sun 26de102f82 chore(release): bumped version to 1.4.0-rc.1 2021-02-01 00:18:50 -08:00
Alexey Volkov 7ed110e984
fix(mkp): Added default values for parameters (#5010)
Fixes https://b.corp.google.com/issues/177083468
2021-01-19 14:29:58 -08:00
Alexey Volkov 3b9fdff26b
fix(mkp): Added service account description to prevent validation errors (#4992) 2021-01-15 15:53:43 -08:00
Niklas Hansson eeb7f8f04a
fix(backend): make the scheduleworkflowontroller timezone aware. Fixes #2653 (#4641)
* intial work'

* small fixes

* updated tests and how parameter are set

* try to fix test

* check with out adding missing test

* fixed small typo

* test changes

* updated config

* typo

* updated after feedback

* fixed pointer error

* test to add paramter

* moved to init so removed not needed code

* updated further

* updated tests to also check endtime

* clean up test

* fixed failing test

* fixed the expected test results

* added timezone examples

* further clean up

* fixed time format

* Update params.env

* moved location to cronjobscheduler

* clean up

* set env variable to empty

* reverted back

* updated to make magic nbr to constant

* updated the tests with comment

* added comments on cron expressions

* update naming and return types

* updated to UTC as default

* updated with an alpha notice
2021-01-10 00:59:05 -08:00
Yang Pan c484cfa46c chore(release): bumped version to 1.3.0 2021-01-07 00:39:26 -08:00
Chen Sun 5445ce82c7 chore(release): bumped version to 1.2.0 2020-12-17 23:24:32 -08:00
numerology 1449cfe0a5 chore(release): bumped version to 1.1.2 2020-12-14 09:43:07 -08:00
Rui Fang a0a1a5d0cf chore(release): bumped version to 1.1.2-rc.1 2020-12-04 07:09:17 +00:00
Rui Fang 8a22a89c7d
chore(release): upgrade mlmd to 0.25.1 (#4859)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* fix go.sum

* upgrade mlmd to 0.25.1

* Update requirement.txt and it's scripts
2020-12-02 22:13:00 -08:00
hilcj c1aebb5d22 chore(release): bumped version to 1.1.1-beta.1 2020-11-26 17:58:04 +00:00
hilcj 4fe4a30545 Revert "chore(release): bumped version to 1.1.1-beta.1"
This reverts commit 9af3e79c10.
2020-11-26 16:10:10 +00:00
hilcj 9af3e79c10 chore(release): bumped version to 1.1.1-beta.1 2020-11-26 04:32:09 +00:00
hilcj bd86072a8c Revert "chore(release): bumped version to 1.1.1.beta.1"
This reverts commit 5928a2659b.
2020-11-26 04:20:10 +00:00
hilcj 5928a2659b chore(release): bumped version to 1.1.1.beta.1 2020-11-26 03:07:26 +00:00
Yuan (Bob) Gong aa162e6a24
feat(gcp): Revert "turn off archiveLogs for GCP marketplace #4758" (#4771)
This reverts commit 1165ed243b.
2020-11-17 23:00:04 -08:00
Ilias Katsakioris 39203d5ffa
feat(backend): Refactor authz to perform SubjectAccessReview. Fixes #3513 (#4723)
* [Backend] Return proper error codes for failures during auth

* [Backend] Implement helpers to initialize a SubjectAccessReview client

In preparation of SubjectAccessReview, we implement some helpers to
create a new Kubernetes Authorization clientset and return the
SubjectAccessReview client.
We also define some fake clients to be used by future tests.

* [Backend] Introduce RBAC-related constants

In preparation of SubjectAccessReview, introduce RBAC groups, resources,
and verbs.

* [Backend] Extend managers with a SubjectAccessReviewClient

* [Backend] Refactor the authorization mechanism for requests

Authorization should be based on performing some action on a resource
living in a namespace. This commit refactors the authorization utilities
to reflect this and perform SubjectAccessReview.

This commit also deletes some tests based on old authn/authz mechanism.
A following commit will fix/extend the tests for the new mechanism

* [Backend] Adjust endpoints to pass resource attributes for authz

With KFAM authorization, we passed only the namespace attribute for
authorization. With SubjectAccessReview, we need a richer list of
attributes. Thus, we adjust endpoints to pass request details (resource
attributes) necessary for authorizing the request. We only change the
already authorized endpoints, not introducing any new checks.

* [Backend] Adjust apiserver/server tests to SubjectAccessReview

* [Backend] Purge KFAM

Since we no longer use KFAM, we may as well purge it

* [Backend] Update BUILD files

Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com>

* [Manifests] Extend manifests for SubjectAccessReview

* API Server: Allow creating SubjectAccessReviews
* Add view/edit roles in a multi-user kustomization
2020-11-17 14:56:05 -08:00
Yuan (Bob) Gong 1165ed243b
feat(gcp): turn off archiveLogs for GCP marketplace (#4758) 2020-11-11 21:08:50 -08:00
Rafał Bigaj 678ae0fe08
feat(backend): new server API to read run log. Fixes #4468 (#4493)
* New server API: read run log

- The new server API endpoint (/apis/v1beta1/runs/{run_id}/nodes/{node_id}/log) to fetch run log
- `ARCHIVE_LOG_FILE_NAME` and `ARCHIVE_LOG_PATH_PREFIX` options allows to control archive log path
- UI Server fetches logs from server API or directly from k8s depending on `STREAM_LOGS_FROM_SERVER_API` option

* New server API: read run log

- ml-pipeline rbac update: allow for access to log

* Read run log: enhanced error handling

- log message on Pod access errors

* Read run log: enhanced log archive options

* Code format

* Test update after getPodLogs signature change

* Updated comments after review

* `follow` query parameter in GET /apis/v1beta1/runs/{run_id}/nodes/{node_id}/log

* Env variable friendly config names & comments

- Config options: ARCHIVE_CONFIG_LOG_FILE_NAME, ARCHIVE_CONFIG_LOG_PATH_PREFIX
- Copyright message update
- New endpoint as `v1alpha1`

* Licence updates

- fluent-bit licence inlined
- copyright message updates

* Master merge

- dependency conflicts
2020-11-11 00:37:50 -08:00
Yuan Gong 66ccf335e8 chore: update OWNERS 2020-11-09 10:18:18 +08:00
Vincent Link c0f94e189c
chore(deployment): Add app=minio label to minio service (#4707)
This keeps the minio service similar to the other minio resources and helps e.g. when defining a ServiceMonitor for the prometheus-operator.
2020-11-03 04:10:15 -08:00
Yuan Gong 7d36f48482 chore(release): bumped version to 1.1.0-alpha.1 2020-11-02 03:01:27 +00:00
Niklas Hansson 2317015085
feat(backend): allow configuring if default version should be updated when uploading new pipeline version. Fixes #4049 (#4476)
* update to fetch remote

* missed to add the description

* fixed merge conflict

* initial work

* fixed test and bug

* updated python client

* clean up

* clean up

* added config default

* fixed bug in API

* moved config  value

* reverted to load from config

* clean up

* Update _client.py

* removed unecessary function and updated after feedback

* missed to save pipeline.proto

* updated the last parts after feedback

* reverted back to use string and env variable

* updated typo

* fix typo in path

* clean up

* removed option in api

* clean up python part

* typo, cant run test locally

* clean up, problems with local env

* clean up missing differences

* reverted proto files

* further clean up

* clean up

* updated after feedback

* Added tests

* error in my defer statement

* Updated the test
2020-10-19 02:08:14 -07:00
jotaylo 7a5c3c60b8
feat(deployment): Kustomize manifest for Azure deployments (#4567) 2020-10-12 02:04:47 -07:00
Yao Xiao 1c2a4e8886
fix(doc): Use Managed NodeGroup for EKS Cluster (#4609) 2020-10-12 00:44:47 -07:00
Yuan (Bob) Gong d4e7398917
doc(gcp): recommend KFP standalone over marketplace CLI installation (#4508) 2020-09-16 21:20:45 -07:00
Yuan (Bob) Gong d91a0c9da1
chore(release): bump version to 1.0.1 on master branch (#4492)
* chore(release): bump version to 1.0.1 on master branch

* remove rc changelog
2020-09-14 01:28:58 -07:00
Yuan (Bob) Gong fbfc160f93
feat: GCP marketplace - preserve install config in configmap and secret (#4471)
* tools for comparing gcp marketplace manifests

* commit snapshots

* feat: GCP marketplace - preserve install config snapshot in a configmap and secret

* commit updated snapshots

* update syntax
2020-09-06 22:39:41 -07:00
Eugene Fedorenko d40d66aca3
fix(backend): Add a permission to create events to argo-role (#4449) 2020-09-04 02:17:41 -07:00
Alexey Volkov 64d15f76f4
chore(metadata): Added instruction to sync the MLMD version (#3658) 2020-09-02 01:23:07 -07:00
Alexey Volkov d9ee639aeb
Manifests - Added permissions for certificate approval (#4385) 2020-08-26 13:05:42 -07:00
jingzhang36 f39d8790b2
feat: add grafana as part of the customized kfp deployment (#4404)
* enable pagination when expanding experiment in both the home page and the archive page

* Revert "enable pagination when expanding experiment in both the home page and the archive page"

This reverts commit 5b672739dd.

* grafana

* add grafana directory to kust

* grafana deployment config
2020-08-24 16:49:34 +08:00
Alexey Volkov 265c9b70f4
fix(cache): adds certificate approver permission to kubeflow-pipelines-cache-deployer-role. Fixes #4138 (#4246) (#4383)
* certificat approval

* update pr

* update pr

* update pr

* update pr

* remove list&watch

Co-authored-by: Eugene Fedorenko <eugene.fedor@gmail.com>
2020-08-17 20:35:30 -07:00
Yao Xiao 5ec7ec8a00
feat(manifest): add support for Standalone KFP on AWS. Fixes #4337 (#4350) 2020-08-12 18:56:14 -07:00
Yuan (Bob) Gong f738cb3ac3
feat(deployment): KFP standalone should keep user data when application deleted (#4332) 2020-08-06 19:08:19 -07:00
jingzhang36 3f9ce57a81
chore: add optional prometheus deployment (#4323)
* enable pagination when expanding experiment in both the home page and the archive page

* Revert "enable pagination when expanding experiment in both the home page and the archive page"

This reverts commit 5b672739dd.

* Add prometheus deployment as an optional deployment

* move prom dir to under third-party

* comments

* third party folder to the kustomize folder
2020-08-05 23:52:55 -07:00
Alexey Volkov fe77c197d1
fix(backend): Backend - Cache - Fixed reinstallation. Fixes #4299 (#4320)
* Backend - Cache - Fixed reinstallation by adding missing roles

* Stop ignoring the deletion errors

* Added patch permission as well

It should not be triggered, but might be useful in the future.
2020-08-04 18:48:28 -07:00
Gabriele Santomaggio dac5a9433c
feat(deployment): Add env/platform-agnostic-pns standalone deployment option (#4278)
Add containerRuntimeExecutor explicit type on the configMap
configuration. Set `docker` as default.
Add platform-agnostic type configuration needed by Kind, K3s and
Minikube.
Part of https://github.com/kubeflow/pipelines/issues/4256
2020-08-03 19:24:18 -07:00
Yuan (Bob) Gong 988f5b02e4
chore(release): bump version to 1.0.0 on master branch (#4249) 2020-07-20 02:04:51 -07:00
Yuan (Bob) Gong 45a91f6699
feat(deployment): GCP managed storage - detailed instructions to set up workload identity bindings before deployment (#4232)
* feat: allow creating workload identity bindings before deploying KFP

* more instructions

* fix formatting

* fixes

* Update doc ref

* fix storage role

* fix storage role

* add viewer KSA to standalone manifest

* fix missing configmap

* update documentation
2020-07-16 23:13:00 -07:00
Renmin c615e3796c
fix(deployment): fix database name prefix description (#4198)
* fix wording

* Update schema.yaml

* Update schema.yaml

Co-authored-by: Yuan (Bob) Gong <gongyuan94@gmail.com>
2020-07-10 15:22:16 +08:00
Yuan (Bob) Gong c5c9739a00
fix(deployment): gcp managed storage - add service account to minio and cloudsql proxy deployments for workload identity (#4188) 2020-07-09 18:24:54 +08:00
Joe Liedtke b8e210af25
docs(deployment): Suggested wording changes for schema.yaml. (#4151)
* Suggested wording changes to schema.yaml.

* Update schema.yaml

Co-authored-by: Yuan (Bob) Gong <gongyuan94@gmail.com>
2020-07-07 02:33:58 -07:00
Yuan (Bob) Gong d4a8329134
feat(deployment): marketplace - allow specifying gcs bucket directly (#4112)
* feat(deployment): marketplace - allow specifying gcs bucket directly

* Switch tfx default bucket to user specified one

* Update schema description

* Update version to 0.5.1 to match marketplace expectation

* Fix gcsBucketName var

* Remove gcp secret credentials
2020-07-01 20:18:00 -07:00
Yuan (Bob) Gong 79e0ee2b49
chore: remove inactive reviewers (#4111)
* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS

* Update OWNERS
2020-06-30 19:10:06 -07:00
frozeNinK 8a2d11c96a
feat(backend): Make number of persistence worker goroutine configurable (#3904)
* Make number of persistence worker configurable

* address comments

* address comments

* address comments
2020-06-29 21:37:58 -07:00
Yuan (Bob) Gong c80a8a212c
refactor(deployment): separate metadata-writer and metadata-grpc folders (4071)
* refactor(deployment): separate metadata-writer and metadata-grpc folders

* fix release script
2020-06-28 01:14:14 -07:00
Yuan (Bob) Gong 9101ded02e
refactor(deployment): move kustomization.yaml images to the lowest level package (4072)
* refactor(deployment): separate metadata-writer and metadata-grpc folders

* refactor(deployment): move kustomization.yaml images to the lowest level package

* format
2020-06-27 21:18:14 -07:00
Yuan (Bob) Gong 1bbd82c25e
refactor(deployment): use minio artifact secret as single source of truth (4065)
* refactor(manifests): move minio artifact secret to minio package

* let api server and ui use minio artifact secret instead of default value

* Update kustomization.yaml

* fix name
2020-06-26 02:03:31 -07:00
dushyanthsc bc3c59aea1
MetadataStore: Update to release metadata-envoy in each release (#4026) 2020-06-23 19:07:17 -07:00
Yuan (Bob) Gong 002ea63279
chore(marketplace): fix version in release note (#4015) 2020-06-18 17:23:10 -07:00
jingzhang36 8553497c3c
Reduce ttl of persisted final workflow to 1 day (#4005)
* reduce ttl of pesisted final workflow to 1 day

* add comment

* enable pagination when expanding experiment in both the home page and the archive page

* Revert "enable pagination when expanding experiment in both the home page and the archive page"

This reverts commit 5b672739dd.

* Address comments
2020-06-18 00:22:06 -07:00
dushyanthsc 3de9987b56
Metadatastore: Updating metadata grpc server image to 0.22.1 (#3982) 2020-06-17 20:34:06 -07:00
Yuan (Bob) Gong c0074463ee
chore(manifests): add pod get permission to ui back 2020-06-16 15:24:03 +08:00
Yuan (Bob) Gong ecb6b1fe25
chore(manifests): add missing port name and other cleanups (#3987) 2020-06-15 22:38:03 -07:00
Yuan (Bob) Gong c0124cb747
[Manifest] Apply kustomize best practices to standalone manifest (#3978)
* Use configMapKeyRef for env vars

* Allow easy customization of cluster-scoped resources namespace

* clean up

* Clean up

* Simplify var replacement with direct configmap value ref

* clean up params.env
2020-06-15 19:09:57 -07:00
Yuan (Bob) Gong 3cf36cdef3
[Format] Format yaml files that will be automated (#3958)
* [Format] Format yaml files that will be automated

* Fix formatting

* Add a top level script

* Fix formatting
2020-06-10 20:28:58 -07:00
Yuan (Bob) Gong b12023ef3b
[Release] Various fixes to release script (#3918)
* Update kustomize/base/params.env

* fix

* kfp sdk requirement updated to kfp_api_server<2.0.0

* Fix major minor parsing

* Also fix cloudbuild.yaml
2020-06-05 13:32:21 +08:00
Amy 0324b2cac8
create cluster using 'cloud-platform' scope (#3892) 2020-06-04 12:14:38 -07:00
Yuan (Bob) Gong c517fc579d
[Release] Automate release script for all the changes (#3777)
* Refactor components/release.sh to provide a new components/release-branch.sh that updates release branch directly

* Release components as version tag instead of commit SHA

* Publish component images in release.cloudbuild.yaml

* Include script that updates version tag for component sdk

* [Manifest] Use kustomize native image transformer to override image

* Revert unintended changes

* Fix kustomization.yaml location

* Fix inverse proxy image

* Add release script for kustomize manifest

* Add release scripts for marketplace manifest and sdk

* Add global release.sh

* Fix sdk release script

* Clean up release scripts

* Fix release script

* Fix release scripts

* fix

* fix

* Fix cannot use uppercase vars in cloudbuild.yaml

* Add old components release script back

* Add a RELEASE.md doc
2020-06-03 08:44:18 -07:00
Yuan (Bob) Gong 53d35ddc87
[ScheduledWorkflow] Fix events permission missing (#3785) 2020-05-19 21:10:18 -07:00
Yuan (Bob) Gong 39805acc9e
[Manifest] Use kustomize native image transformer to override image (#3776)
* [Manifest] Use kustomize native image transformer to override image

* Revert unintended changes

* Fix kustomization.yaml location

* Fix inverse proxy image
2020-05-18 21:23:36 -07:00
Niklas Hansson 3d2c6ea84c
manual merge as the change it self is correct
but MKP mpdev:latest has an issue block our tests
2020-05-15 15:19:34 +08:00
Yuan (Bob) Gong 61960efb87
Add probes to metadata grpc service (#3765)
* Add probes to metadata grpc service

* Fix port name length limit

* Update README.md
2020-05-14 13:50:59 -07:00
Yuan (Bob) Gong f0c8effb3a
[Servers] Add liveness and readiness probes (#3757)
* probes for ml-pipeline-ui

* clean up comments

* Use wget instead of curl, because wget is included in alpine

* Also update marketplace manifest

* Add readiness/liveness probe for api server

* Add probes for python vis server
2020-05-13 23:46:22 -07:00
Alexey Volkov e83bd7d78f
Upgraded Argo to v2.7.5 (#3537)
* Upgraded Argo to v2.7.4

* Downgraded the Argo CLI version to 2.4.3

See https://github.com/argoproj/argo/issues/2793

* Removed the argo cli arg that had been removed

* Updated to Argo 2.7.5

* Added workflowtemplates and cronworkflows to the Role

* Added the new Argo CRDs
2020-05-11 23:52:21 -07:00
Renmin 9c16e12ba0
update version (#3694) 2020-05-06 19:37:09 -07:00
Renmin a6eb65013f
Move namespace to cluster-scoped (#3662)
* move namespace to cluster-scoped-resource

* fix doc
2020-04-30 00:30:17 -07:00
Renmin a94597f8cb
fix issue of creating default bucket (#3626) 2020-04-26 23:05:28 -07:00
hongye-sun 3dbbb16903
update version to 0.5.0 (#3566) 2020-04-22 14:00:50 -07:00
Renmin fcbb654563
Enable cache-deployer as fixed the root cause in other PR (#3574)
* default to kubeflow

* done

* include cache as we found root cause is namespace

* fix

* change the default to kubeflow, more for manual upgrade
2020-04-22 13:59:45 -07:00
Yuan (Bob) Gong 65e3db9443
Update ml-pipeline-ui-deployment.yaml (#3586) 2020-04-21 21:23:52 -07:00
Renmin 9b7689e971
Clusterrolebinding is using namespace which not parametrized (#3573)
submit quick to make sure others won't got confused
2020-04-21 22:23:10 +08:00
Renmin db8042a846
Fix test which uses Kustomize edit image but can't work with valueRef (#3572)
pass upgrade / installation test. submitting. now.

The e2e test fails but not due to this PR. Submit this PR to unlock KIR side
2020-04-21 20:10:41 +08:00
Renmin ad9bd5648d
use better sample name (#3558) 2020-04-20 18:57:35 +08:00
Renmin 91c3239004
OSS 1.0 Kustomize part-2 parameterize & fix CloudSQL (#3540)
submit without wait for fix for following as no dependency
https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/kubeflow_pipelines/3540/kubeflow-pipeline-e2e-test/1252173721301422090
2020-04-20 18:46:35 +08:00
Yuan (Bob) Gong 2742a3ed95
[SDK] Make service account configurable for build_image_from_working_dir (#3419)
* Add kfp-container-builder sa

* Allow service account to be configurable

* Fix tests

* Fix test

* Use documentation for service account to introduce compatibility with different types of installation

* updated doc

* clean up

* Update container_builder_test.py

* Update _build_image_api.py

* Update kustomization.yaml

* Add executable permission for presubmit tests mkp.sh
2020-04-15 00:06:02 -07:00
Yuan (Bob) Gong 2586e18990
Update metadata-envoy-deployment.yaml (#3502) 2020-04-14 03:13:12 -07:00
Renmin 75a92a740c
pass projectID from env/configmap without user input (#3458) 2020-04-13 09:05:48 -07:00
Renmin 9953e45619
one project share one default bucket (#3478) 2020-04-13 08:19:49 -07:00
Yuan (Bob) Gong e395276ea8
[Deployment] Move crds to cluster-scoped kustomize folders (#3498)
* [Deployment] Move crds to cluster-scoped kustomize folders

* Fix naming

* Rename folder

* Add STRUCTURE.md, fix bug

* fix
2020-04-13 01:09:48 -07:00
Jiaxiao Zheng cfcd824ed1
update version (#3467) 2020-04-07 14:13:44 -07:00
Yuan (Bob) Gong 163d49814b
Change cache-deployer to strategy recreate (#3456) 2020-04-06 19:55:43 -07:00
Rui Fang 85257a06ea
[Manifest] Cache - MKP deployment (#3430)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Add cache manifests for mkp deployment

* revert go.sum

* Add helm on delete policy for cache deployer job

* Change cache deployer job to statefulset

* remove unnecessary cluster role

* seperate clusterrole and role

* add role and rolebinding to mkp

* change secret role to clusterrole

* Add cloudsql support to cache

* fix comma

* Change cache secret clusterrole to role

* Adjust sequences of resources

* Update values and schema

* remove extra tab

* Change statefulset to job

* Add pod delete permission to cache deployer role

* Test changing cache deployer job to deployment

* remove extra permission

* remove statefulset check
2020-04-06 16:53:19 -07:00
Renmin 7fbb45d381
enable CloudSQL+GCSObjStore without default credential (#3378)
* enable CloudSQL+GCSObjStore without default credential

* refresh document

* fix schema

* minio project ID is required

* fix several

* self throtting Github requests to let build be stable

* can work now

* upsize and lowercase for bucket name

Co-authored-by: Renmin Gu <renming@google.com>
2020-04-03 17:22:19 +08:00
Niklas Hansson 8091752980
updated version (#3421) 2020-04-03 00:51:46 -07:00
Rui Fang 8e137a1ba6
[Manifest] Cache - Enable cache and cache deployer in base kustomization file (#3376)
* Initial execution cache

This commit adds initial execution cache service. Including http service
and execution key generation.

* fix master

* Change cache deployer job to stateful set

* Delete cache deployer job

* Delete cache deployer job after it completes

* minor fix

* fix indention

* Change cache deployer job to statefulset

* Remove extra cluster role for cache deployer

* remove cache in base kustomize file for upgrade test

* minor fix

* Enable cache and cache-deployer in base kustomization file

* fix

* fix

* test

* test

* test

* Refactor cluster scope resources

* refactor

* Add namespace for sa

* Fix

* Add crds folder to cluster kustomization yaml

* namespace change

* fix

* fix

* fix

* update test

* Rename cluster to cluster-scoped-resource

* test adding namespace in kustomization file

* revert namespace for clusterrolebinding

* fix

* Add db_name in cache_deployment manifest

* rename

* change secret cluster role to role
2020-04-02 14:37:04 -07:00
Renmin bfb02f72c9
revert kfp-cache from Hosted/MKP (#3416)
Co-authored-by: Renmin Gu <renming@google.com>
2020-04-02 01:36:25 -07:00