Commit Graph

32 Commits

Author SHA1 Message Date
Yuan (Bob) Gong 2460aaa653
test: migrate sample integration test to v2 sample test infra (#7115)
* test: migrate sample integration test to v2 sample test infra

* test: configure v2 integration test timeout
2021-12-26 04:17:44 +00:00
Yuan (Bob) Gong ac14a3b610
test: migrate all sample tests to v2 sample test infra (#7114)
* test: migrate all sample tests to v2 sample test infra

* fix
2021-12-24 08:15:43 +00:00
Alexey Volkov cc83e1089b
Assigned copyright to the project authors (#5587) 2021-05-05 13:53:22 +08:00
Jiaxiao Zheng db8a2360fd
two fixes (#3307) 2020-03-18 18:02:43 -07:00
Yuan (Bob) Gong 5391e88fbc
[Testing] KFP standalone test infra for upgradability (#1971)
* Implement upgrade test

* mark upgrade-tests.sh as executable

* Fix comments

* Base upgrade_test_setup.yaml

* e2e integration of upgrade test

* Fix entrypoint argument

* Fix e2e workflow yaml

* Fix run_test.sh argument processing

* Fix no closing backtick

* Restrucutre upgrade_test.go to focus the test on upgrade verification

* clean up code

* Clean up after upgrade test when it is run in integration tests.

* Include pipeline tests in upgrade test

* Reorder tests

* Add upgrade test coverage for run api resources

* Add job api resource coverage in upgrade test & refactored upgrade test

* Fix add missing step in upgrade test

* Fix BUILD.bazel

* Fix upgrade_test.go

* Try to fix upgrade test failure

* Fix hard coded namespace

* Sync upgrade-tests.sh with new changes in presubmit-tests-with-pipeline-deployment.sh

* Update upgrade test

* Remove redundant code

* Fix integration test exit code

* Fix trigger interval second mismatch
2020-03-09 16:53:37 -07:00
Yuan (Bob) Gong 02fabd306e
[Testing] Use google/cloud-sdk:279.0.0 to resolve workload identity flakiness (#3019)
* [Testing] Use gke 1.15.8 to mitigate workload identity flakiness

* Upgrade gcloud version

* Update image builder image too

* Turn on workload identity

* Update deploy-cluster.sh

* secret sample uses python3 instead

* Increase xgboost time limit

* Revert files with bad format

* Update component and pipelines to use gcloud 279.0.0

* Fix secret sample using python3

* Upgrade frontend integration test image

* Rebuild frontend integration test image
2020-02-11 18:34:07 -08:00
Yuan (Bob) Gong 19f721f380
[Testing] Use full scope cluster for testing to reduce flakiness (#3018) 2020-02-07 22:07:53 -08:00
Renmin de9d2076b2 enlarge timeout to 45m to avoid flakyness (#2875)
* to 45m

* 45m

Co-authored-by: renmingu <40223865+renmingu@users.noreply.github.com>
2020-01-20 00:17:37 -08:00
Yuan (Bob) Gong 493c3d4e98 [testing] Fix e2e tests timed out, but there's empty test log (#2810)
* Wait longer for e2e tests

* Let api integration test stream log output while testing

* Also fix initialization test

* Fix exit code on error

* Update test/initialization-test/run_test.sh

Co-Authored-By: Alexey Volkov <avolkov@google.com>

* Update according to CR

Co-authored-by: Alexey Volkov <alexey.volkov@ark-kun.com>
2020-01-09 21:00:34 -08:00
Yuan (Bob) Gong 4a8d262abb Migrate standalone deployment to workload identity on GCP (#2619)
* Script to set up workload identity for standalone deployment

* Migrate tests to run on standalone + workload identity

* Fix test script

* Switch to static GSAs for testing, because they have name length limit

* Add workload identity binding for argo

* Fix argo workload identity bindings

* Remove user-gcp-sa from tests

* Remove use_gcp_secret from xgboost sample

* Allow debugging tests locally

* Wait for policies to take effect

* Update deploy-pipeline-lite.sh

* Update deploy-pipeline-lite.sh

* [WIP] test gcloud auth list with test-runner sa

* Add namespace

* test again

* Use new image builder

* test again

* Remove debug code

* Remove usages of use_gcp_secret

* Fix unit test and tensorboard pod template

* Add debug code again to test

* Try waiting until workload identity bindings are ready

* Fix some other samples

* Fix parameterized tfx oss sample

* Add retry to image building

* Try fixing tfx oss sample

* Fix compiled tfx oss sample

* Update all google/cloud-sdk to latest

* Try fixing parameterized tfx oss sample again

* Also verify pipeline-runner ksa is working

* Fix parameterized_tfx_oss sample

* Update gcp-workload-identity-setup.sh

* Revert unneeded change

* Pin to new google/cloud-sdk

* Remove wrongly commited binaries
2019-12-16 22:05:58 -08:00
Yuan (Bob) Gong 3d008f96e9 Fix obsolete image cache when the same PR commit is tested with a new master (#2738) 2019-12-16 17:09:38 -08:00
Yuan (Bob) Gong 3d2b0ae417 Print cloud build logs when the task fails (#1989)
* Print cloud build logs when the task fails

* Add comment about why cloud build is made async
2019-09-03 18:12:58 -07:00
Alexey Volkov 3962032898 Testing - Fixed the comparison bug in presubmit test script (#1990)
* Testing - Fixed the comparison bug in presubmit test script

Bash was treating the single quotes literally.

* Addressed the feedback
2019-08-30 12:48:04 -07:00
Alexey Volkov 856949ed30
Test infra - Fixed the batch Prow tests (#1958)
Fixes https://github.com/kubeflow/pipelines/issues/1957
2019-08-27 14:16:19 -07:00
Yuan (Bob) Gong 8e53eb43ad Move postsubmit tests to lite deployment (#1939)
* Move postsubmit tests to lite deployment

* Reduce verbose logs by wget

* Add ignored files

* add test temporary file to gitignore
2019-08-23 14:34:26 -07:00
Yuan (Bob) Gong fe8d96ffb5 Use cloud build to build images instead (#1923)
* Use cloud build to build images instead

* Batch 3 image buld tasks

* Fix check cloud build status script
2019-08-23 01:43:18 -07:00
Yuan (Bob) Gong d11fae78d8 Use KFP lite deployment for presubmit tests (#1808)
* Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project

* Simplify getting service account from cluster.

* Migrate presubmit-tests-with-pipeline-deployment.sh to use kfp
lightweight deployment.

* Add option to cache built images to make debugging faster.

* Fix cluster set up

* Copy image builder image instead of granting permission

* Add missed yes command

* fix stuff

* Let other usages of image-builder image become configurable

* let test workflow use image builder image

* Fix permission issue

* Hide irrelevant error logs

* Use shared service account key instead

* Move test manifest to test folder

* Move build-images.sh to a different script file

* Update README.md

* add cluster info dump

* Use the same cluster resources as kubeflow deployment

* Remove cluster info dump

* Add timing to test log

* cleaned up code

* fix tests

* address cr comments

* Address cr comments

* Enable image caching to improve retest speed
2019-08-20 17:25:20 -07:00
Yuan (Bob) Gong 0ed5819ae9 Refactor presubmit-tests-with-pipeline-deployment.sh to run in other projects (#1732)
* Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project

* Simplify getting service account from cluster.

* Copy image builder image instead of granting permission

* Add missed yes command

* fix stuff

* Let other usages of image-builder image become configurable

* let test workflow use image builder image
2019-08-19 17:13:31 -07:00
Alexey Volkov e9aa69e353 Testing - Clean up the Argo controller that was used to build images (#1311)
* Testing - Clean up the Argo controller that was used to build images

* Try to not install Argo in the first place

* Added the test-runner service account

* Cleanup

* Changing the install-argo.sh script instead
2019-05-13 14:51:09 -07:00
Ning d0429b63f9 keep the api image name consistent between the presubmit test and staging (#1279) 2019-05-03 17:49:38 -07:00
Ning a50997c98d remove unnecessary args (#1249) 2019-04-29 13:57:36 -07:00
IronPan 18878f1bed fix bunch of issues in prow test (#866)
* update tests

* explicit return successful

* fix

* move variable

* Update deploy-kubeflow.sh
2019-02-27 14:40:01 -08:00
Ning 508210d40b
Add postsubmit component test (#613)
* add postsubmit script and yaml
* remove old sample tests component file
* extract deploy-pipeline.sh, deploy-kubeflow.sh and test-prep.sh from presubmit and postsubmit scripts
2019-02-25 13:29:04 -08:00
hongye-sun ad370933c7 Move e2e tests to us-east1 (#847)
* move to us-east1

* switch to us-east1-b
2019-02-22 10:51:42 -08:00
IronPan cc257f29a2 switch test to us-west1 (#808)
* switch test to east1

* Update presubmit-tests-with-pipeline-deployment.sh
2019-02-11 16:10:00 -08:00
IronPan e9bd7c6a4d
merge build image to test suit (#799)
* merge build image to test suit

* update image

* Update presubmit-tests-with-pipeline-deployment.sh

* add permission to access to gcr

* add service account

* test

* fix

* not exit

* speed boost
2019-02-09 00:23:06 -08:00
hongye-sun 969bb4ed2c Revert "Add gpu pool to test deployment and enable gpu in sample test (#696)" (#778)
This reverts commit 72a7de9d47.
2019-02-04 21:59:19 -08:00
hongye-sun 72a7de9d47 Add gpu pool to test deployment and enable gpu in sample test (#696)
* add gpu pool to test deployment and enable gpu in sample test

* enable clean up
2019-01-25 09:44:08 -08:00
IronPan 4c551bac60 bump ks version (#693) 2019-01-23 15:08:25 -08:00
Alexey Volkov d135be9c87 Deployment - Passing the platform parameter (#588) 2019-01-02 11:40:08 -08:00
IronPan 9a62c319c6 Clean up the resource after test finish (#410) 2018-11-29 13:03:30 -08:00
IronPan 9b77d4a8a6 Switching test to kubeflow deployment (#351)
* test

* fix

* fix

* fix

* fix

* fix

* update

* cleanup

* fix

* coopy test

* chmod

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* update

* fix

* fix

* fix

* fix

* fix

* fix

* fix sample test

* fix

* fix

* merge

* update image builder image

* update script

* mount permission
2018-11-28 21:36:12 -08:00