Commit Graph

45 Commits

Author SHA1 Message Date
Mathew Wicks 9d9aa0870f
chery-pick changes from master (#1648)
* update jupyter config with tolerations/selectors (#1644)

(cherry picked from commit 051f31ac90)

* update image tags (#1645)

(cherry picked from commit fdd3204ab2)

* fix application selectors (#1646)

(cherry picked from commit 3e88eab848)
2020-11-16 10:13:07 -08:00
kubeflow-bot 0f673cf13e
[auto PR] Update the notebook-controller image to tag vmaster-g6eb007d0 (#1570)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-g6eb007d0
* Image built from kubeflow/kubeflow@6eb007d0
2020-11-04 12:44:53 -08:00
axelgobletbdr 3fd1c71143
Fixes #1509: Notebook CRD allows misconfigured notebooks that make the notebook-controller crash (#1510)
* added regex to container cpu and memory requests and limits

* added newly generated tests

* added tests generated with kustomize 3.2.1
2020-09-09 03:27:53 -07:00
kubeflow-bot 8449d4ed07
[auto PR] Update the notebook-controller image to tag vmaster-g845af298 (#1474)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-g845af298
* Image built from kubeflow/kubeflow@845af298
2020-08-07 21:30:19 -07:00
kubeflow-bot fb8760dacd
[auto PR] Update the notebook-controller image to tag vmaster-g101c77a3 (#1345)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-g101c77a3
* Image built from kubeflow/kubeflow@101c77a3
2020-07-10 15:15:48 -07:00
Mathew Wicks 6bc84d9d86
replace `env:` with `envs:` in Kustomization resources (#1204) 2020-06-17 06:12:39 -07:00
Jeremy Lewi 9e381328c0
CRDs should not have status field. (#1175)
* CRDs should not have status field.

* kubeflow/manifests#1174 lots of CRDs have status fields which
  causes ACM to complain that the CRD is invalid.

* This PR cleans up those CRDs and adds an appropriate validation test.

* Fix typos in AWS application specs (kubeflow/testing#668)

* Update tests.
2020-05-18 14:21:37 -07:00
Jeremy Lewi 4307f37657
commonLabels need to be immutable to support upgrading. (#1140)
* Fix kubeflow/manifests#1131

* kustomize commonLabels get subsituted into selector fields. Selector fields
  are immutable. So if commonLabels change (e.g. between versions) then
  we can't reapply/update the existing resources which breaks upgrades
 (kubeflow/kfctl#304)

* For the most part the problematic commonLabels were on our Application
  resources. The following labels were being set

  "app.kubernetes.io/version"
  "app.kubernetes.io/instance"
  "app.kubernetes.io/managed-by"
  "app.kubernetes.io/part-of"

* Version was definetely changing between versions. instance was also changing
  between versions to include the version number.

* managed-by and part-of could also change (e.g. we may not be using kfctl)
* We could still set these labels if we wanted to; we just shouldn't set
  them as commonLabels and/or include them in the selector as the will
  inhibit upgrades with kubectl apply.

* I created a test validate_resources_test.go to ensure none of these
  labels are included in commonLabels

* I created a simple go binary tools/fix_common_labels.go to update
  all the resources.

* generat_tests.py - Delete the code to remove unmatched tests.
  * We no longer generate tests that way and the delete code was going
    to delete valid tests like our new validation test

* Get rid of the clean rule in the Makefile for the same reason.
2020-04-30 22:14:06 -07:00
Jeremy Lewi 68c90fd621
Initial "stack" for deploying Kubeflow on GCP (#962)
* As discussed in http://bit.ly/kf_kustomize_v3 we want to use better
  patterns in our kustomize manifests.

* This PR uses kustomize to compose Kubeflow applications into a "stack" of
  the applications to be installed on GCP.

  * Note: This is only an initial set of applications not all applications
    are installed.

* Define a "stack" a stack is an oppionated way of combining applications
  * a stack is oppinionated about which applications to include and how
    to configure them
  * So we define a GCP stack that would contain the apps to be
    included in GCP deployments.

  * Stacks can be used to replace much of the functionality currently
    achieved by the list of applications in KFDef

    * Instead of listing applications in KFDef we could just point to a
      kustomization file containing the list of applications.

* Define an example illustrating how alice would define an overlay
  to add her kustomizations.

  * Show how she can define a patch to modify the additional configuration
    parameters for her deployment.

Start defining a KFDef that uses the new v3 manifests and the stacks

  * We still need to have multiple applications (and not a single kustomize) application because the namespace for some applications are different
    * e.g. ISTIO is installed in istio-system not kubeflow namespace so
      we keep it as a separate kustomize package

* Create v3 version of a couple applications

  * meta controller
  * notebook controller
  * certmanager

Fix some bugs in some v3 packages

   * jupyter_web_app - base/deployment.yaml had some changes that needed to be removed because these are now in a patch because we don't want the changes to be included in the v3 version

Fix centraldashboard v3

  * Move the resources back to base and make base_v3 depend on base rather than
  the other way around.

  * It also looks like we ended up duplicating resources between base_v3 and
    base; probably because of bad merges and rebase

  * In rolebindings don't use vars to substitute in the namespace just
    hardcode the values as needed.

* Upgrade kustomize to v3.2.0 otherwise the tests don't pass
  * In particular using v3.1.0 the jupyter-web-app doesn't have the unique
    name of the kubeflow-config configmap (i.e. with the content hash)
    substituted into the environment variable configmap ref names.
2020-04-08 13:37:44 -07:00
kubeflow-bot 53329d6f78
[auto PR] Update the notebook-controller image to tag vmaster-gf39279c0 (#954)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-gf39279c0
* Image built from kubeflow/kubeflow@f39279c0
2020-02-26 07:14:35 -08:00
kubeflow-bot a95de6b0c0
[auto PR] Update the notebook-controller image to tag vmaster-g915b84be (#938)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-g915b84be
* Image built from kubeflow/kubeflow@915b84be
2020-02-25 06:52:48 -08:00
kubeflow-bot 909e9df2e2
[auto PR] Update the notebook-controller image to tag vmaster-gb629b36b (#935)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-gb629b36b
* Image built from kubeflow/kubeflow@b629b36b
2020-02-19 09:54:25 -08:00
kubeflow-bot 0a377dd50a
[auto PR] Update the notebook-controller image to tag vmaster-g0cb184ad (#879)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-g0cb184ad
* Image built from kubeflow/kubeflow@0cb184ad
2020-02-06 16:37:32 -08:00
kubeflow-bot 02c8b97bb7
[auto PR] Update the notebook-controller image to tag vmaster-gdfd447a5 (#855)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-gdfd447a5
* Image built from kubeflow/kubeflow@dfd447a5
2020-02-04 17:59:54 -08:00
kubeflow-bot 0831dd56a6
[auto PR] Update the notebook-controller image to tag vmaster-ge0d7b2ed (#839)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-ge0d7b2ed
* Image built from kubeflow/kubeflow@e0d7b2ed
2020-02-03 20:39:34 -08:00
Kunming Qu fc771032e9
livenessProbe for notebook-controller (#837) 2020-02-03 20:39:26 -08:00
kubeflow-bot 9265760292
[auto PR] Update the notebook-controller image to tag vmaster-gc80316d3 (#822)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-gc80316d3
* Image built from kubeflow/kubeflow@c80316d3
2020-01-31 11:37:21 -08:00
Yuan (Bob) Gong 3fe91f521c
Add istio sidecar inject false to deployments in kubeflow ns (#712)
* Add istio sidecar inject false to deployments

* Revert cert-manager changes

* Revert unneeded changes
2020-01-29 04:28:05 -08:00
Zhenghui Wang c6288f9003 Update Notebook Controller to v1.0.0 (#739)
* update notebook manifest

* update notebook controller tests

* use 1.0.0 tag
2020-01-28 12:17:39 -08:00
kubeflow-bot d59fa6cc54 [auto PR] Update the notebook-controller image to tag vmaster-gb1400446 (#763)
* image gcr.io/kubeflow-images-public/notebook-controller:vmaster-gb1400446
* Image built from kubeflow/kubeflow@b1400446
2020-01-28 07:33:40 -08:00
Lun-Kai Hsu a81f77f9b8 update notebook controller image (#682) 2019-12-18 11:55:58 -08:00
Fernando Diaz 298e2bd2b9 Allow nb-controller get/list/watch/create events (#641)
* Allow nb-controller get/list/watch/create events

* Allow jupyter-web-app to list events
2019-12-16 09:45:39 -08:00
Boris Lublinsky 542252eea5 Adding finalizers to enable Kubeflow to run correctly on OpenShift (#612)
* Add finalizers to roles, change argo executor to k8sapi

* Update Argo-UI deployment to make it work

* Update Katib v2

* Aligned Argo config map with master

* Fixed tests

* Updated for the version 0.7

* Updated for the version 0.7

* Updated for the version 0.7

* Updated for the version 0.7

* Regenerated tests
2019-12-06 18:35:54 -08:00
Jeremie Vallee c695a8c789 [3945] Configurable Istio Gateway for Notebook Controller (#499) 2019-10-16 00:55:40 -07:00
Kam Kasravi 9e78376476 Add application overlay for notebook-controller (#451)
* add notebook-controller, katib-controller

* regrouped resources by kind

* unit tests

* add application overlay for notebook-controller

* move to version v0.7.0

* revert merge errors

* update unit tests
2019-10-14 16:19:52 -07:00
Krishna Durai 07e9fd10e3 Replaces null values at ClusterRoles with [] (#462) 2019-10-11 12:24:03 -07:00
Jeremy Lewi e4cf84373a Update the notebook-controller image. (#443)
* Update the notebook-controller image.

* It looks like the code may not have actually changed since we last
  built the image. Nonethless, lets update the tag to use the sha
  as opposed to a tag like v20190911-e8193317-dirty-cd2831

* Update the tests.

* Disable the new test for making sure the images are set because it doesn't
handle digests.
  * Related to #444
2019-10-08 15:13:51 -07:00
Yannis Zarkadas abace77e67 kustomization: fill missing images (#444)
* tests: add script to add images to kustomizations

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* remove tabs from yaml files

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* e2e: check that kustomization is filled correctly

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* format all kustomizations the same way

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* kustomizations: add missing images

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* regenerate tests

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* fix run-tests script

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2019-10-08 10:23:12 -07:00
Krishna Durai 0dd9399454 K8s ClusterRoles for: TFJobs, PyTorchJobs, Notebooks, PodDefaults, Profiles (#393)
* K8s ClusterRoles for: TFJobs, PyTorchJobs, Notebooks, PodDefaults, Profiles

* PodDefault permissions changed
Profiles aggregation removed
2019-10-05 05:37:10 -07:00
Jeremy Lewi e5a91ea617 Revert PRs back to 3e85b3c83d (#429)
* Revert "Metadata fix (#424)"

This reverts commit 019d532ee6.

* Revert "kustomization: add missing images to kustomizations (#405)"

This reverts commit 967b6e09cf.

* Revert "Add new updated Jupyter notebook images (#401)"

This reverts commit c20260264b.

* Revert "first pass on remaining applications (#410)"

This reverts commit 7a6d519916.

* Revert "App instance (#342)"

This reverts commit e97671c579.
2019-10-03 16:35:55 -07:00
Yannis Zarkadas 967b6e09cf kustomization: add missing images to kustomizations (#405)
* tests: add script to add images to kustomizations

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* istio: istio-install: remove tabs from yaml spec

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* kustomizations: format all kustomizations the same way

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* fill missing image fields from kustomizations

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>

* tests: regenerate tests

Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>
2019-10-02 13:06:09 -07:00
Kam Kasravi 7a6d519916 first pass on remaining applications (#410)
* first pass on remaining applications

* update crds

* application for istio-crds, istio-install, istio
2019-10-01 20:07:07 -07:00
Kam Kasravi e97671c579 App instance (#342)
* added config files for kfctl_gcp_iap, kfctl-gcp-e2e-pipelinerun.yaml

* update location of kfctl_gcp_iap

* updated kfctl revision

* error in kfctl-gcp-e2e-pipelinerun.yaml

* mapping value

* map values

* remove testing_repo, kfctl_repo

* changed jupyter-web-app to use instance

* update manifests to use PR

* instance update

* leave generateName blank in params.env

* update to application instance

* change configmap name

* add additional varReferences

* update unit tests

* added application to admission-webhook, argo, profile, pipeline

* use kfctl_gcp_iap from the PR

* update unit tests

* add generateName parameter

* added generateName to kfct_anthos, kfctl_aws

* update kfctl_aws_cognito.yaml, kfctl_gcp_iap.yaml with generateName

* added generateName to kfctl_existing_arriko.yaml

* added kfctl_gcp_basic_auth.yaml

* added kfctl_k8s_istio

* update params.yaml, unit tests

* StatefulSet -> Deployment

* use different config map name

* fix for seldon-core

* update unit tests

* add katib application

* added katib-suggestion
2019-10-01 08:53:33 -07:00
Jerome Brette 0842eb2b97 Update manifest to deploy on Kubernetes 1.16. (#376)
* Kubernetes 1.16: Update Deployment and Ingress objects

- Update Deployment apiVersion to apps/v1
- Update Ingress apiVersion to networking.k8s.io

* Kubernetes 1.16: Solve selector issue in Deployments

* Kubernetes 1.16: Solve selector issue in StatefulSets

* Kubernetes 1.16: Regenerate go code after updating objects

* Kubernetes 1.16: Update RBAC setup

* Kubernetes 1.16: Regenerate after RBAC changes

* Kubernetes 1.16: Adjust RBAC

* Kubernetes 1.16: Regenerate after RBAC changes.
2019-09-23 17:45:24 -07:00
Lun-Kai Hsu b91942c48d Notebook beta changes (#364)
* notebook beta

* fix

* fix
2019-09-18 05:59:29 -07:00
Kam Kasravi 688848de37 Notebook controller fix (#170)
* remove unused variable viewer-clusterDomain from pipeline/pipelines-viewer/base/kustomization.yaml, regenerate tests

* add namespace to kustomization.yaml

* update unit-tests
2019-06-23 12:19:54 -07:00
Kunming Qu adf8de1d3e config update (#158) 2019-06-19 18:30:48 -07:00
zabbasi e1c03026ad Add notebook crd validation (#109)
* add notebook crd validation

* updated the test

* fixed crd validation

* run make test
2019-06-18 11:02:20 -07:00
Kam Kasravi 67258048cd application target, jupyter-web-app, notebook-controller application overlays (#121)
* [WIP] enable applications

* update application CRs

* update jupyter-web-app application

* update unit-tests

* changes to application controller

* update unit tests

* update to unit tests

* adding recommended labels (https://kubernetes.io/docs/concepts/overview/working-with-objects/common-labels/)

* add addOwnerRef

* update images in application

* remove icons since the array is empty

* add selector in application

* add uber application

* uber application

* change version to v1beta1
2019-06-18 09:28:18 -07:00
Kam Kasravi 91c22388f5 Istio namespace overlay (#102)
* ISTIO related deployments are deployed to wrong namespace

* adding unit tests

* new tests, fixes for istio overlays

* exclude istio-install base test for now

* update istio-install

* remove duplicate CRDs from istio-noauth.yaml

* reprocessed istio-noauth.yaml

* separate crds from install

* add commonLabel for mysql

* secretName should be tlsSecretName

* rebase from upstream master

* reconcile with master

* remove istio overlay for pipeline-viewer

* add istio overlay to katib

* update tests for katib and basic-auth istio overlays

* reorder crd.yaml

* separate cert-manager-crds since kustomize doesn't do ordering

* fix for invalid image

* add USE_ISTIO to notebook-controller

* update notebook_controller image, fix for namespace ref

* fix for namespace

* fix for couldn't find key errors

* separate pytorch crd since kustomize doesn't guarantee ordering

* separate pytorch crd since kustomize doesn't guarantee ordering

* revert back to including pytorch crd in pytorch-operator

* remove application.yamls and add as overlay in followup PR

* update tests, fix for pytorch-jobs to separate out the crd

* rebase from upstream/master

* add selector

* update unit-test for pytorch-operator

* move to v1, v1beta1 versions

* update to crd version and deployment

* update to deployment using var

* fixes for deployment merge

* versions must list v1beta2 first

* fix for argo-ui

* define POD_LABELS, USE_ISTIO vars in kustomization.yaml

* provide default value for USE_ISTIO

* sync unit-tests with notebook-controller fix of setting the default value of USE_ISTIO
2019-06-07 09:56:01 -07:00
zabbasi 5b2e9d5a19 added Jupyter application resources (#105)
* WIP: added Jupyter application resources

* formating fix and generating tests
2019-05-29 18:58:19 -07:00
Andrey Velichkevich 509fc32889 Fix ClusterRole in Jupyter Notebook controller (#70)
* Fix ClusterRole

* Change clusterrole
2019-05-19 20:15:11 -07:00
Abhilash Pallerlamudi 8ddefcbe92 Notebook controller requires rbac to create istio virtualservices (#78)
Signed-off-by: Abhilash Pallerlamudi <stp.abhi@gmail.com>
2019-05-16 10:36:59 -07:00
zabbasi ab9b3f3d1c added generalized admission-webhook component (#33)
* added generalized admission-webhook

* merged with master

* added podpreset access to jupyter-web-app

* update images

* fixed admission-webhook roles

* applied code review comments

* undo changes for  README

* undo README changes

* fixed bootstrap service account name

* fixed bugs

* seperated webhook and bootstrap components of admission-webhook

* fixed to pass tests

* skip juptrer test

* skip profile test

* fixes to skip tests

* removed jupyter and profiles test

* skipped admission-webhook unit test

* removed temp files

* small fix

* skip webhook and bootstrap unit test

* remove webhook related unit tests

* fixtes for test names
2019-05-15 20:09:42 -07:00
Kam Kasravi 4fa02fee7f port to kustomize, initial checkin (#13)
* port to kustomize, initial checkin

* updates to agree with design doc

* fixes for ambassador, centraldashboard and pytorch-operator

* fixes to argo, pipeline, and tensorboard

* make sure newTags are strings

* add array entry in notebook-controller

* indent error

* workaround for kustomize issue #677

* kustomization file is generated by package manager, added cert-manager, updates to pipeline, tensorboard

* add resources for cert-manager

* add cloud-endpoints

* added gcp-credentials-admission-webhook gpu-driver iap-ingress

* add spartakus

* update spartakus

* use same grouping as ksonnet

* remove kubeflow namespace references, make image reference a string

* Workaround for https://github.com/kubernetes-sigs/kustomize/issues/677

* StatefulSet requires serviceName

* clientConfig requires namespace

* added kubebench

* removed namespace from subjects since this is provided at the top-level kustomization.yaml

* add application, remove ksonnet label

* fixes for formatting, indentation

* added profiles

* using resources.yaml until integration

* update to README.md

* replace schema with the latest from sigs.k8s.io/application

* re-adding ambassador from master

* use argo/base files on master

* rebase from master

* rebase from master for jupyter-web-app

* rebase jupyter-web-app from master

* rebase from master

* rebase notebook-controller from master

* rebase pipelines from master

* remove namespace from pipelines, fix non-nested name

* fix indentation error in jupyter-web-app service

* fix indentation for jupyter-web-app/base/service.yaml

* deploy application from callback hook

* move pipelines back to pipeline so we can use either ksonnet or kustomize until conversion is complete

* refactored jupyter-web-app to use namePrefix, added role, rolebinding

* jupyter-web-app refactoring

* refactoring jupyter to better use kustomize

* additional fixes

* additional fixes in cloud-endpoints

* fixes for gcp-credentials-admission-webhook

* indentation error on env

* indentation errors

* modify an existing Application via callback

* move application CRD to metacontroller

* update application definition

* fix for Application CRD

* remove Application CR

* add Application CRD to metacontroller

* added jupyter and overlap for minikube

* standardized all components to be under base or overlays/<platform>

* fixes for base dirs

* remove katib for now, define a kustomization file for pipeline

* update README.md

* update to README.md

* rebase from upstream master

* update README.md

* update README.md

* update to README.md

* update to README.md

* fix for jupyter target
2019-04-18 10:30:05 -07:00