notebooks

Commit Graph

Author	SHA1	Message	Date
Kimonas Sotirchos	b19054e75b	Create an Angular Library with common frontend code (kubeflow/kubeflow#5252 ) Create an Angular Library with common frontend code. Our crud web apps should use this library to share common functionality like: * Talking to Central Dashboard for the Namespace selection * Making http calls * Surfacing and showing error messages and warnings * Form utilities * Showing a table with entries and actions Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com>	2020-08-28 05:14:53 -07:00
Konstantinos Andriopoulos	1936429ea5	Tensorboard controller: Add scheduling functionality for Tensorboard servers that use RWO PVCs as log storages (kubeflow/kubeflow#5218 ) * Add indexers as custom field selectors for list requests to cache The tensorboard controller must be able to list pods that have mounted a PVC with a specific ClaimName. In order for this list request to cache to work properly, custom field selectors are added. These selectors are used to index the "pod.spec.volumes.persistentvolumeclaim.claimname" field so that unneeded pods can be filtered out. * Set pod's nodeAffinity if log files exist in a PVC In the case of using a PVC as a logdir for Tensorboard Server, if the PVC had a ReadWriteOnce access mode and was alread mounted by another running pod X, then the Tensorboard Server pod would not always be scheduled on the same node as X. As a result, the Tensorboard Server pod would be blocked since multi-node access is prohibited on ReadWriteOnce volumes. In order for the Tensorboard Server pod to run successfully, nodeAffinity was added to the spec.template.spec.affinity field of the returned deployment. As a result, both X and the Tensorboard Server pod are now scheduled on the same node. Resolves kubernetes/kubernetes#26567 * Set Tensorboard Server scheduling feature to 'off' by default In the case that the Tensorboard Server used a RWO PVC (as a log storage) that was already mounted by another pod, nodeAffinity was used so that the Tensorboard Server would be scheduled (if possible) on the same node as that pod. Now, this added functionality is used only if the 'RWO_PVC_SCHEDULING' environmental variable is set to "true" when running the Tensorboard controller. This scheduling functionality is disabled by default.	2020-08-26 02:58:03 -07:00
Kimonas Sotirchos	1a0a3986d2	Add owners for the Notebooks Controller (kubeflow/kubeflow#5240 ) Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com>	2020-08-25 06:34:16 -07:00
Konstantinos Andriopoulos	f55c0d77dc	tensorboard web-app: Create Tensorboard web-app backend (kubeflow/kubeflow#5180 ) * Create Tensorboard web-app backend Create the code for the Tensorboard web-app backend which includes routes for GET, POST and DELETE requests. The backend is created with Python/Flask, so it also uses the common code from 'kubeflow.kubeflow.crud_backend'. * Add 'get_age(k8s_object)' function to 'crud_backend' common code It would be useful for all web apps of the 'crud-web-apps' folder to return age information to their frontends. As a result, 'get_age(k8s_object)' was added to the common code, so that all web apps can use it.	2020-08-20 03:25:22 -07:00
Kimonas Sotirchos	1db8a22ca9	Common code between the different python backends (kubeflow/kubeflow#5164 ) Create a python module under the kubeflow.kubeflow package that will be exposing common code and a base app the takes care of: * Exceptions handling * Common routes for serving static files and their cache control policy * Authorization checks with SubjectAccessReview * Authentication checks on the Kubeflow headers * Common helper functions for dates, yaml parsing etc * health/liveness probes Backends that are written with Python/Flask should use this common code in order for us to reduce code duplication and have our backends align with our accepted practices. Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com>	2020-08-07 07:30:18 -07:00
Kimonas Sotirchos	db97455152	Add OWNERs file to tensorboard controller (kubeflow/kubeflow#5088 ) The tensorboard controller should have a distinct list of reviewers and approvers. Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com>	2020-08-07 06:32:19 -07:00
Kimonas Sotirchos	40838253e4	Create a new directory in components for web apps (kubeflow/kubeflow#5184 ) * Create a new directory in components for web apps Since we want to also have some common code between our web apps we should create a parent dir for any future web app we want to develop. The code for the web apps, common or not, should be organized under this directory. Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * remove the reviewers Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com>	2020-08-05 06:36:29 -07:00
Nihir Patel	b13382b558	notebook_controller.go: make clusterDomain an option (kubeflow/kubeflow#4468 )	2020-07-03 19:42:48 -07:00
Humair	8470751a58	Fix notebook controller rbac gen (kubeflow/kubeflow#5083 )	2020-06-22 07:18:39 -07:00
Konstantinos Andriopoulos	9ae8d1ff40	tensorboard-controller: Mount GCP secret only when accessing Google storage (kubeflow/kubeflow#5069 ) * Remove duplicate package import Package "k8s.io/api/core/v1" was imported twice with names "v1" and "corev1". * Mount GCP secret only when accessing Google storage The Tensorboard controller used to create pods (running the Tensorboard server) that would always mount user-gcp-sa secret, regardless of the logs storage being a Google cloud bucket or not. This would lead to pods never starting properly in the case of using other cloud services (or PVCs) as log storages, if the user-gcp-sa secret didn't exist on the cluster. In order for the Tensorboard server pods to run properly, user-gcp-sa secret is now mounted only when Google cloud buckets are used as log storages. Fixes kubeflow/kubeflow#5065	2020-06-18 06:46:20 -07:00
Ali Soume'e	6942bf5f87	Remove duplicate import (kubeflow/kubeflow#5058 ) "k8s.io/api/core/v1" was imported with names "corev1" and "v1"	2020-06-08 20:47:19 -07:00
Chad Roberts	25bf002c34	Adding env var to suppress automatic additon of fsGroup in notebook pod (kubeflow/kubeflow#4713 ) (kubeflow/kubeflow#4782 ) * Allowing for an env var ADD_FSGROUP to be set to false to suppress the automatic addition of fsGroup: 100 in the pod's security context. This addresses issue #4617. * Adding note in README regarding ADD_FSGROUP.	2020-02-19 09:08:25 -08:00
Yannis Zarkadas	e02a82fbcc	notebook-controller: Fix event filtering (kubeflow/kubeflow#4777 ) This commit fixes the event filtering check, so it doesn't crash when the Pod name doesn't contain a dash ("-"). Signed-off-by: Yannis Zarkadas <yanniszark@arrikto.com>	2020-02-19 08:44:25 -08:00
Zhenghui Wang	e8bf7974d4	add loadtest for notebook controller (kubeflow/kubeflow#4779 )	2020-02-18 21:00:25 -08:00
Jeremy Lewi	0895c4d135	Fix docker builds of notebook and tensorboard controller (kubeflow/kubeflow#4664 ) * Fix docker builds of notebook and tensorboard controller * The notebook-controllers and tensorboard-controllers now depend on the go package components/common * We need to rewrite the Dockerfiles so that the context is now ${KUBEfLOW_REPO}/common * so that components/common can be included in the context and copied to the Dockerfile * Create skaffold configs to make it easier to do remote builds with Kaniko * The skaffold configs are currently written assuming the kubeflow-ci cluster is used to build the images. This could be generalized in the future. * Remove the code to build the notebook-controller with GCB; we can just use skaffold and kaniko to do efficient remote builds. * Related to #4582 - Jupyter image doesn't build. * Fix docker build rule.	2020-01-21 17:54:34 -08:00
Zhenghui Wang	89acff862c	Add Notebook Controller v1 spec (kubeflow/kubeflow#4649 ) * add v1 spec * change kubeflow.org_nootebooks.yaml	2020-01-13 19:43:08 -08:00
Zhenghui Wang	e5410cd7c8	add source code of MPL licensed library. (kubeflow/kubeflow#4643 )	2020-01-10 15:57:37 -08:00
Zhenghui Wang	4d2dc369cf	Update notebook ctrler dockerfile (kubeflow/kubeflow#4641 )	2020-01-09 13:56:34 -08:00
Jeremy Lewi	d25a14aea2	Fix notebook controller and tensorboard controller docker image build. (kubeflow/kubeflow#4631 ) * The jupyter docker image isn't building because it now depends on code in components/common * To make this work we need to configure it as a multi module package and modify go.mod to redirect to a local path. * Ref: https://github.com/golang/go/wiki/Modules#when-should-i-use-the-replace-directive * Replaces PR #4583 Related to #4582 - Jupyter image doesn't build.	2020-01-07 16:25:41 -08:00
Zhenghui Wang	71918b8b64	Add licensing info for Notebook Controller (kubeflow/kubeflow#4623 ) * add files for third party licensing for notebook ctlr * lint	2020-01-06 23:20:17 -08:00
Jeremy Lewi	a28e6692d6	Move the CD scripts and Tekton pipelines into kubeflow/testing (kubeflow/kubeflow#4593 ) * Delete all the Tekton pipelines and scripts for continuous delivery of Kubeflow applications because they are moving into kubeflow/testing * kubeflow/testing#551 is the PR moving the code into kubeflow/testing Related to: kubeflow/testing#544 redo how we use kustomize and Tekton to parameterize the pipelines	2019-12-30 07:09:39 -08:00
Fernando Diaz	1ff2f7a880	Reissue pod and sts events as notebook events (kubeflow/kubeflow#4139 )	2019-11-21 12:07:29 -08:00
MrXinWang	d4fb94b020	Add arm64 support for controllers (kubeflow/kubeflow#4438 ) Change-Id: I9f4b4871a5d02a53230abb836787f665dd8e3998 Signed-off-by: Henry Wang <henry.wang@arm.com> Jira: ENTOS-1322	2019-10-31 19:53:23 -07:00
Quanjie Lin	1236c5e6d7	initial checkin of tensorboard controller (kubeflow/kubeflow#4312 ) * initial checkin of tensorboard controller * initial checkin of tensorboard controller * typo * typo * fix typo * support local path * add status * conflict * remove binary	2019-10-29 09:12:44 -07:00
Lun-Kai Hsu	2fe3108347	fix notebook route (kubeflow/kubeflow#4402 )	2019-10-24 16:01:39 -07:00
Ben Ye	2e7dc7ec06	add culling metrics (kubeflow/kubeflow#4336 ) Signed-off-by: yeya24 <yb532204897@gmail.com>	2019-10-17 21:37:57 -07:00
Ben Ye	d14f6ac07f	support metrics in notebook-controller (kubeflow/kubeflow#4123 ) Signed-off-by: yeya24 <yb532204897@gmail.com>	2019-10-16 00:15:40 -07:00
Kam Kasravi	c1eca0937c	Ci for components (kubeflow/kubeflow#4238 ) * snapshot * fixes to service-account and task * adding admission-webhook, notebook-controller * update to README.md * update README.md	2019-10-15 08:31:53 -07:00
Jeremie Vallee	c88e721fc7	[3945] Configurable Istio Gateway for Notebook Controller (kubeflow/kubeflow#4216 )	2019-10-14 12:06:59 -07:00
Ben Ye	807843ec2a	cleanup some codes in notebook controller (kubeflow/kubeflow#4098 ) * cleanup some codes in notebook controller Signed-off-by: yeya24 <yb532204897@gmail.com> * remove ambassador in notebook controller Signed-off-by: yeya24 <yb532204897@gmail.com>	2019-10-14 12:06:52 -07:00
Jerome Brette	b5ff201a8c	Migrate kustomize.go to Kustomize3 (kubeflow/kubeflow#4055 ) * Migrate to kustomize3: Phase 1. Update kustomization.yaml * Migrate to kustomize3: Phase 2: Update kustomize.go - Update kustomize.go to match new package structure. - Update module dependencies. * Migrate to kustomize3: Phase 3: Implements code review - As per request, revert kustomization.yaml back to deprecated syntax. - As per request, revert kustomize.go to use deprecated .Bases field. - Note: patchesStrategicMerge: will be turned into a deprecated field pretty soon. - Rerun go mod tidy * Migrate to kustomize3: Phase 4: Activate legacy order transformer	2019-09-20 21:21:25 -07:00
Lun-Kai Hsu	2f2938bead	Notebook v1beta1 (kubeflow/kubeflow#4105 ) * add v1beta1 * add storage version * wip * add conversion * setup webhook * fix * fix manifest * webhook wip * no webhook	2019-09-13 07:04:29 -07:00
Lun-Kai Hsu	8cad496a13	Migrate notebook CR to kubebuilder V2 (kubeflow/kubeflow#4013 ) * wip * can build * tested: able to control notebook * fix	2019-09-04 17:06:22 -07:00
Kimonas Sotirchos	08f43598c2	Culling of Idle Jupyter Notebooks (kubeflow/kubeflow#3856 ) * Create a culler as a package Helper functions for culling resources. Takes for granted that ISTIO is installed to the system and queries Prometheus to get metrics. Specifically, requests/{configurable time}. If the resource should be culled, then it should be done by setting an annotation. This way the UIs can also show that the Resource is stopping and also easily stop a resource by making a PATCH request. Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Culling logic enhancements Add necessary ENV Vars. Culling won't happen by default. To enable it the user will need to set the ENABLE_CULLING=true Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Misc fixes in logging and comment cleanup Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Fix typo Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Add Notebooks specific culling Query the /api/status endpoint of each Server Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Remove the generic culling logic We need to discuss if it would make sense to have this logic as a go library, or use knative. Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Add unit tests Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Remove unused code Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Review changes #1 * rename `getEnvDef` to `getEnvDefault` * Add a comment to describe how the STOP_ANNOTATION gets used Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com> * Make cluster domain configurable Signed-off-by: Kimonas Sotirchos <kimwnasptd@arrikto.com>	2019-08-26 04:40:21 -07:00
Kam Kasravi	0b5e3bd995	add kkasravi to OWNERS (kubeflow/kubeflow#3311 )	2019-06-18 16:58:32 -07:00
Gabriel Wen	70bd7acdf5	Merge branch 'master' into fix-notebook-controller	2019-06-03 14:33:05 -07:00
zabbasi	daa4768f96	Add details to "conditions" in notebook status (kubeflow/kubeflow#3319 ) * added detailes into NotebookCondition to keep track of notebook container status change * update notebook controller image * fix conitions update * small fix * temporary changes to debug * temporary remove delete step from workflow for debugging * temoraray merging kfctl-test and kfctl-go-test fir debugging * debugging * undo the mistake * debugging * debugging tests * merged kfctl-test and kfctl-go-test * remove wait-for-kubeflow * merged with master * remove test delete step for debugging * small fix * update jupyter test component * update condition test for jupyter component * revert back deleting step * revert back change in kfctl.sh * added some temporary change to debug jupyter-test * revert back temp changes	2019-06-03 14:19:30 -07:00
Gabriel Wen	c22959f0ac	check env when setting watch	2019-06-03 13:54:10 -07:00
Gabriel Wen	525eee5ed8	update notebook_controller to use env	2019-06-03 13:15:56 -07:00
Kunming Qu	42bbb0cdbf	profile and Istio integration (kubeflow/kubeflow#3234 ) * profile and Istio integration * make profile manage Istio gateway * add README.md * make notebooks use gateway in kubeflow namespace * gateway format to ns/name; add watch for istio ServiceRoleBinding * Support setting auth header format via parameter * update README * update README * update readme; resolve comments	2019-05-29 18:36:19 -07:00
zabbasi	a7e7d75be9	Renamed PodPreset CRD to PodDefault (kubeflow/kubeflow#3320 ) * renamed PodPreset CRD to PodDefault * typos * update jupyter-web-app image	2019-05-21 11:22:10 -07:00
zabbasi	5ae44fbdb4	Integrates notebook-controller and jupyter-web-app with admission-webhook (kubeflow/kubeflow#3245 ) * integrate jupyter-web-app and notebook-controller with webhook * merged podpreset component into admission-webhook * applied cr comments * undo notebook image for tesing * update notebook controller image * temporaray disbaling kubeflow delete to debug presubmit failure * temporary remove cluster delete in kfctl workflow test * typo * typo * undo debugging changes	2019-05-20 12:39:13 -07:00
Kunming Qu	a80025787b	enable Istio Injection in user-created namespace; notebook and Istio integration (kubeflow/kubeflow#3235 ) * enable Istio Injection in user-created namespace; notebook service and Istio rbac integration * update README	2019-05-09 16:59:58 -07:00
Hung-Ting Wen	58c977c8e9	ISTIO support for notebook controller (kubeflow/kubeflow#3104 ) * virtual service func init * create virtualservice * fix * fix * add cluster role * fix unstructured format * updates * fix * reconcile virtual service * fix * revert quote changes * add virtualservice update * comment * copy if spec is not found in toSpec * add watch event	2019-04-29 11:43:19 -07:00
Lun-Kai Hsu	9f70ca7f10	add labels for notebook so that gcp credentials will be injected by webhook (kubeflow/kubeflow#2853 ) * add labels for gcp cred * kfctl set flag * review comment * review comment	2019-03-30 20:36:33 -07:00
Lun-Kai Hsu	dc69b63667	notebook CR shows container status (kubeflow/kubeflow#2787 ) * wip * fix * fix format	2019-03-26 17:08:47 -07:00
zabbasi	2500faee10	added ReadyReplicas status to notebook-controller (kubeflow/kubeflow#2743 ) * added ReadyReplicas status to notebook-controller * fixed issues related to updating the notebook status * fixed a problem in updating Notebook's status * applied cr comments * small change * small formating change	2019-03-21 21:46:18 -07:00
Lun-Kai Hsu	d47a5864ec	pf gcb (kubeflow/kubeflow#2603 )	2019-03-04 17:08:23 -08:00
Lun-Kai Hsu	bfa59d7769	fix (kubeflow/kubeflow#2620 )	2019-03-04 16:48:17 -08:00
Lun-Kai Hsu	931e8e32aa	Add status to notebook (kubeflow/kubeflow#2558 ) * wip * wip * update test to check status condition * fix	2019-03-04 14:36:17 -08:00
Lun-Kai Hsu	a9b8f4e8a0	fix (kubeflow/kubeflow#2506 )	2019-02-26 11:21:53 -08:00
Abolfazl Shahbazi	4c48320235	Update python code styles based on what's provided in .style.yapf (kubeflow/kubeflow#2447 ) * Fix Python code styles based on Pep8 and flake8 * More syle fixes to Python code * Update python code styles based on what's provided in .style.yapf * Sync with master and update styles * Sync with master * More Python style fixes * Changes per code review * Sync with master and update the remaining files * Add a .flake8 config file for future reference	2019-02-19 22:44:30 -08:00
Lun-Kai Hsu	ffc9a1d674	Add build with GCB support to notebook controller (kubeflow/kubeflow#2486 ) * fix * fix * ignore	2019-02-15 11:52:54 -08:00
Lun-Kai Hsu	e377455ce4	Notebook controller fixes (kubeflow/kubeflow#2463 ) * fix * enable e2e test * fix * fix * fix logging for pytest * fix * fix * fix * fix * fix * fix * address review * review comment	2019-02-15 00:09:02 -08:00
Lun-Kai Hsu	b7555c6727	NB controller fix (kubeflow/kubeflow#2439 ) * fix * fix	2019-02-10 17:37:51 -08:00
Abolfazl Shahbazi	ae07f8d4d8	port leftover diff from kfapp-ksapp branch after kfctl merge (kubeflow/kubeflow#2410 ) * port leftover diff from kfapp-ksapp branch after kfctl merge * minor gofmt fix	2019-02-10 10:50:08 -08:00
Lun-Kai Hsu	fa3b0b3b0b	Golang notebook controller (kubeflow/kubeflow#2336 ) * kubebuilder init * replae dep with modules * add notebook api * notebook controller impl * remove test * fix dockerfile * fix svc reconcile * notebook controller ksonnet * update generated crd * add sample * remove TODO * make golang version an arg * rename * fix path * add README * Add todo in readme * remove arg default	2019-02-05 16:43:39 -08:00

... 5 6 7 8 9

407 Commits