pipelines

Commit Graph

Author	SHA1	Message	Date
Alexey Volkov	8ba366b03f	SDK - Made outputs with original names available in ContainerOp.outputs (#3734 ) * SDK - Made outputs with original names available in ContainerOp.outputs Previously, ContainerOp had strict requirements for the output names, so we had to convert all the names before passing them to the ContainerOp constructor. Outputs with non-pythonic names could not be accessed using their original names. Now ContainerOp supports any output names, so we're now using the original output names. However to support legacy pipelines, we're also adding output references with pythonic names. * Fixed the compiler test data * Fixed the duplicate parameter outputs in the compiled workflow * Fixed long line * Stabilized the output naming conflict resolution * Fix case of missing special outputs	2020-05-12 19:08:26 -07:00
Alexey Volkov	fe30d5462a	SDK - Components - Calculate component hash digest (#3726 ) * SDK - Components - Calculate component hash digest The digest is calculated when loading the component from URL, file or text. Slightly refactored component loading - streams are no longer used, only bytes. TODO: Calculate the digest if missing TODO: Report possible digest conflicts * Updated the test graph component * Using the actual digest in the test	2020-05-12 18:24:26 -07:00
Alexey Volkov	b9aa106bb5	SDK - Prioritize lib2to3 when stripping type annotations (#3724 ) * SDK - Prioritize lib2to3 when stripping type annotations It's a standard python library (although not well supported) and it does not leave trailing spaces. * Fixed compiler test data	2020-05-11 18:44:20 -07:00
Alexey Volkov	2279bde698	SDK - Annotate pods with component_ref (#3727 ) * SDK - Annotate pods with component_ref This preserves the information about the digest of the component and the location from which the component was loaded. * Fixed compiler tests	2020-05-11 17:18:21 -07:00
Niklas Hansson	05c1537f28	Add Nodeselector to pipelineconfig fix issue #2863 (#3616 ) * updated version * added pipeline nodeselector * removed old legacy * renaming * update test * Update sdk/python/kfp/compiler/compiler.py	2020-05-05 00:11:08 -07:00
Eterna2	9167da1b4e	Support execution throttling for executing the pipelines (#3346 ) (#3439 ) * Add parallelism limits to pipeline in kfp sdk * fix lint error	2020-05-04 23:25:08 -07:00
Alexey Volkov	9619655ed5	SDK - Enabled file inputs to be optional (#3620 ) * SDK - Enabled file inputs to be optional * Added unit tests	2020-04-27 19:34:04 -07:00
Jiaxiao Zheng	aa8da64b4c	[SDK] Add pod labels for telemetry purpose. (#3578 ) * add telemetry pod labels * revert the id label * update compiler tests * update cli arg * bypass tfx * update docstring	2020-04-27 18:50:04 -07:00
Alexey Volkov	e41ee9cdf7	SDK - Components - Task objects now have the .output attribute when component has only one output (#3622 )	2020-04-26 18:47:28 -07:00
Alexey Volkov	6cb92d45c8	SDK - Compiler - Include the SDK version information in the compiled workflows (#3583 ) * SDK - Compiler - Include the SDK version information in the compiled workflows * Fixed the unit tests * Removed the sdk_version annotation.	2020-04-25 01:49:28 -07:00
Niklas Hansson	2354776e1e	fix #2802 : Set ImagePullPolicy per pipeline. (#3534 ) * bump version * default image pull policy * Update sdk/python/kfp/dsl/_pipeline.py * task setting should dominate * Update sdk/python/kfp/dsl/_pipeline.py * fixed merge mistake	2020-04-23 07:09:13 -07:00
Alexey Volkov	b63ad7e614	SDK - Removed the ArtifactLocation feature (#3517 ) * SDK - Removed the ArtifactLocation feature The feature was deprecated in v0.1.34 https://github.com/kubeflow/pipelines/pull/2326 * Removed the artifact_location sample	2020-04-23 00:49:44 -07:00
Yuan (Bob) Gong	2742a3ed95	[SDK] Make service account configurable for build_image_from_working_dir (#3419 ) * Add kfp-container-builder sa * Allow service account to be configurable * Fix tests * Fix test * Use documentation for service account to introduce compatibility with different types of installation * updated doc * clean up * Update container_builder_test.py * Update _build_image_api.py * Update kustomization.yaml * Add executable permission for presubmit tests mkp.sh	2020-04-15 00:06:02 -07:00
Alexey Volkov	7ee500f702	SDK - Tests - Improved tests for serializing lists containing objects (#3326 ) Added test_fail_on_handling_list_arguments_containing_python_objects Added test_handling_list_arguments_containing_serializable_python_objects Moved test_handling_list_arguments_containing_pipelineparam to component_bridge_tests	2020-03-24 10:06:45 -07:00
Alexey Volkov	deb62f6b50	Style - Moved imports to the start of the file (#3325 )	2020-03-21 22:08:44 -07:00
Alexey Volkov	be12ccf2a1	SDK - Moved the @python_component decorator test to dsl tests (#3324 ) * SDK - Moved the @python_component decorator test to dsl tests * Deprecate @python_component	2020-03-21 08:14:43 -07:00
Alexey Volkov	194278337b	SDK - Moved python op pipeline compilation test to bridge tests (#3323 )	2020-03-21 00:18:44 -07:00
Alexey Volkov	734b43e3db	SDK - Added support for maxCacheStaleness (#3318 ) * SDK - Added support for maxCacheStaleness * Added the vendor prefix to the annotation	2020-03-20 13:38:09 -07:00
Alexey Volkov	264ff37c1e	SDK - Moved _dsl_bridge to dsl (#3267 ) This is a pure refactoring change. The components library should not have any dependencies on the DSL library.	2020-03-14 00:12:34 -07:00
Alexey Volkov	119e329108	SDK - Components - Fixed handling collection return values (#3263 ) * SDK - Components - Fixed handling collection return values Fixes https://github.com/kubeflow/pipelines/issues/3262 * Fixed the tests	2020-03-12 23:50:39 -07:00
Alexey Volkov	8ca603d679	SDK - Tests - Testing command-line resolving explicitly (#3257 ) * SDK - Tests - Testing command-line resolving explicitly After the recent small refactoring of the task resolving flow in the component library, some tests were left unupdated with compatibility shims added to make the tests pass. This PR updates the remaining tests and removes the shims. This mostly involves explicitly using `_resolve_command_line_and_paths`. Some tests that validate the behavior of the dsl bridge were moved to `component_bridge_tests.py` * Indented the component texts	2020-03-11 19:38:38 -07:00
Ilias Katsakioris	c220059c8d	SDK/DSL: Enable the deletion of a resource via ResourceOp method (#3213 ) * SDK/DSL: Enable the deletion of a resource via ResourceOp method * Add the method delete() to ResourceOps * Extend ResourceOp & VolumeOp tests Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com> * Fix ValueError not being raised	2020-03-10 16:07:36 -07:00
xiaohanhuang	e704067d15	add an optional name for dsl.Condition (kubeflow#3210) (#3212 ) * add an optional name for dsl.Condition (kubeflow#3210) * add unit test	2020-03-05 21:45:22 -08:00
Alexey Volkov	578d8de91d	SDK - Reduce python component limitations - no import errors for cust… (#3106 ) * SDK - Reduce python component limitations - no import errors for custom type annotations By default, create_component_from_func copies the source code of the function and creates a component using that source code. No global imports are captured. This is problematic for the function definition, since any annotation, that uses a type that needs to be imported, will cause error. There were some special provisions for NamedTuple, InputPath and OutputPath, but even they were brittle (for example, "typing.NamedTuple" or "components.InputPath" annotations still caused failures at runtime). This commit fixes the issue by stripping the type annotations from function declarations. Fixes cases that were failing before: ```python import typing import collections MyFuncOutputs = typing.NamedTuple('Outputs', [('sum', int), ('product', int)]) @create_component_from_func def my_func( param1: CustomType, # This caused failure previously param2: collections.OrderedDict, # This caused failure previously ) -> MyFuncOutputs: # This caused failure previously pass ``` * Fixed the compiler tests * Fixed crashes on print function Code `print(line, end="")` was causing error: "lib2to3.pgen2.parse.ParseError: bad input: type=22, value='=', context=('', (2, 15))" * Using the strip_hints library to strip the annotations * Updating test workflow yamls * Workaround for bug in untokenize * Switched to the new strip_string_to_string method * Fixed typo. Co-Authored-By: Jiaxiao Zheng <jxzheng@google.com> Co-authored-by: Jiaxiao Zheng <jxzheng@google.com>	2020-02-24 20:50:48 -08:00
Alexey Volkov	7ee3244f5b	SDK - Components - Fixed dict-style type annotations (#3107 ) Refactored `_data_passing.py` interface to expose functions instead of dictionaries.	2020-02-18 20:40:25 -08:00
Alexey Volkov	839198f502	SDK - Fixed the broken kfp.gcp.use_preemptible_nodepool extension (#3091 ) It was generating broken Kubernetes structures that made the workflow fail at submission time. Fixes https://github.com/kubeflow/pipelines/issues/2847	2020-02-14 17:27:28 -08:00
Yuan (Bob) Gong	02fabd306e	[Testing] Use google/cloud-sdk:279.0.0 to resolve workload identity flakiness (#3019 ) * [Testing] Use gke 1.15.8 to mitigate workload identity flakiness * Upgrade gcloud version * Update image builder image too * Turn on workload identity * Update deploy-cluster.sh * secret sample uses python3 instead * Increase xgboost time limit * Revert files with bad format * Update component and pipelines to use gcloud 279.0.0 * Fix secret sample using python3 * Upgrade frontend integration test image * Rebuild frontend integration test image	2020-02-11 18:34:07 -08:00
Alexey Volkov	4a1b282461	SDK - Compiler - Fixed ParallelFor argument resolving (#3029 ) * SDK - Compiler - Fixed ParallelFor name clashes The ParallelFor argument reference resolving was really broken. The logic "worked" like this - if the name of the referenced output contained the name of the loop collection source output, then it was considered to be the reference to the loop item. This broke lots of scenarios especially in cases where there were multiple components with same output name (e.g. the default "Output" output name). The logic also did not distinguish between references to the loop collection item vs. references to the loop collection source itself. I've rewritten the argument resolving logic, to fix the issues. * Argo cannot use {{item}} when withParams items are dicts * Stabilize the loop template names * Renamed the test case	2020-02-11 12:18:09 -08:00
Alexey Volkov	c83aff2738	SDK - Components - Made it easier to access component spec classes (#2860 ) * SDK - Components - Made it easier to access component spec classes * Updated the imports	2020-01-31 11:41:21 -08:00
Alexey Volkov	2d9f2524c1	SDK - Components refactoring (#2865 ) * SDK - Components refactoring This change is a pure refactoring of the implementation of component task creation. For pipelines compiled using the DSL compiler (the compile() function or the command-line program) nothing should change. The main goal of the refactoring is to change the way the component instantiation can be customized. Previously, the flow was like this: `ComponentSpec` + arguments --> `TaskSpec` --resolving+transform--> `ContainerOp` This PR changes it to more direct path: `ComponentSpec` + arguments --constructor--> `ContainerOp` or `ComponentSpec` + arguments --constructor--> `TaskSpec` or `ComponentSpec` + arguments --constructor--> `SomeCustomTask` The original approach where the flow always passes through `TaskSpec` had some issues since TaskSpec only accepts string arguments (and two other reference classes). This made it harder to handle custom types of arguments like PipelineParam or Channel. Low-level refactoring changes: Resolving of command-line argument placeholders has been extracted into a function usable by different task constructors. Changed `_components._created_task_transformation_handler` to `_components._container_task_constructor`. Previously, the handler was receiving a `TaskSpec` instance. Now it receives `ComponentSpec` + arguments [+ `ComponentReference`]. Moved the `ContainerOp` construction handler setup to the `kfp.dsl.Pipeline` context class as planned. Extracted `TaskSpec` creation to `_components._create_task_spec_from_component_and_arguments`. Refactored `_dsl_bridge.create_container_op_from_task` to `_components._resolve_command_line_and_paths` which returns `_ResolvedCommandLineAndPaths`. Renamed `_dsl_bridge._create_container_op_from_resolved_task` to `_dsl_bridge._create_container_op_from_component_and_arguments`. The signature of `_components._resolve_graph_task` was changed and it now returns `_ResolvedGraphTask` instead of modified `TaskSpec`. 
Some of the component tests still expect ContainerOp and its attributes. These tests will be changed later. * Adapted the _python_op tests * Fixed linter failure I do not want to add any top-level kfp imports in this file to prevent circular references. * Added docstrings * Fixed the return type forward reference	2020-01-25 08:39:01 -08:00
Alexey Volkov	f39cbdca70	SDK - DSL - Stabilized the PipelineVolume names (#2794 ) The name no longer depends on unset parameters or the version of the Kubernetes package. Needed for https://github.com/kubeflow/pipelines/pull/2780 Fixes https://travis-ci.com/kubeflow/pipelines/jobs/270786161	2020-01-03 18:07:40 -08:00
Jiaxiao Zheng	358e26adb1	[SDK/compiler] Sanitize op name for PipelineParam (#2711 ) * sanitize op name for pipeline param * refactor sanitization to compiler level, and add unittest	2019-12-27 18:01:39 -08:00
Alexey Volkov	27f7e77356	SDK - Unified the function signature parsing implementations (#2689 ) * Replaced `_instance_to_dict(obj)` with `obj.to_dict()` * Fixed the capitalization in _python_function_name_to_component_name It now only changes the case of the first letter. * Replaced the _extract_component_metadata function with _extract_component_interface * Stopped adding newline to the component description. * Handling None inputs and outputs * Not including empty inputs and outputs in component spec * Renamed the private attributes that the @pipeline decorator sets * Changed _extract_pipeline_metadata to use _extract_component_interface * Fixed issues based on feedback	2019-12-27 10:05:40 -08:00
Ilias Katsakioris	4624ac817d	SDK/DSL: Fix PipelineVolume name length (#2739 ) * SDK/DSL: Fix PipelineVolume name length Volume name must be no more than 63 characters Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com> * Change which part of the hash value we make use of Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com>	2019-12-18 12:52:04 -08:00
Yuan (Bob) Gong	4a8d262abb	Migrate standalone deployment to workload identity on GCP (#2619 ) * Script to set up workload identity for standalone deployment * Migrate tests to run on standalone + workload identity * Fix test script * Switch to static GSAs for testing, because they have name length limit * Add workload identity binding for argo * Fix argo workload identity bindings * Remove user-gcp-sa from tests * Remove use_gcp_secret from xgboost sample * Allow debugging tests locally * Wait for policies to take effect * Update deploy-pipeline-lite.sh * Update deploy-pipeline-lite.sh * [WIP] test gcloud auth list with test-runner sa * Add namespace * test again * Use new image builder * test again * Remove debug code * Remove usages of use_gcp_secret * Fix unit test and tensorboard pod template * Add debug code again to test * Try waiting until workload identity bindings are ready * Fix some other samples * Fix parameterized tfx oss sample * Add retry to image building * Try fixing tfx oss sample * Fix compiled tfx oss sample * Update all google/cloud-sdk to latest * Try fixing parameterized tfx oss sample again * Also verify pipeline-runner ksa is working * Fix parameterized_tfx_oss sample * Update gcp-workload-identity-setup.sh * Revert unneeded change * Pin to new google/cloud-sdk * Remove wrongly committed binaries	2019-12-16 22:05:58 -08:00
Alexey Volkov	b8a2e6f400	SDK/Compiler - Preventing pipeline entrypoint template name from clashing with other template names (#1555 ) Case exhibiting the problem: ``` def add(a, b): ... @dsl.pipeline(name="add") def some_name(): add(...) ```	2019-12-05 18:08:49 -08:00
Niklas Hansson	88b4757d5b	SDK - Python support for arbitrary secret, similar to ".use_gcp_secret('user-gcp-sa')" (#2639 ) * added new secret support * updated the documentation and env settings * updated after feedback * added tests * naming issue fixed * renamed test to follow unittest standard * updated after feedback * the new test after renaming * added the test to main * updates after feedback * added license agreement * removed space * updated the volume named to be generated * secret_name as volume name and updated test * updated the file structure * fixed build	2019-12-03 12:00:59 -08:00
Jiaxiao Zheng	790fe99aca	[SDK] Relax k8s sanitization (#2634 ) * update * add allow_capital * fix * fix volume_ops sample * fix pipeline name sanitization * fix unittests * fix sanitization in _client.py * fix component output sanitization	2019-11-26 10:28:10 -08:00
Alexey Volkov	6eb00e7aec	SDK - Containers - Renamed constructor parameter in the private ContainerBuilder class (#2261 )	2019-11-07 15:54:27 -08:00
Alexey Volkov	d315bf654c	SDK - DSL - Deprecated ArtifactLocation (#2326 ) * SDK - DSL - Deprecated the per-task artifact_location * Removed artifact_location from the docstring * Deprecated ArtifactLocation	2019-11-05 19:12:59 -08:00
Alexey Volkov	1282f16335	SDK - Python components - Fixed bug when mixing file outputs with return value outputs (#2473 )	2019-10-23 19:45:05 -07:00
Alexey Volkov	681d873fc7	SDK - Components - Added type to graph input references (#2451 ) This makes the graph input references consistent with task output references. This is a breaking change, but the graph components are not exposed in the documentation or samples yet.	2019-10-23 17:03:05 -07:00
Alexey Volkov	4c24650e5f	SDK - Tests - Fixed most of the test warnings (#2336 )	2019-10-22 18:06:13 -07:00
Alexey Volkov	735e627a03	SDK - Refactoring - Split the K8sHelper class (#2333 ) * SDK - Refactoring - Split the K8sHelper class One part was only used by container builder and provided higher-level API over K8s Client. Another was used by the compiler and did not use the kubernetes library. * Updated the license year.	2019-10-21 14:57:22 -07:00
Alexey Volkov	fd6c756dd2	SDK - DSL - Make is_exit_handler unnecessary in ContainerOp (#2411 ) Fixed two broken tests. The tests did not have `is_exit_handler=True` which was required before this commit.	2019-10-16 13:26:15 -07:00
Alexey Volkov	f4d689b4ed	SDK - Python components - Fixed handling multiline decorators (#2345 ) * SDK - Python components - Fixed handling multiline decorators * Switched to using dedent * Added error checking * Testing multiline decorator * Test calling the component created from decorated function Also fixed `helper_test_component_against_func_using_local_call`.	2019-10-16 12:17:29 -07:00
Alexey Volkov	8025511c30	SDK - Added version (#2374 )	2019-10-14 15:35:51 -07:00
Alexey Volkov	1b6047aa69	SDK - Improve errors when ContainerOp.output is unavailable (#1578 ) * SDK - Improve errors when ContainerOp.output is unavailable ContainerOp.output is only available when there is only one output. Right now, when there are multiple outputs it just holds `None` instead of a task output reference. In this case however it's indistinguishable from just passing None argument. This PR gives a quick fix to make accessing the nonexistent `.output` a compile-time error. * Fixed the implementation and added tests * Trigger retests	2019-10-11 18:20:40 -07:00
Alexey Volkov	dc8cd7a8eb	SDK - Containers - Added support for container image cache (#2216 ) * SDK - Containers - Added support for container image cache This change makes `build_image_from_working_dir` fast when the working directory has not changed between invocations. We cache pushed container images using specially-calculated context directory hash as the cache key. * Moved the import to the top	2019-10-11 15:10:04 -07:00
Alexey Volkov	03da0a2cce	SDK - Tests - Test creating component from the real AutoML pipeline (#2314 ) * SDK - Tests - Test creating component from the real AutoML pipeline Creating component from the AutoML retail_product_stockout_prediction pipeline. * Ignoring flake8 error E821	2019-10-08 13:39:50 -07:00
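
Commit dc8cd7a8eb above describes caching pushed container images under a hash of the context directory. The snippet below is only a rough sketch of that general idea, not the SDK's actual implementation: `hash_directory`, `build_image_cached`, and the in-memory `_image_cache` are hypothetical names, and the real hash calculation in `build_image_from_working_dir` may differ.

```python
import hashlib
import os


def hash_directory(root):
    """Return a SHA-256 hex digest over relative file paths and file contents."""
    digest = hashlib.sha256()
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # visit subdirectories in a deterministic order
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            rel = os.path.relpath(path, root).replace(os.sep, "/")
            digest.update(rel.encode("utf-8"))
            digest.update(b"\0")  # separator between path and contents
            with open(path, "rb") as f:
                digest.update(f.read())
            digest.update(b"\0")  # separator between files
    return digest.hexdigest()


# Hypothetical in-memory cache: context directory hash -> image name.
_image_cache = {}


def build_image_cached(context_dir, build_fn):
    """Call build_fn(context_dir) only when the directory contents changed."""
    key = hash_directory(context_dir)
    if key not in _image_cache:
        _image_cache[key] = build_fn(context_dir)
    return _image_cache[key]
```

Here `build_fn` stands in for whatever actually builds and pushes the image; with an unchanged directory the second call returns the cached image name without invoking it.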


227 Commits