pipelines

Commit Graph

Author	SHA1	Message	Date
Chen Sun	2f19a26ffd	chore(sdk): Format all Python files under SDK folder. (#6501 ) * Reformat sdk only using the new yapf config. * Reformat docstrings using docformatter. * update golden files to resolve diff caused by whitespaces * fix some tests * format .py files under sdk/python/tests using yapf * additional docformatter * fix some tests	2021-09-03 11:25:11 -07:00
Joshua Carp	f43ecc223d	chore(sdk): Import mock from stdlib and drop dependency. (#6456 ) * Import mock from stdlib and drop dependency. * Drop mock from requirements. h/t @chensun	2021-08-31 16:38:44 -07:00
Yaqi Ji	937cacd4ce	feat(sdk): add default schema_version to pipeline (#6366 ) * feat(sdk): add default schema_version to pipeline * sync api for go * Fix tests and address comments * Bump pipeline_spec version * Fix v1 tests * rebase to master * sync api for go * Fix tests and address comments * Bump pipeline_spec version * Fix v1 tests	2021-08-24 01:04:39 -07:00
Ajay Gopinathan	f3f383c2ff	chore(sdk): Refactor and move all v2 related code to under the v2 namespace. (#6358 ) * Refactor and move all v2 related code to under the v2 namespace. Most of the changes are around imports and restructuring of the codebase. While it looks like a lot of code was added, most of the code already existed and was simply moved or copied over to v2. The only exceptions are: - under kfp/v2/components/component_factory.py: some helper functions were copied with simplification from _python_op.py - we no longer strip the `_path` suffix in v2 components. Note: there is still some duplication of code (particularly between component_factory.py and _python_op.py), but it's ok for now since we intend to replace some of this with v2 ComponentSpec + BaseComponent. * Update setup.py. * update tests. * revert accidental change of gcpc * Fix component entrypoint. * Update goldens. * fix tests. * fix merge conflict. * revert gcpc change. * fix tests. * fix tests. * Add type aliases for moved files. * merge and update goldens.	2021-08-17 19:25:37 -07:00
Chen Sun	434e5c3489	fix(sdk): block dsl.importer usage in KFP OSS. Fixes: #6323 (#6330 ) * block dsl.importer in KFP OSS * address cr comments	2021-08-16 01:02:06 -07:00
Chen Sun	7559e27cfb	feat(backend/sdk): Rename `pipeline-output-directory` to `pipeline-root`. Fixes #6307 (#6329 )	2021-08-13 00:41:54 -07:00
Chen Sun	d48792c373	test(sdk): restore a v2-compatible unit test (#6263 )	2021-08-09 11:18:23 -07:00
Niklas Hansson	e6becd71ff	feat(sdk): add GPU runtime resource request and fix spelling in runtime_resouce_request. Fixes #4877 . Fixes #1252 (#5972 ) * Add runtime resource request for GPUs * clean up * Updated docks and add check * updated with test * remove from branch * run tests * fix gpu vendor format * Update after feedback * add unit tet * remove integration test * clean up * Clean up * Updated to resource_constraints instead of resource	2021-08-01 22:52:38 -07:00
Chen Sun	f4c6631e51	chore: release KFP SDK and v2 launcher 1.6.6 (#6125 ) * release 1.6.6 * skip failing UT	2021-07-23 14:49:39 -07:00
Chen Sun	b200e1bc7d	fix(sdk): Fix URI placeholder in v2 compatible mode. (#6040 ) * fix uri placeholder in v2 compatible mode * fix tests * fix path generation * fix tests * fix test * cleanup * clean up * fix test * fix test * fix test	2021-07-21 08:32:50 -07:00
Yuan (Bob) Gong	ee663d9593	chore(v2): standardize MLMD data model. Fixes #5669 (#6054 ) * chore(v2): standardize MLMD data model * change context type to system namespace * update sdk snapshots * fix go v2 tests * update * update v2 compat snapshots * fix all samples * fix must specify pipeline root * add artifact display name * add UI rendering of new fields * fix sample tests * let ui read artifact and execution names consistently * fix samples * fix frontend tests * fix sample test * fix last sample * address feedback	2021-07-19 22:26:15 -07:00
Yaqi Ji	c6cb8acf7a	feat(sdk): Add interface for enable_caching at task level. (#6007 ) * feat(sdk): Add pipeline level caching options * Update golden * fixing caching options * fix tests * add a caching disabled case Co-authored-by: Chen Sun <chensun@users.noreply.github.com>	2021-07-13 17:33:58 -07:00
Joe Liedtke	ade34542e0	chore: Updates argoproj/argo URLs to argoproj/argo-workflows (#5969 ) * Updt argoproj/argo URLs to argoproj/argo-workflows * Update link to workflows.ts * Update license.txt to reduce # of changed lines * Revert changes to backend Dockerfile & license.txt * Update license.txt, keep line endings	2021-07-06 21:52:20 -07:00
Yuan (Bob) Gong	8a256db1bf	feat(sdk/dsl/compiler): dsl-compile --mode flag to turn on V2_COMPATIBLE, defaults to KF_PIPELINES_COMPILER_MODE env var. Fixes #5840 (#5952 ) * feat(sdk/dsl/compiler): support --mode flag which can turn on v2 compatible mode * override compiler default mode using KF_PIPELINES_COMPILER_MODE env var * update V1_LEGACY to V1 * add unit tests * address feedback * clean up * cleanup again * use absl.testing.parameterized for table driven tests * update	2021-07-03 02:22:49 -07:00
Niklas Hansson	5db843102a	feat(sdk): add runtime resource requests. Fixes #1956 (#5447 ) * added resource request at runtime * fixed things * Update to use read only parameter insteadt * added test case and better example * Updated again * add the validation * add to the test suit * work in progress * update after feedback * fix the test * clean up * clean up * fix the path * add the test again * clean up * fix tests * feedback fix * comment out and clean up	2021-06-10 16:27:59 -07:00
capri-xiyue	6717434978	[SDK] Add pod labels for telemetry purpose (#5582 ) * Add pod labels for telemetry purpose * fixed test * added sdk label in pods * added sdk type label * fixed test * added UT back * updated UT	2021-05-05 10:43:27 -07:00
Alexey Volkov	cc83e1089b	Assigned copyright to the project authors (#5587 )	2021-05-05 13:53:22 +08:00
Ilias Katsakioris	a12e88d1da	feat(sdk): Enable setting OwnerReference on ResourceOps. Fixes #1779 (#4831 ) Argo supports a field in the ResourceTemplate that makes the controller add an owner reference of the workflow to the created resource since v2.4.0 [1]. With the upgrade of Argo client [2] and deployment [3] we are now able to exploit it. We set it to 'false' by default on all ResourceOps (actually, leave it empty). Setting the field to 'true' for VolumeOps allows the garbage collection of PVCs upon workflow cleanup [4]. [1] https://github.com/argoproj/argo/blob/v2.4.0/pkg/apis/workflow/v1alpha1/workflow_types.go#L1044-L1045 [2] https://github.com/kubeflow/pipelines/pull/4498 [3] https://github.com/kubeflow/pipelines/pull/3537 [4] https://github.com/kubeflow/pipelines/issues/1779 Signed-off-by: Ilias Katsakioris <elikatsis@arrikto.com>	2021-03-12 06:45:24 -08:00
Tommy Li	9708a2ae0f	fix(sdk/tests) fix parallelfor_item_argument_resolving compiler test, Fixes #5270 (#5271 ) * fix parallelfor_item_argument testdata typo * remove parentheses instead of brackets	2021-03-11 14:06:24 -08:00
Ajay Gopinathan	83eded130c	feat(sdk): Introduce experimental v2-compatibility in KFP SDK (#5218 ) * WIP: Enable v2 compatibility in KFP SDK compiler. * First pass clean up * Clean up and introduce enum instead of boolean for execution mode. * More cleanup * Clean up and add comments. * Undo formatting changes. * Undo formatting changes. * Add method to unconditionally add kfp pod env. * minor formatting change. * Update docstrings. * Undo formatting changes. * fix imports. * fix pod_env tests * rebased. * undo format changes. * Undo compiler changes.: * format _default_transformers.py for consistency * Fix various rebasing issues. * fix bug referring to pipeline_name/pipeline_root in v1 pipelines. * revert output dir name. * allow both types of attributes for pipeline root. * fix pod env yaml golden. * fix for input/output uri tests. * Add v2 compatible compiler test. * Use ordereddict to fix flaky golden file tests. * Address PR comments. * Address PR comments. * Address PR comments.	2021-03-08 15:40:23 -08:00
StefanoFioravanzo	fa135fc0bc	feat(sdk): Support backoffs in retry strategy (#5060 ) * feat(sdk): Support backoffs in retry strategy Signed-off-by: Stefano Fioravanzo <stefano@arrikto.com> * Add Optional type hint Signed-off-by: Stefano Fioravanzo <stefano@arrikto.com>	2021-03-03 03:59:48 -08:00
Jiaxiao Zheng	846423a870	feat(sdk): Always add pipeline root as a pipeline parameter (#5122 ) * refactor pipeline root passing * fix test	2021-02-10 16:29:57 -08:00
Chen Sun	051a022937	feat(sdk.v2): Allow set pipeline_root via @dsl.pipeline decorator. Make pipeline_root optional. (#5107 ) * Allow set pipeline_root via @dsl.pipeline decorator. * test covering pipeline_root not set	2021-02-07 02:06:32 -08:00
Jiaxiao Zheng	85a3b51713	feat(sdk): Add v2 component to build_python_component (#5079 ) * porting the original PR * comment * refactor * remove python2 * comment on default entrypoint * update comment * min versioned KFP * fix tests	2021-02-04 01:50:36 -08:00
Vitalii Vokhmin	2f1db59798	fix(sdk): compile ParallelFor in a deterministic manner (#4926 ) * fix(sdk): compile ParallelFor in a deterministic manner During compilataion ParallelFor components end up with randomized names, which makes it very inconvenient to compare two versions of a pipeline. This commit fixes this issue. * fix(sdk): fix new parallel-for test cases	2021-01-29 18:31:09 -08:00
Michalina Kotwica	ce985bc287	fix(sdk): Allow keyword-only arguments in pipeline function signature (#4544 ) * add test for keyword-only arguments in pipeline func * fix: kwargs-only argument for pipeline func * test: kwargs generate same yaml as args * remove whole metadata * assert -> self.assertEqual * programmatic example --> fixed example * same name for both Co-authored-by: Alexey Volkov <alexey.volkov@ark-kun.com>	2021-01-29 18:31:02 -08:00
Jiaxiao Zheng	a36a62a700	feat(sdk): Artifact metadata related placeholder for components. (#5003 ) * resolve comments. * fix tests * wip: add structures and skeleton for component resolution logic * add generator * fix the problem * cleanup * add a test * fix tests	2021-01-19 08:57:45 -08:00
Alexey Volkov	691eefc599	fix(sdk): Components - Fixed python components that use \n. Fixes #4939 (#4993 ) * SDK - Components - Fixed python components that use \n The escape sequence was being replaced by the `echo` command. Apparently, unlike in the `bash` shell, the `echo` command of the `sh` shell expands the escape sequences by default and does not support an option to turn it off. (For some reason the -n option works properly even though it should not). Fixes https://github.com/kubeflow/pipelines/issues/4939 * Fixed the test data * Fixed the deprecated container component builder * Fixed the new compiler test case * Added test	2021-01-14 18:21:51 -08:00
radcheb	5633b9abda	fix(sdk): fixes unresolved PipelineParam when static list passed to dsl.ParallelFor. Fixes #4890 (#4891 ) * fix parallelfor compiling items + add tests * remove debug print * fix tests * fix parallelfor_pipeline_param_in_items_resolving test * debug test * fix tests * Revert "debug test" This reverts commit `57451143bd`. * fix tests	2021-01-14 00:09:03 -08:00
Jiaxiao Zheng	7540ba5c3b	feat(sdk): Implements artifact URI placeholder. (#4932 ) * add placeholder to spec * add output_directory to pipeline * respect uri placeholder in file outputs * wip: add data passing rewriting logic to respect the uri semantics * merge input_uri and paths when instantiating ContainerOp * fix * fix workflow rewriting * Add topology rewriting * add a test case, and various fixes * make the test case more complex * Fix the case when working with OpsGroup * Fix test case * fix resolving test * fix redundant cmd lines * fix redundant cmd lines * resolve comments * fix file outputs * resolve comments * copy file outputs instead of modifying inplace.	2021-01-05 20:39:51 -08:00
Kenta Onishi	5a4b70e37c	feat(sdk): Add settings of the dnsConfig field. Fixes #4836 (#4837 ) * feat(sdk): Add settings of the dnsConfig field. Fixes #4836 * feat(sdk): Add dnsConfig example and sample. * feat(sdk): Refactor dnsConfig param. * feat(sdk): Refactor dnsConfig param.	2020-12-14 20:05:49 -08:00
Alexey Volkov	7a66414cf7	feat(sdk): Components - Restored stack traces in lightweight python components. Fixes #4273 , #4849 (#4861 ) Currently were running the python code inline using `python -c <code>`. This has two issues: 1) Python does not show source code line in exception stack traces 2) inspect.getsource does not work. This method is used in PyTorch JIT for example. We solve these issues by writing the code into a file before executing it. The disadvantage of the new approach is that it adds complexity, a filesystem write operation and also requires the `sh` executable to be present (we could replace it with python-based program if needed).	2020-12-14 14:33:49 -08:00
Vitalii Vokhmin	2f3a686e54	feat(sdk): add ability to set retry policy (#4858 ) * feat(sdk): add ability to set retry policy This fixes the second part of the issue described in #4333 The first part was addressed in #4392 * feat(sdk): validate retry policy name * feat(sdk): simplify retry policy interface	2020-12-11 14:47:29 -08:00
Alexey Volkov	f7874d38ff	fix(sdk): Compiler - Fixed pipeline parameters with empty default values (#4552 ) Fixes https://github.com/kubeflow/pipelines/issues/4549	2020-11-12 15:52:28 -08:00
Chen Sun	5020fd1079	compiler for IR (#4529 ) * Compile IR proto in setup.py * compile to IR * Fix importer node logic and lint * cleanup and lint * merge, undo setup.py change * cleanup and lint * remove currently unused code * format _component_bridge.py * cleanup and format * cleanup * upgrade protobuf in test * restructure and test * address review comments * fix bug * avoid f-strings formatting * address review comments * address review comments * limit the primitive types to only int, double, and string. * Fix test for python3.5 * use instance_schema instead of schema_title * add v2 to setup.py * address review comments * move the tests closer to the code * add more tests * cleanup and linting * add more tests * fix bug on input paramter connection * linting * restructure tests * fix python3.5 test failure * support outputs.parameters placeholder * remove pipeline decorator from v2.dsl	2020-10-13 17:13:54 -07:00
Alexey Volkov	e8fb58a221	feat(sdk): Preserve parameter arguments and input names (#4563 ) ContainerOp has no concept of inputs, so it looses any information about them such as input names and in some cases even the passed argument values (which are just injected into the command line). This commit fixes that issue by preserving the paramater arguments map and ultimately storing it in an Argo template annotation. Fixes https://github.com/kubeflow/pipelines/issues/4556	2020-10-11 20:32:48 -07:00
Alexey Volkov	1aa8068507	fix(sdk): DSL - Enabled arbitrary ContainerOp names (#4554 ) Fixes https://github.com/kubeflow/pipelines/issues/4522	2020-09-29 05:21:35 -07:00
Niklas Hansson	c32ea232d5	feat(compiled): set pod disruption budget for pipelines. Fixes #3877 (#4178 ) * Update _client.py * Update _client.py * added pod disruption budget * clean up * Update sdk/python/kfp/dsl/_pipeline.py * fixed parameter * updated after feedback * removed selector	2020-09-14 13:45:26 -07:00
Victor	22b7b99a8b	fix(sdk): Fix opsgroups dependency resolution (#4370 )	2020-08-27 09:03:53 -07:00
Alexey Volkov	d0b799e4a9	fix(sdk): SDK - Avoiding deprecated ContainerOp methods (#4134 ) Switched from `task.set_X` to `task.container.set_X`	2020-07-14 17:02:37 -07:00
Alexey Volkov	db0af86e53	feat(sdk): SDK - Enable placeholders in task display names. Fixes #4163 (#4164 )	2020-07-09 18:42:35 -07:00
Alexey Volkov	48889a99d1	fix(sdk): Compiler - Fixed input artifact name sanitization when using raw string arguments. Fixes #4110 (#4120 )	2020-07-08 10:43:09 -07:00
Alexey Volkov	229eff2516	SDK - Compiler - Removed the deprecated dsl-compile --package command (#4055 )	2020-07-01 19:12:01 -07:00
Alexey Volkov	6960366846	fix(sdk): Compiler - Fixed the input argument mapping when using dsl.graph_component. Fixes #3915 (4082) * SDK - Compiler - Fixed the input argument mapping when using dsl.graph_component Fixes https://github.com/kubeflow/pipelines/issues/3915 * Stopped relying on the argument order at all This can make the compilation less fragile.	2020-06-29 02:31:37 -07:00
Alexey Volkov	d24eb78371	test(sdk) Restored the ParallelFor compiler test data (4103) * SDK - Tests - Restored the ParallelFor compiler test data Fixes https://github.com/kubeflow/pipelines/issues/4102 * Removed the pipeline-sdk-type annotations * Fixed the test_artifact_passing_using_volume test data	2020-06-29 01:30:14 -07:00
Jiaxiao Zheng	b099c6f5d3	chore: Rollback telemetry related changes (4088) * Revert "fix length (#3934)" This reverts commit `7fbb7cae` * Revert "[SDK] Add first party component label (#3861)" This reverts commit `1e2b9d4e` * Revert "[SDK] Add pod labels for telemetry purpose. (#3578)" This reverts commit `aa8da64b`	2020-06-27 15:46:14 -07:00
Alexey Volkov	54a596abd8	SDK - Compiler - Added support for volume-based data passing (3371) * SDK - Compiler - Added support for volume-based data passing Currently artifact passing is performed by Argo sidecar containers what download input data and upload output data to artifact repository (usually, S3-compatible blob storage like Minio). The performance of this method is not optimal and it requires that pod disks have enough capacity to hold all artifact data. This commit adds support for volume-based data passing. This method involves using a single milti-write Kubernetes data volume to pass all intermediate data. Parts of the volume are mounted to the input/output artifact directories, so when the user program reads and writes files, the files actually reside in the data volume. This method improves the performance and reduces storage resource requirements. The data volume must exist and support "READ_WRITE_MANY". Limitations: * All artifact file names must be the same (e.g. "data"). All auto-generated paths are already consistent. Avoid using any hard-coded paths. * Passing constant values (text) as arguments for artifact inputs is not supported. * The feature is experimental. * Added data_passing_methods.KubernetesVolume This class represents a configured volume-based artifact passing method. * Added PipelineConf.data_passing_method This property allows setting the method that will be used for intermediate data passing. Added the compiler support for the new feature. Example: ```python from kfp.dsl import PipelineConf, data_passing_methods from kubernetes.client.models import V1Volume, V1PersistentVolumeClaim pipeline_conf = PipelineConf() pipeline_conf.data_passing_method = data_passing_methods.KubernetesVolume( volume=V1Volume( name='data', persistent_volume_claim=V1PersistentVolumeClaim('data-volume'), ), path_prefix='artifact_data/', ) ``` * Added unit test * Fixed bug in the unit test Kubernetes does not validate the structures at all... * Fixed bug in the result structure * Fixed the test data The class should be V1PersistentVolumeClaimVolumeSource, not V1PersistentVolumeClaimSpec. * Fixed the test	2020-06-25 16:11:31 -07:00
Alexey Volkov	ceb860c594	SDK - Components - Python - Switched the default base image to python 3.7 (4054) Previously the default image was set to an old version of tensorflow image. That image is now outdated. It's also framework-specific and pretty big. We're switching to the official python image which is small, official and framework-agnostic. The users can easily switch to the old behavior by just specifying `base_image='tensorflow/tensorflow:1.13.2-py3'` during the component creation.	2020-06-25 15:15:31 -07:00
Alexey Volkov	f773b9c263	SDK - Components - Stabilize JSON serialization by sorting keys (#3879 ) * SDK - Components - Stabilize JSON serialization by sorting keys Otherwise serialization of the default values of the component/pipeline inputs is unstable on Python 3.5. * Fixed the test data	2020-06-01 03:07:55 -07:00
Jiaxiao Zheng	1e2b9d4e7e	[SDK] Add first party component label (#3861 ) * add OOB component dict and utility function * add test * add a transformer, which appends the component name label * add transformer function, compiler and test * move telemetry test * fix none uri * applies comments * revert dependency on frozendict * fixes some tests * resolve comments	2020-05-29 08:55:16 -07:00

1 2 3 4

174 Commits