pipelines/components
Kartik Kalamadi b3d8e04e1e
[AWS SageMaker] Print SageMaker job logs in kfp UI (#3954)
* Print logs for AWS SM Componenets on KFP UI

* address comments

* update version number to 0.5.0

* update yaml to version 0.5.0

* update changelog
2020-06-19 00:33:58 -07:00
CatBoost Components - CatBoost (#3844) 2020-06-17 00:28:40 -07:00
XGBoost Components - XGBoost (#3843) 2020-05-27 02:10:42 -07:00
_converters/ApacheParquet Components - Apache Parquet converters (#3834) 2020-05-28 20:17:15 -07:00
arena Make wget quieter (#2069) 2019-09-09 14:32:54 -07:00
aws [AWS SageMaker] Print SageMaker job logs in kfp UI (#3954) 2020-06-19 00:33:58 -07:00
datasets/Chicago_Taxi_Trips Datasets - Added Chicago Taxi Trips dataset (#3775) 2020-05-27 01:24:40 -07:00
deprecated Components - Moved TFX components to deprecated directory (#3854) 2020-05-26 16:28:41 -07:00
diagnostics/diagnose_me quick fix for quota list (#3075) 2020-02-14 09:18:18 -08:00
filesystem Components - Filesystem (#2659) 2019-11-27 11:43:03 -08:00
gcp Update Dataflow component _client.py to use projects.locations.templates API endpoint (#3851) 2020-06-11 21:35:55 -07:00
git/clone Components - Git clone (#2658) 2019-11-26 20:29:19 -08:00
google-cloud/storage Components - Google Cloud Storage (#2532) 2019-11-07 18:06:19 -08:00
ibm-components Update Watson ML example to take output param path (#3316) 2020-03-20 22:10:44 -07:00
kubeflow Kfserving component service account option (#3852) 2020-05-28 18:39:15 -07:00
local Release ad9bd5648d (#3560) 2020-04-22 14:00:15 -07:00
nuclio add nuclio components (to build/deploy, delete, invoke functions) (#1295) 2019-05-08 01:58:33 -07:00
pandas/Transform_DataFrame Components - Pandas - Transform_DataFrame (#3985) 2020-06-17 12:18:05 -07:00
presto/query add presto pipeline component (#3261) 2020-03-14 17:32:34 -07:00
sample/keras/train_classifier SDK - Hiding Argo's workflow.uid placeholder behind DSL (#1683) 2019-10-07 18:33:11 -07:00
tensorflow/tensorboard/prepare_tensorboard Components - Tensorboard visualization (#3760) 2020-05-13 18:32:22 -07:00
OWNERS add jiaxiao to the component owners (#2804) 2020-01-07 12:48:18 -08:00
README.md move old gcp components to deprecated folder (#2031) 2019-09-06 16:29:20 -07:00
build_image.sh common build image script (#815) 2019-02-13 10:37:19 -08:00
license.sh Initial commit of the kubeflow/pipeline project. 2018-11-02 14:02:31 -07:00
pipeline_component_repository.yaml Components - Added the pipeline component repository marker file (#3883) 2020-06-01 03:08:10 -07:00
release-in-place.sh [Release] Automate release script for all the changes (#3777) 2020-06-03 08:44:18 -07:00
release.sh [Release] Automate release script for all the changes (#3777) 2020-06-03 08:44:18 -07:00
test_load_all_components.sh Testing - Handling spaces in paths in the component file loading test (#3856) 2020-05-27 02:10:58 -07:00
third_party_licenses.csv chore: fix subprocess32 license url (#4018) 2020-06-18 03:30:04 -07:00

README.md

Kubeflow pipeline components

Kubeflow pipeline components are implementations of Kubeflow pipeline tasks. Each task takes one or more artifacts as input and may produce one or more artifacts as output.

Example: XGBoost DataProc components

Each task usually includes the following parts:

Client code: The code that talks to endpoints to submit jobs. For example, code that talks to the Google Dataproc API to submit a Spark job.

Runtime code: The code that does the actual job and usually runs in the cluster. For example, Spark code that transforms raw data into preprocessed data.

Container: A container image that runs the client code.
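The split between client code and runtime code can be illustrated with a minimal sketch. Everything here is hypothetical: `FakeJobService` stands in for a remote endpoint such as a cluster API, and `run_task` plays the role of the client code that submits a job and waits for it. Neither name comes from this repository or from any real SDK.

```python
import time


class FakeJobService:
    """Hypothetical in-memory stand-in for a remote job endpoint."""

    def __init__(self, polls_until_done=2):
        self._polls_until_done = polls_until_done
        self._remaining = {}
        self._next_id = 0

    def submit(self, spec):
        # A real endpoint would start the runtime code in the cluster here.
        self._next_id += 1
        job_id = f"job-{self._next_id}"
        self._remaining[job_id] = self._polls_until_done
        return job_id

    def state(self, job_id):
        # Report RUNNING for a fixed number of polls, then DONE.
        if self._remaining[job_id] > 0:
            self._remaining[job_id] -= 1
            return "RUNNING"
        return "DONE"


def run_task(service, spec, poll_seconds=0.0):
    """Client code: submit the job, then poll until the service reports DONE."""
    job_id = service.submit(spec)
    while service.state(job_id) != "DONE":
        time.sleep(poll_seconds)
    return job_id


# The spec keys below are illustrative only.
finished = run_task(FakeJobService(), {"main": "transform.py"})
print(finished)
```

In a real component, the container image would run something like `run_task`, while the job it submits (the runtime code) executes in the cluster.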

Note the naming convention for client code and runtime code. For a task named "mytask":

  • The mytask.py program contains the client code.
  • The mytask directory contains all the runtime code.
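The naming convention above can be expressed as a small helper. This is a sketch only: `component_layout` is a hypothetical function written for illustration, not part of this repository or the Kubeflow Pipelines SDK.

```python
from pathlib import Path


def component_layout(task_name, root=Path(".")):
    """Return the paths the naming convention implies for a task.

    Hypothetical helper: <root>/<task>.py holds the client code and
    <root>/<task>/ holds the runtime code.
    """
    return {
        "client_code": root / f"{task_name}.py",
        "runtime_code": root / task_name,
    }


layout = component_layout("mytask")
print(layout["client_code"])   # mytask.py
print(layout["runtime_code"])  # mytask
```

A check such as `layout["client_code"].is_file()` could then confirm that a component directory follows the convention.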

See how to use the Kubeflow Pipelines SDK and build your own components.