examples/digit_recognition
Joseph Olaide 639f84a9d6
Kaggle notebook to kfp pipeline (#940)
* Create README.md

* kaggle to kfp

* Create README.md

* Update README.md

* Add files via upload

* Update README.md

* Update README.md

* Update README.md

* kaggle to kfp
2022-04-26 05:46:11 +00:00
..
data Kaggle notebook to kfp pipeline (#940) 2022-04-26 05:46:11 +00:00
README.md Kaggle notebook to kfp pipeline (#940) 2022-04-26 05:46:11 +00:00
digit-recognizer-kfp-pipeline.ipynb Kaggle notebook to kfp pipeline (#940) 2022-04-26 05:46:11 +00:00

README.md

Objective

Here we convert the https://www.kaggle.com/competitions/digit-recognizer code to kfp-pipeline The objective of this task is tois to correctly identify digits from a dataset of tens of thousands of handwritten images.

Testing environment

Name version
Kubeflow v1
kfp 1.8.11

Kfp version used for testing can be installed as pip install kfp==1.8.11

Components used

kubeflow lightweight component method

Here, a python function is created to carry out a certain task and the python function is passed inside kfp component methodcreate_component_from_func.

Kubeflow pipelines

Kubeflow pipelines connect each components according to how they were passed and creates a pipeline. The kfp dsl.pipeline method was used to create a pipeline function. The kkfp component method InputPath and OutputPath was used to pass data amongst component.

Finally, the create_run_from_pipeline_func was used to submit pipeline directly from pipeline function

To create pipeline

  1. Navigate to data directory, download compressed kaggle data and put your training.zip and test.zip data in the data folder.
  2. Open your setup kubeflow cluster and create a notebook server and connect to it.
  3. Clone this repo and navigate to this directory
  4. run the kfp-digit-recognizer notebook from start to finish
  5. View run details immediately after submitting pipeline.