9 Commits

Author SHA1 Message Date
Yuki Iwai ef9d049daa
Introduce debian bookworm (#661)
* Upgrade debian version to bookworm

Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>

* Add explicit verification that all ranks reached the final phase in the pi example

Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>

---------

Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
2024-10-15 15:25:17 +00:00
royliang 54160b0f7c
Fix invalid link for horovod cpu-only example Dockerfile (#601) 2023-11-05 20:24:13 +00:00
royliang f03139471b
Fix invalid link for horovod cpu-only example (#600) 2023-11-04 16:01:22 +00:00
Mateusz Kubica 21f326d1d2
MPICH support (#562)
* Add support for MPICH

* Fix CI errors

* Temporary: manual trigger

* Fix file name

* Add an empty line at the end of the file

* Fix formatting

* Revert "Temporary: manual trigger"

This reverts commit 15164a8b70.

* fix formatting

* Regenerate the mpi-operator.yaml

* Adding an empty line at the end of Dockerfiles

* Share the same entrypoint for Intel and MPICH

* share hostfile generation between Intel and MPICH

* Add validation test for MPICH

* Fix formatting

* Don't over engineer the tests - be explicit

* add non-root tests for IntelMPI and MPICH
2023-06-16 17:57:36 +00:00
R0CKSTAR edd9559fb0
Replace https://bootstrap.pypa.io/get-pip.py with https://bootstrap.pypa.io/pip/3.6/get-pip.py in horovod example Dockerfile (#554)
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
2023-05-05 12:36:23 +00:00
Aldo Culquicondor 651ee6769f
Add versioned labels to images (#545) 2023-04-05 18:56:02 +00:00
Carlos Eduardo Arango Gutierrez c7ca541451
Fix broken E2E test (#455)
* Fix broken E2E test

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>

* Add missing dependencies

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>
2022-02-12 00:51:10 +00:00
Carlos Eduardo Arango Gutierrez 3f808b1c59
Organize examples folder by api compatibility (#451)
* MV base Dockerfiles to build folder, they are not examples

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>

* Consolidate tensorflow-benchmarks under v2beta1

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>

* Move pi demo under v2beta1

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>

* MV mxnet examples under examples/v1

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>

* MV horovod and tensorflow examples under the compatible API

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>

* Update Makefile after reorg of examples folder

Signed-off-by: Carlos Eduardo Arango Gutierrez <carangog@redhat.com>
2022-02-07 23:42:42 +00:00
Aldo Culquicondor b9141c0540
Preparing release of v0.3.0 (#414)
Also
- Updated Makefile to use new version
- extra notes for developers
2021-09-01 08:04:45 -07:00