* Ensure restart policy is Never or OnFailure
Always is not a valid restart policy for Jobs, which are expected to run to completion.
* Manage launcher through k8s Job
The controller still tracks the Running status of the Job's pods.
* Add failure reason for the launcher Pod
* Allow running MPI applications as non-root
Adds the spec field sshAuthMountPath for MPIJob.
The init script sets the permissions and ownership based on the securityContext of the launcher Pod.
* Add pure MPI sample that runs as non-root
* Do inter-pod communication through SSH
The controller generates keys and mounts them to the containers. The container images must know how to place the credentials and set file permissions.
* Use init-container instead of entrypoint
* Fix scheme for recorder and defaults
* Add integration tests for v2 controller