* Adds Makefile for local development * Fix chat completions example request * Added delete example * Dockerize model-runner * WIP Run container with host access to socket * Dockerize model-runner * WIP Run container with host access to socket * Debugging * Run in Docker container with TCP port access * mounted model storage * - Remove duplication in .gitignore - Do not use alpine in builder image - NVIDIA seems to use Ubuntu in all of their CDI docs and produces Ubuntu tags for nvidia/cuda but not Debian. So use Ubuntu for our final image For more details: https://github.com/docker/model-runner/pull/22 * - Add MODELS_PATH environment variable to configure model storage location - Default to $HOME/.docker/models when MODELS_PATH is not set - Update Docker container to use /models as the default storage path - Update Makefile to pass MODELS_PATH to container - Update Dockerfile to create and set permissions for /models directory This change allows users to: - Override the model storage location via MODELS_PATH - Maintain backward compatibility with default $HOME/.docker/models path - Use a more idiomatic folder for /models * Removes unneeded logs |
||
|---|---|---|
| .. | ||
| backends | ||
| models | ||
| scheduling | ||
| api.go | ||
| backend.go | ||