docs/model-runner.md at cdcf7ef9458e6b06f232cc5de91bfd748875aad8

2.5 KiB

Raw Blame History

title

description

keywords

weight

params

Use Docker Model Runner

Learn how to integrate Docker Model Runner with Docker Compose to build AI-powered applications

compose, docker compose, model runner, ai, llm, artificial intelligence, machine learning

111

sidebar

badge

color	text
green	New

Docker Model Runner can be integrated with Docker Compose to run AI models as part of your multi-container applications.
This lets you define and run AI-powered applications alongside your other services.

Prerequisites

Docker Compose v2.35 or later
Docker Desktop 4.41 or later
Docker Desktop for Mac with Apple Silicon or Docker Desktop for Windows with NVIDIA GPU
Docker Model Runner enabled in Docker Desktop

Provider services

Compose introduces a new service type called provider that allows you to declare platform capabilities required by your application. For AI models, you can use the model type to declare model dependencies.

Here's an example of how to define a model provider:

services:
  chat:
    image: my-chat-app
    depends_on:
      - ai-runner

  ai-runner:
    provider:
      type: model
      options:
        model: ai/smollm2

Notice the dedicated provider attribute in the ai-runner service.
This attribute specifies that the service is a model provider and lets you define options such as the name of the model to be used.

There is also a depends_on attribute in the chat service.
This attribute specifies that the chat service depends on the ai-runner service.
This means that the ai-runner service will be started before the chat service to allow injection of model information to the chat service.

How it works

During the docker compose up process, Docker Model Runner automatically pulls and runs the specified model.
It also sends Compose the model tag name and the URL to access the model runner.

This information is then passed to services which declare a dependency on the model provider.
In the example above, the chat service receives 2 environment variables prefixed by the service name:

AI-RUNNER_URL with the URL to access the model runner
AI-RUNNER_MODEL with the model name which could be passed with the URL to request the model.

This lets the chat service to interact with the model and use it for its own purposes.

Reference

Docker Model Runner documentation

2.5 KiB Raw Blame History

Prerequisites

Provider services

How it works

Reference

2.5 KiB

Raw Blame History