model-runner/pkg/inference
Piotr Stankiewicz 64c85dcd83 inference: Support disabling pre-pull memory checks
Signed-off-by: Piotr Stankiewicz <piotr.stankiewicz@docker.com>
2025-08-22 10:15:03 +02:00
..
backends inference: Support memory estimation for remote models 2025-08-22 10:15:03 +02:00
config Respect context size from model config 2025-06-27 09:35:14 -06:00
memory inference: Block pull if model requires too much memory to run 2025-08-22 10:15:03 +02:00
models inference: Support disabling pre-pull memory checks 2025-08-22 10:15:03 +02:00
scheduling inference: Support memory estimation for remote models 2025-08-22 10:15:03 +02:00
api.go Move prefix paths to inference package 2025-03-28 17:53:12 -06:00
backend.go inference: Support memory estimation for remote models 2025-08-22 10:15:03 +02:00
cors.go Improve CORS config 2025-06-03 12:35:59 +03:00