Examples for building and running LLM services and applications locally with Podman
Go to file
Platform Engineering Bot 63a83a7398
fix(deps): update auto merged updates (#880)
Signed-off-by: Platform Engineering Bot <platform-engineering@redhat.com>
2025-06-09 08:44:36 +03:00
.devcontainer Update Auto merged updates 2024-04-30 04:00:52 +02:00
.github/workflows update models build & push workflow 2024-12-03 19:28:10 -05:00
assets feat: add node.js function calling recipe with langgraph 2025-03-10 08:25:10 -04:00
ci debug workflows otel trace 2024-06-25 12:17:35 -04:00
convert_models chore(deps): update auto merged updates 2024-10-10 23:43:45 +00:00
data add PDF reading to summarizer 2024-04-22 11:20:58 -04:00
eval chore(deps): update auto merged updates 2024-09-23 00:04:21 +00:00
hooks hooks: pre-commit hook to duplicate file 2024-08-07 12:24:24 +02:00
model_servers fix:object_detection_python build failure (#871) 2025-05-19 16:07:11 +02:00
models chore(deps): update auto merged updates 2024-09-23 00:04:21 +00:00
recipes fix(deps): update auto merged updates (#880) 2025-06-09 08:44:36 +03:00
training cloud gcp mkdir handle existing directory 2025-01-28 16:21:42 +02:00
vector_dbs fix(deps): update auto merged updates 2024-12-12 10:54:01 +00:00
.gitattributes Add multimodal support (#45) 2024-03-13 09:52:59 -04:00
.gitignore Documentation updates & .gitignore add 2025-01-06 15:25:59 -05:00
CODEOWNERS chore: add jeffmaury as code owner 2025-03-14 15:10:39 +01:00
CONTRIBUTING.md fix: restore backend as an array for compatibility 2024-08-14 14:25:59 +02:00
LICENSE add apache license 2024-04-13 15:39:51 -04:00
README.md hooks: pre-commit hook to duplicate file 2024-08-07 12:24:24 +02:00
ailab-images.md Remove training images from ailab-images.md 2024-06-27 12:01:27 -04:00
install-hooks.sh hooks: pre-commit hook to duplicate file 2024-08-07 12:24:24 +02:00
renovate.json changes based on feedback 2024-06-11 12:06:30 -04:00
requirements-test.txt fix(deps): update auto merged updates 2024-12-12 10:54:01 +00:00

README.md

AI Lab Recipes

This repo contains recipes for building and running containerized AI and LLM Applications with Podman.

These containerized AI recipes can be used to help developers quickly prototype new AI and LLM based applications locally, without the need for relying on any other externally hosted services. Since they are already containerized, it also helps developers move quickly from prototype to production.

Model servers

What's a model server?

A model server is a program that serves machine-learning models, such as LLMs, and makes their functions available via an API. This makes it easy for developers to incorporate AI into their applications. This repository provides descriptions and code for building several of these model servers.

Many of the sample applications rely on the llamacpp_python model server by default. This server can be used for various generative AI applications with various models. However, each sample application can be paired with a variety of model servers.

Learn how to build and run the llamacpp_python model server by following the llamacpp_python model server README.

Current Recipes

Recipes consist of at least two components: A model server and an AI application. The model server manages the model, and the AI application provides the specific logic needed to perform some specific task such as chat, summarization, object detection, etc.

There are several sample applications in this repository that can be found in the recipes directory.

They fall under the categories:

Learn how to build and run each application by visiting their README's. For example, learn how to run the chatbot recipe here.

Current AI Lab Recipe images built from this repository

Images for many sample applications and models are available in quay.io. All currently built images are tracked in ailab-images.md

Training

Linux Operating System Bootable containers enabled for AI Training

Setting Up Git Hooks

To install our standard git hooks, run the following command:

./install-hooks.sh

pre-commit hook

Ensures that training/ilab-wrapper/ilab is duplicated into training/nvidia-bootc/duplicated/ilab-wrapper/ilab