|
…
|
||
|---|---|---|
| .. | ||
| README.md | ||
| entry_point.sh | ||
| nvidia-device-plugin-ds.yaml | ||
README.md
Setting up AKS vLLM stack with one command
This script automatically configures an AKS LLM inference cluster.
Installing Prerequisites
Before running this setup, ensure you have:
TL;DR
Set up
[!CAUTION] This script requires cloud resources and will incur costs. Please make sure all resources are shut down properly.
To run the service, go to the deployment_on_cloud/azure/ folder and run the following command:
cd deployment_on_cloud/azure/
./entry_point.sh setup
Clean up
To clean up the service, run the following command:
cd deployment_on_cloud/azure/
./entry_point.sh cleanup