mirror of https://github.com/vllm-project/vllm.git
# Configuration Options
This section lists the most common options for running vLLM.
There are three main levels of configuration, from highest priority to lowest priority:
- [Request parameters][completions-api] and [input arguments][sampling-params]
- Engine arguments
- Environment variables
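
The precedence order above can be pictured as a layered lookup in which the first layer that defines a setting wins. The snippet below is a minimal sketch of that idea using only the Python standard library. It is not vLLM's actual resolution code, and the keys (`max_tokens`, `seed`, `log_level`) and the helper `resolve_option` are hypothetical names chosen for illustration.

```python
from collections import ChainMap

# Hypothetical settings for illustration only; these dictionaries stand in
# for the three configuration levels and are not real vLLM data structures.
env_vars = {"log_level": "INFO", "max_tokens": 16, "seed": 0}
engine_args = {"max_tokens": 64, "seed": 42}
request_params = {"max_tokens": 128}


def resolve_option(key, request_params, engine_args, env_vars):
    """Return the value of `key` from the highest-priority level that sets it."""
    # ChainMap searches its maps left to right, so the first map listed
    # (request parameters) takes priority over the ones after it.
    layers = ChainMap(request_params, engine_args, env_vars)
    return layers[key]


# Set at all three levels: the request-level value is used.
print(resolve_option("max_tokens", request_params, engine_args, env_vars))
# Set at the engine and environment levels: the engine-level value is used.
print(resolve_option("seed", request_params, engine_args, env_vars))
# Set only at the environment level: that value is used as the fallback.
print(resolve_option("log_level", request_params, engine_args, env_vars))
```

In this sketch a key missing from every level raises `KeyError`. A real system would more likely fall back to a built-in default.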