mirror of https://github.com/vllm-project/vllm.git
Configuration Options
This section lists the most common options for running vLLM.
There are three main levels of configuration, from highest priority to lowest priority:
- [Request parameters][completions-api] and [input arguments][sampling-params]
- Engine arguments
- Environment variables
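
The priority order above can be pictured as a layered lookup. The following is a minimal sketch of that idea only, not vLLM's actual resolution code: the function `resolve_option` and all option names and values (`max_tokens`, `seed`, `log_level`) are hypothetical and chosen purely for illustration.

```python
from collections import ChainMap


def resolve_option(name, request_params, engine_args, env_vars):
    """Return the value for `name` from the highest-priority layer that sets it.

    ChainMap searches its mappings left to right, so the order of the
    arguments encodes the priority: request > engine > environment.
    Returns None if no layer defines the option.
    """
    layers = ChainMap(request_params, engine_args, env_vars)
    return layers.get(name)


# Hypothetical settings at each level (illustrative names, not real vLLM options).
request_params = {"max_tokens": 64}
engine_args = {"max_tokens": 256, "seed": 0}
env_vars = {"seed": 1, "log_level": "INFO"}

for option in ("max_tokens", "seed", "log_level"):
    print(option, "=", resolve_option(option, request_params, engine_args, env_vars))
```

In this sketch a value set on the request shadows the same option at the engine level, and an engine-level value shadows the environment. In practice the three levels largely cover different sets of options, so the dedicated pages for each level remain the place to check which options exist where.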