vllm/docs/source/contributing/model/index.md

825 B

(new-model)=

Adding a New Model

This section provides more information on how to integrate a PyTorch model into vLLM.

:::{toctree} :caption: Contents :maxdepth: 1

basic registration tests multimodal :::

:::{note} The complexity of adding a new model depends heavily on the model's architecture. The process is considerably straightforward if the model shares a similar architecture with an existing model in vLLM. However, for models that include new operators (e.g., a new attention mechanism), the process can be a bit more complex. :::

:::{tip} If you are encountering issues while integrating your model into vLLM, feel free to open a GitHub issue or ask on our developer slack. We will be happy to help you out! :::