vllm/docs/contributing/model/README.md

886 B

title
Adding a New Model

{ #new-model }

This section provides more information on how to integrate a PyTorch model into vLLM.

Contents:

!!! note The complexity of adding a new model depends heavily on the model's architecture. The process is considerably straightforward if the model shares a similar architecture with an existing model in vLLM. However, for models that include new operators (e.g., a new attention mechanism), the process can be a bit more complex.

!!! tip If you are encountering issues while integrating your model into vLLM, feel free to open a GitHub issue or ask on our developer slack. We will be happy to help you out!