vllm/examples
Maximilien de Bayser 799397ee4f
Support embedding models in V1 (#16188)
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <maxdebayser@gmail.com>
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by: 22quinn <33176974+22quinn@users.noreply.github.com>
2025-06-18 21:36:33 -07:00
..
offline_inference Support embedding models in V1 (#16188) 2025-06-18 21:36:33 -07:00
online_serving [V1][P/D] An native implementation of xPyD based on P2P NCCL (#18242) 2025-06-18 06:32:36 +00:00
others [Bugfix] Update the example code, make it work with the latest lmcache (#19453) 2025-06-11 12:42:20 +00:00
pyproject.toml Convert `examples` to `ruff-format` (#18400) 2025-05-26 16:57:54 +00:00
template_alpaca.jinja
template_baichuan.jinja
template_chatglm.jinja
template_chatglm2.jinja
template_chatml.jinja
template_dse_qwen2_vl.jinja [Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 (#9944) 2024-11-13 08:28:13 +00:00
template_falcon.jinja
template_falcon_180b.jinja
template_inkbot.jinja
template_teleflm.jinja [Model] Support Tele-FLM Model (#15023) 2025-03-22 02:04:44 -07:00
template_vlm2vec.jinja [Frontend] Use a proper chat template for VLM2Vec (#9912) 2024-11-01 14:09:07 +00:00
tool_chat_template_deepseekr1.jinja Add DeepSeek-R1-0528 function call chat template (#18874) 2025-06-04 13:24:18 +00:00
tool_chat_template_deepseekv3.jinja [Feature] Support DeepSeekV3 Function Call (#17784) 2025-05-12 00:45:21 -07:00
tool_chat_template_granite.jinja Change granite chat template to keep json list formatting for tool calls (#10452) 2024-11-19 18:16:54 -07:00
tool_chat_template_granite_20b_fc.jinja [Model] tool calling support for ibm-granite/granite-20b-functioncalling (#8339) 2024-10-29 15:07:37 -07:00
tool_chat_template_hermes.jinja [Bugfix] Fix Hermes tool call chat template bug (#8256) 2024-09-07 10:49:01 +08:00
tool_chat_template_internlm2_tool.jinja [Frontend][Feature] support tool calling for internlm/internlm2_5-7b-chat model (#8405) 2024-10-04 10:36:39 +08:00
tool_chat_template_llama3.1_json.jinja [Bugfix][Frontend] Update Llama Chat Templates to also support Non-Tool use (#10164) 2024-11-23 10:17:38 +08:00
tool_chat_template_llama3.2_json.jinja [Misc] Update llama 3.2 template to support system prompt with images (#10901) 2024-12-05 05:54:06 +00:00
tool_chat_template_llama3.2_pythonic.jinja [Frontend] Fix typo in tool chat templates for llama3.2 and toolace (#14501) 2025-04-06 07:44:36 +00:00
tool_chat_template_llama4_json.jinja Add chat template for Llama 4 models (#16428) 2025-04-24 20:19:36 +00:00
tool_chat_template_llama4_pythonic.jinja [Frontend][Bug Fix] Update llama4 pythonic jinja template and llama4_pythonic parser (#17917) 2025-05-22 16:44:08 -07:00
tool_chat_template_mistral.jinja [Feature] OpenAI-Compatible Tools API + Streaming for Hermes & Mistral models (#5649) 2024-09-04 13:18:13 -07:00
tool_chat_template_mistral3.jinja [Bugfix] Fix tool call template validation for Mistral models (#17644) 2025-05-08 09:47:19 +00:00
tool_chat_template_mistral_parallel.jinja [Bugfix] example template should not add parallel_tool_prompt if tools is none (#9007) 2024-10-03 03:04:17 +00:00
tool_chat_template_phi4_mini.jinja [Frontend] Add Phi-4-mini function calling support (#14886) 2025-03-31 22:50:05 -07:00
tool_chat_template_toolace.jinja [Frontend] Fix typo in tool chat templates for llama3.2 and toolace (#14501) 2025-04-06 07:44:36 +00:00