offline_inference
Support embedding models in V1 ( #16188 )
2025-06-18 21:36:33 -07:00
online_serving
[V1][P/D] An native implementation of xPyD based on P2P NCCL ( #18242 )
2025-06-18 06:32:36 +00:00
others
[Bugfix] Update the example code, make it work with the latest lmcache ( #19453 )
2025-06-11 12:42:20 +00:00
pyproject.toml
Convert `examples` to `ruff-format` ( #18400 )
2025-05-26 16:57:54 +00:00
template_alpaca.jinja
…
template_baichuan.jinja
…
template_chatglm.jinja
…
template_chatglm2.jinja
…
template_chatml.jinja
…
template_dse_qwen2_vl.jinja
[Model] Adding Support for Qwen2VL as an Embedding Model. Using MrLight/dse-qwen2-2b-mrl-v1 ( #9944 )
2024-11-13 08:28:13 +00:00
template_falcon.jinja
…
template_falcon_180b.jinja
…
template_inkbot.jinja
…
template_teleflm.jinja
[Model] Support Tele-FLM Model ( #15023 )
2025-03-22 02:04:44 -07:00
template_vlm2vec.jinja
[Frontend] Use a proper chat template for VLM2Vec ( #9912 )
2024-11-01 14:09:07 +00:00
tool_chat_template_deepseekr1.jinja
Add DeepSeek-R1-0528 function call chat template ( #18874 )
2025-06-04 13:24:18 +00:00
tool_chat_template_deepseekv3.jinja
[Feature] Support DeepSeekV3 Function Call ( #17784 )
2025-05-12 00:45:21 -07:00
tool_chat_template_granite.jinja
Change granite chat template to keep json list formatting for tool calls ( #10452 )
2024-11-19 18:16:54 -07:00
tool_chat_template_granite_20b_fc.jinja
[Model] tool calling support for ibm-granite/granite-20b-functioncalling ( #8339 )
2024-10-29 15:07:37 -07:00
tool_chat_template_hermes.jinja
[Bugfix] Fix Hermes tool call chat template bug ( #8256 )
2024-09-07 10:49:01 +08:00
tool_chat_template_internlm2_tool.jinja
[Frontend][Feature] support tool calling for internlm/internlm2_5-7b-chat model ( #8405 )
2024-10-04 10:36:39 +08:00
tool_chat_template_llama3.1_json.jinja
[Bugfix][Frontend] Update Llama Chat Templates to also support Non-Tool use ( #10164 )
2024-11-23 10:17:38 +08:00
tool_chat_template_llama3.2_json.jinja
[Misc] Update llama 3.2 template to support system prompt with images ( #10901 )
2024-12-05 05:54:06 +00:00
tool_chat_template_llama3.2_pythonic.jinja
[Frontend] Fix typo in tool chat templates for llama3.2 and toolace ( #14501 )
2025-04-06 07:44:36 +00:00
tool_chat_template_llama4_json.jinja
Add chat template for Llama 4 models ( #16428 )
2025-04-24 20:19:36 +00:00
tool_chat_template_llama4_pythonic.jinja
[Frontend][Bug Fix] Update llama4 pythonic jinja template and llama4_pythonic parser ( #17917 )
2025-05-22 16:44:08 -07:00
tool_chat_template_mistral.jinja
[Feature] OpenAI-Compatible Tools API + Streaming for Hermes & Mistral models ( #5649 )
2024-09-04 13:18:13 -07:00
tool_chat_template_mistral3.jinja
[Bugfix] Fix tool call template validation for Mistral models ( #17644 )
2025-05-08 09:47:19 +00:00
tool_chat_template_mistral_parallel.jinja
[Bugfix] example template should not add parallel_tool_prompt if tools is none ( #9007 )
2024-10-03 03:04:17 +00:00
tool_chat_template_phi4_mini.jinja
[Frontend] Add Phi-4-mini function calling support ( #14886 )
2025-03-31 22:50:05 -07:00
tool_chat_template_toolace.jinja
[Frontend] Fix typo in tool chat templates for llama3.2 and toolace ( #14501 )
2025-04-06 07:44:36 +00:00