vLLM/vllm - vllm - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Russell Bryant	6d0df0ebeb	[Docs] Generate correct github links for decorated functions (#17125 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-04-24 10:39:43 -07:00
Harry Mellor	0422ce109f	Add `:markdownhelp:` to `EngineArgs` docs so markdown docstrings render properly (#17124 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-24 10:28:45 -07:00
Eyshika Agarwal	47bdee409c	Molmo Requirements (#17026 ) Signed-off-by: Eyshika Agarwal <eyshikaengineer@gmail.com> Signed-off-by: eyshika <eyshikaengineer@gmail.com>	2025-04-24 10:08:37 -07:00
Atilla	49f189439d	existing torch installation pip command fix for docs (#17059 )	2025-04-24 10:07:21 -07:00
wang.yuqi	67309a1cb5	[Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970 )	2025-04-24 07:06:28 -07:00
omer-dayan	2bc0f72ae5	Add docs for runai_streamer_sharded (#17093 ) Signed-off-by: Omer Dayan (SW-GPU) <omer@run.ai> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-04-24 01:03:21 -07:00
Reid	9c1244de57	[doc] update to hyperlink (#17096 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-24 00:58:08 -07:00
Reid	db2f8d915c	[V1] Update structured output (#16812 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-23 23:57:17 -07:00
Harry Mellor	2c8ed8ee48	More informative error when using Transformers backend (#16988 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-23 19:54:03 -07:00
Michael Yao	f7912cba3d	[Doc] Add top anchor and a note to quantization/bitblas.md (#17042 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-23 07:32:16 -07:00
Reid	eb8ef4224d	[doc] add download path tips (#17013 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-23 04:06:30 +00:00
Lei Wang	8d32dc603d	[Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036 ) Signed-off-by: xinyuxiao <xinyuxiao2024@gmail.com> Co-authored-by: xinyuxiao <xinyuxiao2024@gmail.com>	2025-04-22 09:01:36 +01:00
Michael Yao	3097ce3a32	[Doc] Update ai_accelerator/hpu-gaudi.inc.md (#16956 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-22 05:33:27 +00:00
Cyrus Leung	29f395c97c	[Doc] Remove unnecessary V1 flag (#16924 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-21 21:04:38 -04:00
David Xia	f728ab8e35	[Doc] mention how to install in CPU editable mode (#16923 ) Signed-off-by: David Xia <david@davidxia.com>	2025-04-21 17:45:51 +00:00
David Xia	63e26fff78	[doc] install required python3-dev apt package (#16888 ) Signed-off-by: David Xia <david@davidxia.com>	2025-04-21 16:15:18 +00:00
Yan Ma	fe3462c774	[XPU][Bugfix] minor fix for XPU (#15591 ) Signed-off-by: yan ma <yan.ma@intel.com>	2025-04-22 00:02:57 +08:00
Alex Brooks	b34f33438a	[Doc] Split dummy_processor_inputs() in Multimodal Docs (#16915 ) Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>	2025-04-21 11:10:01 +00:00
Reid	d6195a748b	[doc] update hyperlink (#16877 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-19 16:40:38 +00:00
Roger Wang	5124f5bf51	[Model] Qwen2.5-Omni Cleanup (#16872 )	2025-04-19 09:37:02 +00:00
Isotr0py	83f3c3bd91	[Model] Refactor Phi-4-multimodal to use merged processor and support V1 (#15477 ) Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-19 02:26:11 -07:00
Nicolò Lucchesi	2ef0dc53b8	[Frontend] Add sampling params to `v1/audio/transcriptions` endpoint (#16591 ) Signed-off-by: Jannis Schönleber <joennlae@gmail.com> Signed-off-by: NickLucche <nlucches@redhat.com> Co-authored-by: Jannis Schönleber <joennlae@gmail.com>	2025-04-19 07:03:54 +00:00
Yang Fan	2c1bd848a6	[Model][VLM] Add Qwen2.5-Omni model support (thinker only) (#15130 ) Signed-off-by: fyabc <suyang.fy@alibaba-inc.com> Signed-off-by: Roger Wang <ywang@roblox.com> Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com> Co-authored-by: Roger Wang <ywang@roblox.com> Co-authored-by: Xiong Wang <wangxiongts@163.com>	2025-04-18 23:14:36 -07:00
Justin Ho	490b1698a5	[Doc] Updated Llama section in tool calling docs to have llama 3.2 config info (#16857 ) Signed-off-by: jmho <jaylenho734@gmail.com>	2025-04-18 23:28:53 +00:00
Michael Yao	26507f8973	[Docs] Fix a link and grammar issue in production-stack.md (#16809 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-18 06:42:58 +00:00
Nathan Weinberg	9c1d5b456d	[Doc] add podman setup instructions for official image (#16796 ) Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-04-18 06:10:49 +00:00
Harry Mellor	e78587a64c	Improve-mm-and-pooler-and-decoding-configs (#16789 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-17 22:13:32 -07:00
Cyrus Leung	c16fb5dae8	[Doc] Improve help examples for `--compilation-config` (#16729 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 21:22:34 -07:00
Mark McLoughlin	e4755f7fac	[V1][Metrics] Fix http metrics middleware (#15894 )	2025-04-17 19:52:18 +00:00
Insu Kim	7c02d6a137	[Doc] Changed explanation of generation_tokens_total and prompt_tokens_total counter type metrics to avoid confusion (#16784 ) Signed-off-by: insukim1994 <insu.kim@moreh.io>	2025-04-17 14:10:08 +00:00
wang.yuqi	11c3b98491	[Doc] Document Matryoshka Representation Learning support (#16770 )	2025-04-17 13:37:37 +00:00
Cyrus Leung	dbe7f07001	[Doc] Make sure to update vLLM when installing latest code (#16781 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 06:53:31 -06:00
Reid	c69bf4ee06	fix: hyperlink (#16778 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-17 11:34:20 +00:00
Michael Yao	207da28186	[Doc] Fix a 404 link in installation/cpu.md (#16773 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-17 10:46:21 +00:00
intervitens	5b1aca2ae3	[Bugfix] Fix GLM4 model (#16618 ) Signed-off-by: intervitens <intervitens@tutanota.com>	2025-04-17 03:35:07 -07:00
Reid	d8e557b5e5	[doc] add open-webui example (#16747 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-17 18:27:32 +08:00
Cyrus Leung	61a44a0b22	[Doc] Add more tips to avoid OOM (#16765 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 09:54:34 +00:00
Harry Mellor	3cd91dc955	Help user create custom model for Transformers backend remote code models (#16719 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-17 01:05:59 +00:00
xsank	ee378f3d49	[Model] support modernbert (#16648 ) Signed-off-by: 唯勤 <xsank.mz@alibaba-inc.com> Co-authored-by: 唯勤 <xsank.mz@alibaba-inc.com>	2025-04-16 05:30:15 -07:00
Cyrus Leung	facbe2a114	[Doc] Improve OOM troubleshooting (#16704 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-16 18:29:48 +08:00
Shinichi Hemmi	3badb0213b	[Model] Add PLaMo2 (#14323 ) Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com> Signed-off-by: shemmi <shemmi@preferred.jp> Co-authored-by: Kento Nozawa <nzw0301@preferred.jp> Co-authored-by: Hiroaki Mikami <mhiroaki@preferred.jp> Co-authored-by: Calvin Metzger <metzger@preferred.jp>	2025-04-15 19:31:30 -07:00
Angky William	fdcb850f14	[Misc] Enable vLLM to Dynamically Load LoRA from a Remote Server (#10546 ) Signed-off-by: Angky William <angkywilliam@Angkys-MacBook-Pro.local> Co-authored-by: Angky William <angkywilliam@Angkys-MacBook-Pro.local>	2025-04-15 22:31:38 +00:00
courage17340	b1308b84a3	[Model][VLM] Add Kimi-VL model support (#16387 ) Signed-off-by: courage17340 <courage17340@163.com>	2025-04-14 21:41:48 +00:00
Cyrus Leung	d9fc8cd9da	[V1] Enable multi-input by default (#15799 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-12 08:52:39 +00:00
Ye (Charlotte) Qi	802329dee9	[Doc] Update Llama4 Model Names in Supported Models (#16509 ) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-04-12 02:53:10 +00:00
Christian Sears	c09632a66c	Update openai_compatible_server.md (#16507 ) Signed-off-by: Christian Sears <csears@redhat.com>	2025-04-11 22:54:58 +00:00
Ye (Charlotte) Qi	16eda8c43a	[Frontend] Added chat templates for LLaMa4 pythonic tool calling (#16463 ) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by: Kai Wu <kaiwu@meta.com>	2025-04-12 06:26:17 +08:00
Isotr0py	5285589f37	[Doc] Document InternVL3 support (#16495 ) Signed-off-by: Isotr0py <2037008807@qq.com>	2025-04-11 19:41:09 +00:00
Michael Goin	ed37599544	Update supported_hardware.md for TPU INT8 (#16437 )	2025-04-11 12:28:07 +08:00
Cyrus Leung	83b824c8b4	[VLM] Remove `BaseProcessingInfo.get_mm_max_tokens_per_item` (#16408 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-10 09:06:58 -07:00

1 2 3 4 5 ...

900 Commits