Commit Graph

152 Commits

Author SHA1 Message Date
youkaichao 2c17a9456a
rename and update date (#60) 2025-07-01 10:28:50 +08:00
qscqesze 1506191318
add minimax-m1 doc (#59)
* update minimax-m1.md

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* change title

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* Update _posts/2025-06-26-minimax-m1.md

* Update _posts/2025-06-26-minimax-m1.md

* Update _posts/2025-06-26-minimax-m1.md

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

---------

Signed-off-by: qingjun <qingjun@minimaxi.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-07-01 10:16:33 +08:00
youkaichao 71c0bbb4cd
hardware plugin blog post (#57)
* Add hardware plugin blog post

Signed-off-by: MengqingCao <cmq0113@163.com>

* rename to updated date

Signed-off-by: youkaichao <youkaichao@gmail.com>

* minor update

Signed-off-by: youkaichao <youkaichao@gmail.com>

* update next step and acknowledgement

Signed-off-by: Mengqing Cao <cmq0113@163.com>

* rename the team

Signed-off-by: youkaichao <youkaichao@gmail.com>

* fix typo

Signed-off-by: youkaichao <youkaichao@gmail.com>

* fix typo

Signed-off-by: youkaichao <youkaichao@gmail.com>

---------

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Mengqing Cao <cmq0113@163.com>
Co-authored-by: MengqingCao <cmq0113@163.com>
2025-05-14 14:40:54 +08:00
Harry Mellor 32a83da437
Update link to Transformers backend docs (#56)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-25 14:37:08 +01:00
youkaichao 800bfb147e
Add OpenRLHF blog (#54)
* init

* fix

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* fix

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* fix

* update

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* update

* minor fix

Signed-off-by: youkaichao <youkaichao@gmail.com>

* minor fix

Signed-off-by: youkaichao <youkaichao@gmail.com>

* update

* update

* rename files

Signed-off-by: youkaichao <youkaichao@gmail.com>

---------

Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: jianh <jianh@nvidia.com>
Co-authored-by: hijkzzz <janhu9527@gmail.com>
2025-04-24 15:13:04 +08:00
Aritra Roy Gosthipaty dcfdf596c1
[Add] Blog post on transformers backend integration with vLLM (#50)
* add transformers backend blog post

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

OK

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

* Apply suggestions from code review

Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

* Update _posts/2025-04-11-transformers-backend.md

Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

OK

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

* Update _posts/2025-04-11-transformers-backend.md

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

---------

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-16 12:31:00 +01:00
Lucia Fang 447290f5c1
remove tips for attn_temperature_tuning in llama4 blog (#51)
Since we auto-enable this with max-model-len > 32 in PR https://github.com/vllm-project/vllm/pull/16439, this tip can be removed to avoid confusion.
2025-04-15 11:17:59 -07:00
Lucia Fang e4a43dab00
remove attn_temperature_tuning in default user guide (#49)
Signed-off-by: Lu Fang <fanglu@fb.com>
2025-04-08 12:03:41 +01:00
Lucia Fang 3eb4d4d737
Update llama4 documentation to use the right settings for long context (#48) 2025-04-07 13:08:47 -07:00
Simon Mo de34b1ce54
llama4 post (#47)
* llama4 post

Signed-off-by: simon-mo <simon.mo@hey.com>

* 8 images

Signed-off-by: simon-mo <simon.mo@hey.com>

* comments

Signed-off-by: simon-mo <simon.mo@hey.com>

* charlotte edits

Signed-off-by: simon-mo <simon.mo@hey.com>

* remove dup paragraphs

Signed-off-by: Roger Wang <ywang@roblox.com>

* add multimodal

Signed-off-by: Roger Wang <ywang@roblox.com>

* update model id

Signed-off-by: Roger Wang <ywang@roblox.com>

* point to 0.8.3 branch

Signed-off-by: Roger Wang <ywang@roblox.com>

---------

Signed-off-by: simon-mo <simon.mo@hey.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
2025-04-05 23:44:57 -07:00
TJian f5100370f3
add ptpc-fp8 amd blogpost (#45)
Signed-off-by: tanpinsiang <pinsiang.tan@embeddedllm.com>
Co-authored-by: tanpinsiang <pinsiang.tan@embeddedllm.com>
2025-03-20 17:22:42 +00:00
Harry Mellor 4d264ee9a0
Update installation doc URLs (#40)
Follow up to https://github.com/vllm-project/vllm/pull/14556.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-03-10 10:43:02 -07:00
Jiaxin Shan 57e3d78db2
Update figure size for aibrix release blog (#36)
Signed-off-by: Jiaxin Shan <seedjeffwan@gmail.com>
2025-02-21 14:31:58 -08:00
Jiaxin Shan 2c056850c0
Add aibrix release blog post (#35)
Signed-off-by: Jiaxin Shan <seedjeffwan@gmail.com>
2025-02-21 14:02:48 -08:00
Guspan Tanadi 2ae1f8efa2
docs(2025-01-21-stack-release): related repo links (#30) 2025-02-21 11:19:17 +00:00
Murali Andoorveedu a2efb9d767
Add distributed inference blog post (#27)
* Add distributed inference blog post

Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>

* Update _posts/2025-02-17-distributed-inference.md

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

---------

Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-02-19 17:29:26 +00:00
Guspan Tanadi d9c2a8e934
Fix links format Installation (#26)
* Fix links format Installation

* Update _posts/2024-09-05-perf-update.md

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

---------

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-02-18 10:04:23 +00:00
Yuan Tang 3d55f5c5cf
Mention the need to request access to Llama models in 2025-01-27-intro-to-llama-stack-with-vllm.md (#23) 2025-02-06 23:54:40 +00:00
Harry Mellor ff274c498a
Fix documentation link in V1 blog 2025-01-30 15:05:30 +00:00
Yuan Tang 288580d57f
Update 2025-01-27-intro-to-llama-stack-with-vllm.md 2025-01-29 12:19:18 -05:00
Yuan Tang cfd9e3faf4
Correct inference provider config for K8s deployment 2025-01-27-intro-to-llama-stack-with-vllm.md 2025-01-29 12:01:21 -05:00
Yuan Tang 1146ced6d0
Update 2025-01-27-intro-to-llama-stack-with-vllm.md 2025-01-27 14:49:59 -05:00
Simon Mo d28e88240d
Merge pull request #15 from Hanchenli/main
Adding production-stack post
2025-01-27 11:37:38 -08:00
Simon Mo c56753c0df
Merge pull request #18 from hmellor/fix-dead-links
Fix dead links to installation docs
2025-01-27 11:37:23 -08:00
Simon Mo 03662292d6
Merge pull request #16 from terrytangyuan/llama-stack
New blog: Introducing vLLM Inference Provider in Llama Stack
2025-01-27 11:37:16 -08:00
WoosukKwon 41638ef1c5 Add thumbnail to V1 blog post
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-27 08:15:55 -08:00
Harry Mellor 59933df72b Fix dead links to installation docs
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-01-27 13:08:50 +00:00
WoosukKwon 8f9835f21c Change date
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-27 00:06:46 -08:00
WoosukKwon 4ff97a8f9d Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 22:50:18 -08:00
WoosukKwon ca87df9e47 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 21:56:49 -08:00
WoosukKwon eef62fac45 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 21:43:55 -08:00
WoosukKwon e83a5be9ff Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:34:12 -08:00
WoosukKwon f90c18079b Change data
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:11:25 -08:00
WoosukKwon 4b78497736 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:10:46 -08:00
WoosukKwon 9548a39acc minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:09:05 -08:00
WoosukKwon e056d8e892 txt
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:08:39 -08:00
WoosukKwon 1aef324e23 Add Qwen2 fig
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 19:29:27 -08:00
WoosukKwon 30d4e438ca Add prefix caching
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 18:42:15 -08:00
WoosukKwon 35c5a5eee5 WIP
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 18:16:22 -08:00
WoosukKwon 7e26b212bd Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 17:17:10 -08:00
WoosukKwon 80f404ac8b mv
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:52:45 -08:00
WoosukKwon 90ac189fc1 fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:52:26 -08:00
WoosukKwon e890f28cea Fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:36:04 -08:00
WoosukKwon 02d5e058ff Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:20:20 -08:00
WoosukKwon d2343c442d Add links to github profiles
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:15:41 -08:00
WoosukKwon 45b7987946 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:07:18 -08:00
Yuan Tang e95f52795f
Update 2025-01-27-intro-to-llama-stack-with-vllm.md 2025-01-25 11:19:46 -05:00
Yuan Tang 4b3bc6dc25
Move image to assets
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-25 11:15:15 -05:00
Yuan Tang 5ea205d8d8
Minor edits
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 22:39:50 -05:00
Yuan Tang c8f485f16d
v0.1.0 with fix #879
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 20:49:08 -05:00