youkaichao
2c17a9456a
rename and update date (#60)
2025-07-01 10:28:50 +08:00
qscqesze
1506191318
add minimax-m1 doc (#59)
* update minimax-m1.md
Signed-off-by: qingjun <qingjun@minimaxi.com>
* update
Signed-off-by: qingjun <qingjun@minimaxi.com>
* update
Signed-off-by: qingjun <qingjun@minimaxi.com>
* update
Signed-off-by: qingjun <qingjun@minimaxi.com>
* update
Signed-off-by: qingjun <qingjun@minimaxi.com>
* change title
Signed-off-by: qingjun <qingjun@minimaxi.com>
* update
Signed-off-by: qingjun <qingjun@minimaxi.com>
* update
Signed-off-by: qingjun <qingjun@minimaxi.com>
* Update _posts/2025-06-26-minimax-m1.md
* Update _posts/2025-06-26-minimax-m1.md
* Update _posts/2025-06-26-minimax-m1.md
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
---------
Signed-off-by: qingjun <qingjun@minimaxi.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-07-01 10:16:33 +08:00
youkaichao
71c0bbb4cd
hardware plugin blog post (#57)
* Add hardware plugin blog post
Signed-off-by: MengqingCao <cmq0113@163.com>
* rename to updated date
Signed-off-by: youkaichao <youkaichao@gmail.com>
* minor update
Signed-off-by: youkaichao <youkaichao@gmail.com>
* update next step and acknowledgement
Signed-off-by: Mengqing Cao <cmq0113@163.com>
* rename the team
Signed-off-by: youkaichao <youkaichao@gmail.com>
* fix typo
Signed-off-by: youkaichao <youkaichao@gmail.com>
* fix typo
Signed-off-by: youkaichao <youkaichao@gmail.com>
---------
Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Mengqing Cao <cmq0113@163.com>
Co-authored-by: MengqingCao <cmq0113@163.com>
2025-05-14 14:40:54 +08:00
Harry Mellor
32a83da437
Update link to Transformers backend docs (#56)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-25 14:37:08 +01:00
youkaichao
800bfb147e
Add OpenRLHF blog (#54)
* init
* fix
* update
* update
* update
* update
* update
* update
* update
* update
* update
* Update _posts/2025-04-18-openrlhf-vllm.md
Co-authored-by: youkaichao <youkaichao@gmail.com>
* fix
* Update _posts/2025-04-18-openrlhf-vllm.md
Co-authored-by: youkaichao <youkaichao@gmail.com>
* Update _posts/2025-04-18-openrlhf-vllm.md
Co-authored-by: youkaichao <youkaichao@gmail.com>
* fix
* update
* Update _posts/2025-04-18-openrlhf-vllm.md
Co-authored-by: youkaichao <youkaichao@gmail.com>
* update
* minor fix
Signed-off-by: youkaichao <youkaichao@gmail.com>
* minor fix
Signed-off-by: youkaichao <youkaichao@gmail.com>
* update
* update
* rename files
Signed-off-by: youkaichao <youkaichao@gmail.com>
---------
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: jianh <jianh@nvidia.com>
Co-authored-by: hijkzzz <janhu9527@gmail.com>
2025-04-24 15:13:04 +08:00
Aritra Roy Gosthipaty
dcfdf596c1
[Add] Blog post on transformers backend integration with vLLM (#50)
* add transformers backend blog post
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
OK
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
* Apply suggestions from code review
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
* Update _posts/2025-04-11-transformers-backend.md
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
OK
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
* Update _posts/2025-04-11-transformers-backend.md
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
---------
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-16 12:31:00 +01:00
Lucia Fang
447290f5c1
remove tips for attn_temperature_tuning in llama4 blog (#51)
Since we auto-enable this with max-model-len > 32 in PR https://github.com/vllm-project/vllm/pull/16439, this tip can be removed to avoid confusion.
2025-04-15 11:17:59 -07:00
Lucia Fang
e4a43dab00
remove attn_temperature_tuning in default user guide (#49)
Signed-off-by: Lu Fang <fanglu@fb.com>
2025-04-08 12:03:41 +01:00
Lucia Fang
3eb4d4d737
Update llama4 documentation to use the right settings for long context (#48)
2025-04-07 13:08:47 -07:00
Simon Mo
de34b1ce54
llama4 post (#47)
* llama4 post
Signed-off-by: simon-mo <simon.mo@hey.com>
* 8 images
Signed-off-by: simon-mo <simon.mo@hey.com>
* comments
Signed-off-by: simon-mo <simon.mo@hey.com>
* charlotte edits
Signed-off-by: simon-mo <simon.mo@hey.com>
* remove dup paragraphs
Signed-off-by: Roger Wang <ywang@roblox.com>
* add multimodal
Signed-off-by: Roger Wang <ywang@roblox.com>
* update model id
Signed-off-by: Roger Wang <ywang@roblox.com>
* point to 0.8.3 branch
Signed-off-by: Roger Wang <ywang@roblox.com>
---------
Signed-off-by: simon-mo <simon.mo@hey.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
2025-04-05 23:44:57 -07:00
TJian
f5100370f3
add ptpc-fp8 amd blogpost (#45)
Signed-off-by: tanpinsiang <pinsiang.tan@embeddedllm.com>
Co-authored-by: tanpinsiang <pinsiang.tan@embeddedllm.com>
2025-03-20 17:22:42 +00:00
Harry Mellor
4d264ee9a0
Update installation doc URLs (#40)
Follow up to https://github.com/vllm-project/vllm/pull/14556.
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-03-10 10:43:02 -07:00
Jiaxin Shan
57e3d78db2
Update figure size for aibrix release blog (#36)
Signed-off-by: Jiaxin Shan <seedjeffwan@gmail.com>
2025-02-21 14:31:58 -08:00
Jiaxin Shan
2c056850c0
Add aibrix release blog post (#35)
Signed-off-by: Jiaxin Shan <seedjeffwan@gmail.com>
2025-02-21 14:02:48 -08:00
Guspan Tanadi
2ae1f8efa2
docs(2025-01-21-stack-release): related repo links (#30)
2025-02-21 11:19:17 +00:00
Murali Andoorveedu
a2efb9d767
Add distributed inference blog post (#27)
* Add distributed inference blog post
Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>
* Update _posts/2025-02-17-distributed-inference.md
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
---------
Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-02-19 17:29:26 +00:00
Guspan Tanadi
d9c2a8e934
Fix links format Installation (#26)
* Fix links format Installation
* Update _posts/2024-09-05-perf-update.md
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
---------
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-02-18 10:04:23 +00:00
Yuan Tang
3d55f5c5cf
Mention the need to request access to Llama models in 2025-01-27-intro-to-llama-stack-with-vllm.md (#23)
2025-02-06 23:54:40 +00:00
Harry Mellor
ff274c498a
Fix documentation link in V1 blog
2025-01-30 15:05:30 +00:00
Yuan Tang
288580d57f
Update 2025-01-27-intro-to-llama-stack-with-vllm.md
2025-01-29 12:19:18 -05:00
Yuan Tang
cfd9e3faf4
Correct inference provider config for K8s deployment 2025-01-27-intro-to-llama-stack-with-vllm.md
2025-01-29 12:01:21 -05:00
Yuan Tang
1146ced6d0
Update 2025-01-27-intro-to-llama-stack-with-vllm.md
2025-01-27 14:49:59 -05:00
Simon Mo
d28e88240d
Merge pull request #15 from Hanchenli/main
Adding production-stack post
2025-01-27 11:37:38 -08:00
Simon Mo
c56753c0df
Merge pull request #18 from hmellor/fix-dead-links
Fix dead links to installation docs
2025-01-27 11:37:23 -08:00
Simon Mo
03662292d6
Merge pull request #16 from terrytangyuan/llama-stack
New blog: Introducing vLLM Inference Provider in Llama Stack
2025-01-27 11:37:16 -08:00
WoosukKwon
41638ef1c5
Add thumbnail to V1 blog post
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-27 08:15:55 -08:00
Harry Mellor
59933df72b
Fix dead links to installation docs
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-01-27 13:08:50 +00:00
WoosukKwon
8f9835f21c
Change date
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-27 00:06:46 -08:00
WoosukKwon
4ff97a8f9d
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 22:50:18 -08:00
WoosukKwon
ca87df9e47
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 21:56:49 -08:00
WoosukKwon
eef62fac45
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 21:43:55 -08:00
WoosukKwon
e83a5be9ff
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:34:12 -08:00
WoosukKwon
f90c18079b
Change data
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:11:25 -08:00
WoosukKwon
4b78497736
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:10:46 -08:00
WoosukKwon
9548a39acc
minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:09:05 -08:00
WoosukKwon
e056d8e892
txt
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 20:08:39 -08:00
WoosukKwon
1aef324e23
Add Qwen2 fig
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 19:29:27 -08:00
WoosukKwon
30d4e438ca
Add prefix caching
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 18:42:15 -08:00
WoosukKwon
35c5a5eee5
WIP
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 18:16:22 -08:00
WoosukKwon
7e26b212bd
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 17:17:10 -08:00
WoosukKwon
80f404ac8b
mv
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:52:45 -08:00
WoosukKwon
90ac189fc1
fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:52:26 -08:00
WoosukKwon
e890f28cea
Fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:36:04 -08:00
WoosukKwon
02d5e058ff
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:20:20 -08:00
WoosukKwon
d2343c442d
Add links to github profiles
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:15:41 -08:00
WoosukKwon
45b7987946
Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 16:07:18 -08:00
Yuan Tang
e95f52795f
Update 2025-01-27-intro-to-llama-stack-with-vllm.md
2025-01-25 11:19:46 -05:00
Yuan Tang
4b3bc6dc25
Move image to assets
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-25 11:15:15 -05:00
Yuan Tang
5ea205d8d8
Minor edits
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 22:39:50 -05:00
Yuan Tang
c8f485f16d
v0.1.0 with fix #879
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 20:49:08 -05:00