Commit Graph

35 Commits

Author SHA1 Message Date
qscqesze 1506191318
add minimax-m1 doc (#59)
* update minimax-m1.md

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* change title

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* update

Signed-off-by: qingjun <qingjun@minimaxi.com>

* Update _posts/2025-06-26-minimax-m1.md

* Update _posts/2025-06-26-minimax-m1.md

* Update _posts/2025-06-26-minimax-m1.md

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

---------

Signed-off-by: qingjun <qingjun@minimaxi.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-07-01 10:16:33 +08:00
youkaichao 800bfb147e
Add OpenRLHF blog (#54)
* init

* fix

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* fix

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* fix

* update

* Update _posts/2025-04-18-openrlhf-vllm.md

Co-authored-by: youkaichao <youkaichao@gmail.com>

* update

* minor fix

Signed-off-by: youkaichao <youkaichao@gmail.com>

* minor fix

Signed-off-by: youkaichao <youkaichao@gmail.com>

* update

* update

* rename files

Signed-off-by: youkaichao <youkaichao@gmail.com>

---------

Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: jianh <jianh@nvidia.com>
Co-authored-by: hijkzzz <janhu9527@gmail.com>
2025-04-24 15:13:04 +08:00
Aritra Roy Gosthipaty dcfdf596c1
[Add] Blog post on transformers backend integration with vLLM (#50)
* add transformers backend blog post

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

OK

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

* Apply suggestions from code review

Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

* Update _posts/2025-04-11-transformers-backend.md

Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

OK

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>

* Update _posts/2025-04-11-transformers-backend.md

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

---------

Signed-off-by: ariG23498 <aritra.born2fly@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-16 12:31:00 +01:00
Simon Mo de34b1ce54
llama4 post (#47)
* llama4 post

Signed-off-by: simon-mo <simon.mo@hey.com>

* 8 images

Signed-off-by: simon-mo <simon.mo@hey.com>

* comments

Signed-off-by: simon-mo <simon.mo@hey.com>

* charlotte edits

Signed-off-by: simon-mo <simon.mo@hey.com>

* remove dup paragraphs

Signed-off-by: Roger Wang <ywang@roblox.com>

* add multimodal

Signed-off-by: Roger Wang <ywang@roblox.com>

* update model id

Signed-off-by: Roger Wang <ywang@roblox.com>

* point to 0.8.3 branch

Signed-off-by: Roger Wang <ywang@roblox.com>

---------

Signed-off-by: simon-mo <simon.mo@hey.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
2025-04-05 23:44:57 -07:00
TJian f5100370f3
add ptpc-fp8 amd blogpost (#45)
Signed-off-by: tanpinsiang <pinsiang.tan@embeddedllm.com>
Co-authored-by: tanpinsiang <pinsiang.tan@embeddedllm.com>
2025-03-20 17:22:42 +00:00
Jiaxin Shan 2c056850c0
Add aibrix release blog post (#35)
Signed-off-by: Jiaxin Shan <seedjeffwan@gmail.com>
2025-02-21 14:02:48 -08:00
Harry Mellor 9b5ad206c3
Use remote theme instead of including it locally (#32)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-02-21 15:56:04 +00:00
Murali Andoorveedu a2efb9d767
Add distributed inference blog post (#27)
* Add distributed inference blog post

Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>

* Update _posts/2025-02-17-distributed-inference.md

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

---------

Signed-off-by: andoorve <37849411+andoorve@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-02-19 17:29:26 +00:00
Simon Mo d28e88240d
Merge pull request #15 from Hanchenli/main
Adding production-stack post
2025-01-27 11:37:38 -08:00
Simon Mo 03662292d6
Merge pull request #16 from terrytangyuan/llama-stack
New blog: Introducing vLLM Inference Provider in Llama Stack
2025-01-27 11:37:16 -08:00
WoosukKwon 8f52fac61f Update qwen2vl
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 22:48:26 -08:00
WoosukKwon 1aef324e23 Add Qwen2 fig
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 19:29:27 -08:00
WoosukKwon 30d4e438ca Add prefix caching
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-26 18:42:15 -08:00
Yuan Tang 4b3bc6dc25
Move image to assets
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-25 11:15:15 -05:00
WoosukKwon 4cf76f3c75 Fig
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 14:07:23 -08:00
WoosukKwon c96ab351cc more figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:45:36 -08:00
WoosukKwon ac1befe287 figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:34:40 -08:00
WoosukKwon 7cfbb38745 Initial
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:27:53 -08:00
Hanchenli 47d9a477b7
Add files via upload 2025-01-24 11:27:27 -06:00
Hanchenli f16547db6d
Create temp 2025-01-24 11:25:51 -06:00
Simon Mo f9a15b52eb
Merge pull request #12 from vllm-project/vllm-2024-wrapped-2025-vision
vLLM 2024 Retrospective and 2025 Vision Blog
2025-01-14 15:58:16 -08:00
Aaron Pham 9917647a5f
fix: correct dates for posts
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
2025-01-10 23:29:44 -05:00
Aaron Pham 93a4592ffc
Add blog for introduction in structured decoding
fix: correct item

chore: update author with Red Hat

chore: address comments from Michael and Tyler

chore: update notes on batch support

chore: update target date to be next Tuesday

Co-authored-by: Michael Goin <mgoin@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
2025-01-10 23:16:27 -05:00
mgoin 831d2d044e vLLM 2024 Retrospective and 2025 Vision Blog
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 15:49:36 -05:00
tunjiantan aa86e74ea6 add 2024-10-23-vllm-serving-amd blog post
Signed-off-by: tunjiantan <tunjian.tan@embeddedllm.com>
2024-10-23 10:29:32 +00:00
LiuXiaoxuanPKU 5c940c665f minor 2024-10-22 11:39:37 -07:00
LiuXiaoxuanPKU 0a12f21577 minor 2024-10-22 11:17:53 -07:00
LiuXiaoxuanPKU 98a2b59850 edit 2024-10-22 11:12:11 -07:00
simon-mo a26b36612f Add spec decode blog 2024-10-17 13:30:16 -07:00
Zhuohan Li ce90fa1339 Add v0.6.0 perf blog and also modify readme on how to publish a blogpost 2024-09-04 23:57:30 -07:00
simon-mo bd0a2e74c1 update figure 2024-07-25 15:14:57 -07:00
simon-mo 33d16cb301 initial draft for lfai post 2024-07-25 14:44:46 -07:00
Zhuohan Li f11c9ef0d2 Add Llama 3.1 blogpost (new files) 2024-07-25 13:35:26 -07:00
Zhuohan Li f54ea7342e Use new template for the website 2023-11-14 12:12:47 -08:00
Zhuohan Li 6cd15ede01 first commit 2023-06-21 23:36:19 +08:00