Commit Graph

149 Commits

Author SHA1 Message Date
WoosukKwon 1536bd7ce2 Fig
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 14:07:40 -08:00
WoosukKwon 4cf76f3c75 Fig
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 14:07:23 -08:00
WoosukKwon ce983a38d5 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 14:00:27 -08:00
WoosukKwon f0a7b016bb Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:54:36 -08:00
WoosukKwon f81c314751 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:54:01 -08:00
Yuan Tang 0121eb45e9
Update 2025-01-27-intro-to-llama-stack-with-vllm.md 2025-01-24 16:51:39 -05:00
WoosukKwon 41e379c103 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:50:38 -08:00
WoosukKwon a8f7abcc58 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:49:51 -08:00
WoosukKwon d67eaaa0d9 fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:48:36 -08:00
WoosukKwon abc8465d71 fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:48:03 -08:00
WoosukKwon 6eff449c37 minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:46:49 -08:00
WoosukKwon a143f260b1 minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:46:22 -08:00
WoosukKwon c96ab351cc more figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:45:36 -08:00
Yuan Tang d2264ca838
Move diagram to the right 2025-01-24 16:42:11 -05:00
WoosukKwon e6f0d55f50 typo
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:38:16 -08:00
WoosukKwon 364acfc37e Align
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:37:45 -08:00
WoosukKwon d3c74e10a2 figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:35:44 -08:00
Yuan Tang 22ba9bc19f
Update date
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 16:31:50 -05:00
WoosukKwon 7cfbb38745 Initial
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:27:53 -08:00
Yuan Tang 0c34d3c9dd
Update 2025-01-12-intro-to-llama-stack-with-vllm.md 2025-01-24 16:24:45 -05:00
Yuan Tang e9af025d3f
memory -> vector_io: inline::faiss
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 16:08:30 -05:00
Hanchenli 941fb04fbb
Update 2025-01-21-stack-release.md 2025-01-24 11:29:52 -06:00
Hanchenli 46d4516b54
Update 2025-01-21-stack-release.md 2025-01-24 11:29:01 -06:00
Hanchenli 0a0111f2ec
Rename 2025-01-21-stack-release (1).md to 2025-01-21-stack-release.md 2025-01-24 11:25:04 -06:00
Hanchenli 41cd6ebf99
Add files via upload 2025-01-24 11:24:43 -06:00
Yuan Tang eaf273e2b7
Update 2025-01-12-intro-to-llama-stack-with-vllm.md 2025-01-23 18:18:48 -05:00
Ashwin Bharambe 150fc2693c
Update 2025-01-12-intro-to-llama-stack-with-vllm.md
Added some motivational blurb for Llama Stack
2025-01-23 14:15:14 -08:00
Yuan Tang c0a464f2f4
edits
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-19 00:10:57 -05:00
Yuan Tang ed4835234b
edits
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-19 00:05:16 -05:00
Yuan Tang d63d39a314
edit
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-19 00:00:21 -05:00
Aaron Pham 793b30ceac
Remove invalid links for references
Ugh for some reason the links from internal notion from our side was still there, my bad.
2025-01-14 19:34:03 -05:00
Simon Mo f9a15b52eb
Merge pull request #12 from vllm-project/vllm-2024-wrapped-2025-vision
vLLM 2024 Retrospective and 2025 Vision Blog
2025-01-14 15:58:16 -08:00
Aaron Pham 7ba4e479cf
Fix bad bib references 2025-01-14 12:13:03 -05:00
Michael Goin 1db04f7221
Remove bad link in 2025-01-14-struct-decode-intro.md 2025-01-14 11:50:11 -05:00
Michael Goin 24605886e4
Attributions! 2025-01-14 11:40:58 -05:00
Michael Goin 26b31f1550
Add usage data section 2025-01-13 12:24:28 -07:00
Yuan Tang 940a264895
Acknowledgement
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-12 20:56:12 -05:00
Yuan Tang ff47b6e951
Initial draft on Llama Stack integration
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-12 20:50:26 -05:00
Aaron Pham 9917647a5f
fix: correct dates for posts
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
2025-01-10 23:29:44 -05:00
Aaron Pham 93a4592ffc
Add blog for introduction in structured decoding
fix: correct item
chore: update author with Red Hat
chore: address comments from Michael and Tyler
chore: update notes on batch support
chore: update target date to be next Tuesday

Co-authored-by: Michael Goin <mgoin@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
2025-01-10 23:16:27 -05:00
simon-mo 8ec5cdfb3e Claude edits
Signed-off-by: simon-mo <simon.mo@hey.com>
2025-01-10 15:05:52 -08:00
simon-mo 0b924ad8ae Simon edits
Signed-off-by: simon-mo <simon.mo@hey.com>
2025-01-10 14:58:18 -08:00
mgoin fc6e1dc50e Updates
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 16:03:40 -05:00
mgoin d76e1989e3 Update
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 15:53:56 -05:00
mgoin 831d2d044e vLLM 2024 Retrospective and 2025 Vision Blog
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 15:49:36 -05:00
youkaichao 70f6a1559e polish
Signed-off-by: youkaichao <youkaichao@gmail.com>
2025-01-10 21:05:06 +08:00
youkaichao 9990f0075b polish
Signed-off-by: youkaichao <youkaichao@gmail.com>
2025-01-10 21:00:32 +08:00
youkaichao a16c6d7106 fix format
Signed-off-by: youkaichao <youkaichao@gmail.com>
2025-01-10 20:52:41 +08:00
youkaichao a6df2f4e0d initial draft from google doc
Signed-off-by: youkaichao <youkaichao@gmail.com>
2025-01-10 20:43:00 +08:00
simon-mo 4577c6ac65 retro add images 2024-12-15 15:16:12 -08:00
simon-mo cd252bd0e6 Merge branch 'main' of github.com:vllm-project/vllm-blog-source 2024-12-15 15:10:18 -08:00
simon-mo 1077607dc5 test image tag 2024-12-15 15:10:08 -08:00
Cornelius 1f18e35f9a
Fix typo in num-scheduler-steps parameter 2024-11-30 17:39:42 +01:00
tunjiantan 9769c02a65 amend data type
Signed-off-by: tunjiantan <tunjian.tan@embeddedllm.com>
2024-10-30 16:24:17 +00:00
tunjiantan cc0466fe0f amend benchmark command
Signed-off-by: tunjiantan <tunjian.tan@embeddedllm.com>
2024-10-30 16:24:17 +00:00
tunjiantan b254fde054 fix spell check
Signed-off-by: tunjiantan <tunjian.tan@embeddedllm.com>
2024-10-29 23:46:56 +00:00
simon-mo 78b72d36e3 amd post edits 2024-10-29 11:26:03 -07:00
tunjiantan aa86e74ea6 add 2024-10-23-vllm-serving-amd blog post
Signed-off-by: tunjiantan <tunjian.tan@embeddedllm.com>
2024-10-23 10:29:32 +00:00
LiuXiaoxuanPKU 5c940c665f minor 2024-10-22 11:39:37 -07:00
LiuXiaoxuanPKU c02becf2bd minor 2024-10-22 11:20:25 -07:00
LiuXiaoxuanPKU 0a12f21577 minor 2024-10-22 11:17:53 -07:00
LiuXiaoxuanPKU 98a2b59850 edit 2024-10-22 11:12:11 -07:00
simon-mo a9bae7a33e spec decode edits 2024-10-18 10:49:39 -07:00
simon-mo a26b36612f Add spec decode blog 2024-10-17 13:30:16 -07:00
Zhuohan Li ba30fb1b28 add limitation 2024-09-06 09:58:22 -07:00
Zhuohan Li 0dd37adf40 add missing paragraph 2024-09-05 11:36:55 -07:00
simon-mo 711ec962d4 Revert "try twitter header image"
This reverts commit 6fc0369073.
2024-09-05 10:20:35 -07:00
simon-mo 6fc0369073 try twitter header image 2024-09-05 09:56:33 -07:00
Zhuohan Li 12ae2a2d7a small fix 2024-09-05 09:54:01 -07:00
Zhuohan Li fadcbcc3cd change acknowledgement 2024-09-05 09:48:33 -07:00
Zhuohan Li 67d2c32341 fix minor issues 2024-09-05 09:44:03 -07:00
Zhuohan Li 1b304fef5c minor fixes 2024-09-05 00:18:44 -07:00
Zhuohan Li 5aa2180327 change will's name 2024-09-05 00:08:57 -07:00
Zhuohan Li 73405ded17 remove the in the author 2024-09-05 00:07:06 -07:00
Zhuohan Li 31cb4a5733 Change date 2024-09-05 00:03:11 -07:00
Zhuohan Li 321025b5d7 Add some hard-coded change in html to markdown 2024-09-05 00:02:30 -07:00
Zhuohan Li ce90fa1339 Add v0.6.0 perf blog and also modify readme on how to publish a blogpost 2024-09-04 23:57:30 -07:00
simon-mo 99c42c3c05 update snowflake to llama3.1 post 2024-08-07 14:27:00 -07:00
simon-mo d39c04f6f2 Add snowflake to llama3.1 post 2024-08-07 13:57:47 -07:00
simon-mo 90a64dddc0 typo 2024-07-25 15:03:48 -07:00
simon-mo d85b0ef5b5 backport llama changes 2024-07-25 14:56:53 -07:00
simon-mo 9227cfd6d5 update lfai 2024-07-25 14:56:21 -07:00
simon-mo 33d16cb301 initial draft for lfai post 2024-07-25 14:44:46 -07:00
Zhuohan Li f11c9ef0d2 Add Llama 3.1 blogpost (new files) 2024-07-25 13:35:26 -07:00
Woosuk Kwon d9970f9003 model & hardward 2023-11-14 23:11:49 +00:00
Woosuk Kwon 1d9d5b235d FastGen 2023-11-14 22:42:57 +00:00
Woosuk Kwon f67078d283 bold 2023-11-14 22:29:29 +00:00
Woosuk Kwon 783c7628b2
Update _posts/2023-11-14-notes-vllm-vs-deepspeed.md
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-11-14 12:45:39 -08:00
Woosuk Kwon 5232941cfe
Update _posts/2023-11-14-notes-vllm-vs-deepspeed.md
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-11-14 12:45:32 -08:00
Woosuk Kwon a0f139a454
Update _posts/2023-11-14-notes-vllm-vs-deepspeed.md
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-11-14 12:45:25 -08:00
Woosuk Kwon ba9eb7994f
Update _posts/2023-11-14-notes-vllm-vs-deepspeed.md
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-11-14 12:45:17 -08:00
Woosuk Kwon e51ece8b31
Update _posts/2023-11-14-notes-vllm-vs-deepspeed.md
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-11-14 12:45:03 -08:00
Woosuk Kwon 96d1e57523
Update _posts/2023-11-14-notes-vllm-vs-deepspeed.md
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
2023-11-14 12:44:56 -08:00
Woosuk Kwon 1e4fef6a7f Fix link{ 2023-11-14 20:40:26 +00:00
Woosuk Kwon 734b320fae Polish 2023-11-14 20:25:41 +00:00
Zhuohan Li 7270188294 fix file name 2023-11-14 12:14:07 -08:00
Zhuohan Li f54ea7342e Use new template for the website 2023-11-14 12:12:47 -08:00
Zhuohan Li 6a9337597c Add paper link 2023-09-14 13:45:16 -07:00
Zhuohan Li 6cd15ede01 first commit 2023-06-21 23:36:19 +08:00