Commit Graph

168 Commits

Author SHA1 Message Date
WoosukKwon f0a7b016bb Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:54:36 -08:00
WoosukKwon f81c314751 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:54:01 -08:00
Yuan Tang 0121eb45e9
Update 2025-01-27-intro-to-llama-stack-with-vllm.md
2025-01-24 16:51:39 -05:00
WoosukKwon 41e379c103 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:50:38 -08:00
WoosukKwon a8f7abcc58 Minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:49:51 -08:00
WoosukKwon d67eaaa0d9 fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:48:36 -08:00
WoosukKwon abc8465d71 fix
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:48:03 -08:00
WoosukKwon 6eff449c37 minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:46:49 -08:00
WoosukKwon a143f260b1 minor
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:46:22 -08:00
WoosukKwon c96ab351cc more figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:45:36 -08:00
Yuan Tang d2264ca838
Move diagram to the right
2025-01-24 16:42:11 -05:00
WoosukKwon e6f0d55f50 typo
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:38:16 -08:00
WoosukKwon 364acfc37e Align
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:37:45 -08:00
WoosukKwon d3c74e10a2 figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:35:44 -08:00
WoosukKwon ac1befe287 figs
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:34:40 -08:00
Yuan Tang 22ba9bc19f
Update date
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 16:31:50 -05:00
WoosukKwon 7cfbb38745 Initial
Signed-off-by: WoosukKwon <woosuk.kwon@berkeley.edu>
2025-01-24 13:27:53 -08:00
Yuan Tang 0c34d3c9dd
Update 2025-01-12-intro-to-llama-stack-with-vllm.md
2025-01-24 16:24:45 -05:00
Yuan Tang e9af025d3f
memory -> vector_io: inline::faiss
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-24 16:08:30 -05:00
Hanchenli 941fb04fbb
Update 2025-01-21-stack-release.md
2025-01-24 11:29:52 -06:00
Hanchenli 46d4516b54
Update 2025-01-21-stack-release.md
2025-01-24 11:29:01 -06:00
Hanchenli 47d9a477b7
Add files via upload
2025-01-24 11:27:27 -06:00
Hanchenli f16547db6d
Create temp
2025-01-24 11:25:51 -06:00
Hanchenli 0a0111f2ec
Rename 2025-01-21-stack-release (1).md to 2025-01-21-stack-release.md
2025-01-24 11:25:04 -06:00
Hanchenli 41cd6ebf99
Add files via upload
2025-01-24 11:24:43 -06:00
Yuan Tang eaf273e2b7
Update 2025-01-12-intro-to-llama-stack-with-vllm.md
2025-01-23 18:18:48 -05:00
Yuan Tang a4ea99965e
Merge pull request #1 from terrytangyuan/ashwinb-patch-1
Update 2025-01-12-intro-to-llama-stack-with-vllm.md
2025-01-23 18:02:06 -05:00
Ashwin Bharambe 150fc2693c
Update 2025-01-12-intro-to-llama-stack-with-vllm.md
Added some motivational blurb for Llama Stack
2025-01-23 14:15:14 -08:00
Yuan Tang c0a464f2f4
edits
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-19 00:10:57 -05:00
Yuan Tang ed4835234b
edits
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-19 00:05:16 -05:00
Yuan Tang d63d39a314
edit
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-19 00:00:21 -05:00
Simon Mo d7997c8484
Merge pull request #14 from aarnphm/patch-1
Remove invalid links for references
2025-01-14 16:38:19 -08:00
Aaron Pham 793b30ceac
Remove invalid links for references
Ugh for some reason the links from internal notion from our side was still there, my bad.
2025-01-14 19:34:03 -05:00
Simon Mo f9a15b52eb
Merge pull request #12 from vllm-project/vllm-2024-wrapped-2025-vision
vLLM 2024 Retrospective and 2025 Vision Blog
2025-01-14 15:58:16 -08:00
Michael Goin bf0f9ca91c
Merge pull request #13 from aarnphm/patch-1
Fix bad bibtex reference for structured decoding
2025-01-14 13:07:11 -05:00
Aaron Pham 7ba4e479cf
Fix bad bib references
2025-01-14 12:13:03 -05:00
Michael Goin 1db04f7221
Remove bad link in 2025-01-14-struct-decode-intro.md
2025-01-14 11:50:11 -05:00
Simon Mo 581cb09d9a
Merge pull request #10 from aarnphm/blog-structured-decoding-introduction
Add structured decoding introduction blog
2025-01-14 08:46:05 -08:00
Michael Goin 24605886e4
Attributions!
2025-01-14 11:40:58 -05:00
Michael Goin 26b31f1550
Add usage data section
2025-01-13 12:24:28 -07:00
Yuan Tang 940a264895
Acknowledgement
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-12 20:56:12 -05:00
Yuan Tang ff47b6e951
Initial draft on Llama Stack integration
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
2025-01-12 20:50:26 -05:00
Aaron Pham 9917647a5f
fix: correct dates for posts
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
2025-01-10 23:29:44 -05:00
Aaron Pham 93a4592ffc
Add blog for introduction in structured decoding
fix: correct item

chore: update author with Red Hat

chore: address comments from Michael and Tyler

chore: update notes on batch support

chore: update target date to be next Tuesday

Co-authored-by: Michael Goin <mgoin@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
2025-01-10 23:16:27 -05:00
simon-mo 8ec5cdfb3e Claude edits
Signed-off-by: simon-mo <simon.mo@hey.com>
2025-01-10 15:05:52 -08:00
simon-mo 0b924ad8ae Simon edits
Signed-off-by: simon-mo <simon.mo@hey.com>
2025-01-10 14:58:18 -08:00
mgoin fc6e1dc50e Updates
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 16:03:40 -05:00
mgoin d76e1989e3 Update
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 15:53:56 -05:00
mgoin 831d2d044e vLLM 2024 Retrospective and 2025 Vision Blog
Signed-off-by: mgoin <michael@neuralmagic.com>
2025-01-10 15:49:36 -05:00
youkaichao 02c36e5964
Merge pull request #11 from vllm-project/dev_experience
Installing and Developing vLLM with Ease
2025-01-11 01:28:09 +08:00