youkaichao
|
e1faa2a598
|
[misc] improve ux on readme (#9147)
|
2024-10-07 22:26:25 -07:00 |
Simon Mo
|
8eeb857084
|
Add Slack to README (#9137)
|
2024-10-07 17:06:21 -07:00 |
Kuntai Du
|
c0d9a98d0c
|
[Doc] Include performance benchmark in README (#9135)
|
2024-10-07 15:04:06 -07:00 |
Zhuohan Li
|
a95354a36e
|
[Doc] Update README.md with Ray summit slides (#9088)
|
2024-10-05 02:54:45 +00:00 |
Simon Mo
|
36eecfbddb
|
Remove AMD Ray Summit Banner (#9075)
|
2024-10-04 10:17:16 -07:00 |
Simon Mo
|
a1d874224d
|
Add NVIDIA Meetup slides, announce AMD meetup, and add contact info (#8319)
|
2024-09-09 23:21:00 -07:00 |
Simon Mo
|
c5c7768264
|
Announce NVIDIA Meetup (#7483)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
|
2024-08-13 14:28:36 -07:00 |
Simon Mo
|
f020a6297e
|
[Docs] Update readme (#7316)
|
2024-08-11 17:13:37 -07:00 |
Simon Mo
|
5923532e15
|
Add Skywork AI as Sponsor (#7314)
|
2024-08-08 13:59:57 -07:00 |
Woosuk Kwon
|
b7215de2c5
|
[Docs] Publish 5th meetup slides (#6799)
|
2024-07-25 16:47:55 -07:00 |
Kuntai Du
|
6a1e25b151
|
[Doc] Add documentations for nightly benchmarks (#6412)
|
2024-07-25 11:57:16 -07:00 |
Woosuk Kwon
|
cb1362a889
|
[Docs] Announce llama3.1 support (#6688)
|
2024-07-23 08:18:15 -07:00 |
Woosuk Kwon
|
37d776606f
|
[Docs] Announce 5th meetup (#6458)
|
2024-07-15 21:04:58 -07:00 |
Woosuk Kwon
|
3dee97b05f
|
[Docs] Add Google Cloud to sponsor list (#6450)
|
2024-07-15 11:58:10 -07:00 |
Woosuk Kwon
|
d80aef3776
|
[Docs] Clean up latest news (#6401)
|
2024-07-12 19:36:53 -07:00 |
Saliya Ekanayake
|
a27f87da34
|
[Doc] Fix Typo in Doc (#6392)
Co-authored-by: Saliya Ekanayake <esaliya@d-matrix.ai>
|
2024-07-13 00:48:23 +00:00 |
Kuntai Du
|
a4feba929b
|
[CI/Build] Add nightly benchmarking for tgi, tensorrt-llm and lmdeploy (#5362)
|
2024-07-11 13:28:38 -07:00 |
youkaichao
|
2d23b42d92
|
[doc] update pipeline parallel in readme (#6347)
|
2024-07-11 11:38:40 -07:00 |
Jie Fu (傅杰)
|
439c84581a
|
[Doc] Update description of vLLM support for CPUs (#6003)
|
2024-07-10 21:15:29 -07:00 |
Kunshang Ji
|
cf90ae0123
|
[CI][Hardware][Intel GPU] add Intel GPU(XPU) ci pipeline (#5616)
|
2024-06-21 17:09:34 -07:00 |
Simon Mo
|
cdab68dcdb
|
[Docs] Add ZhenFund as a Sponsor (#5548)
|
2024-06-14 11:17:21 -07:00 |
Woosuk Kwon
|
a65634d3ae
|
[Docs] Add 4th meetup slides (#5509)
|
2024-06-13 10:18:26 -07:00 |
Li, Jiang
|
80aa7e91fc
|
[Hardware][Intel] Optimize CPU backend and add more performance tips (#4971)
Co-authored-by: Jianan Gu <jianan.gu@intel.com>
|
2024-06-13 09:33:14 -07:00 |
Woosuk Kwon
|
cb77ad836f
|
[Docs] Alphabetically sort sponsors (#5386)
|
2024-06-10 15:17:19 -05:00 |
Simon Mo
|
8f1729b829
|
[Docs] Add Ray Summit CFP (#5295)
|
2024-06-05 15:25:18 -07:00 |
Simon Mo
|
f270a39537
|
[Docs] Add Sequoia as sponsors (#5287)
|
2024-06-05 18:02:56 +00:00 |
Simon Mo
|
290f4ada2b
|
[Docs] Add Dropbox as sponsors (#5089)
|
2024-05-28 10:29:09 -07:00 |
Simon Mo
|
e941f88584
|
[Docs] Add acknowledgment for sponsors (#4925)
|
2024-05-21 00:17:25 -07:00 |
Zhuohan Li
|
361c461a12
|
[Doc] Highlight the fourth meetup in the README (#4842)
|
2024-05-15 11:38:49 -07:00 |
Simon Mo
|
29bc01bf3b
|
Add 4th meetup announcement to readme (#4817)
|
2024-05-14 18:33:06 -04:00 |
Zhuohan Li
|
ac1fbf7fd2
|
[Doc] Shorten README by removing supported model list (#4796)
|
2024-05-13 16:23:54 -07:00 |
Caio Mendes
|
bd7a8eef25
|
[Doc] README Phi-3 name fix. (#4372)
Co-authored-by: Caio Mendes <caiocesart@microsoft.com>
|
2024-04-25 10:32:00 -07:00 |
Isotr0py
|
fbf152d976
|
[Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (#4324)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
|
2024-04-25 09:35:56 -07:00 |
Caio Mendes
|
96e90fdeb3
|
[Model] Adds Phi-3 support (#4298)
|
2024-04-25 03:06:57 +00:00 |
Simon Mo
|
705578ae14
|
[Docs] document that Meta Llama 3 is supported (#4175)
|
2024-04-18 10:55:48 -07:00 |
Simon Mo
|
aceb17cf2d
|
[Docs] document that mixtral 8x22b is supported (#4073)
|
2024-04-14 14:35:55 -07:00 |
ywfang
|
b4543c8f6b
|
[Model] add minicpm (#3893)
|
2024-04-08 18:28:36 +08:00 |
Woosuk Kwon
|
b95047f2da
|
[Misc] Publish 3rd meetup slides (#3835)
|
2024-04-03 15:46:10 -07:00 |
Robert Shaw
|
76b889bf1d
|
[Doc] Update README.md (#3806)
|
2024-04-02 23:11:10 -07:00 |
wenyujin333
|
d6ea427f04
|
[Model] Add support for Qwen2MoeModel (#3346)
|
2024-03-28 15:19:59 +00:00 |
hxer7963
|
098e1776ba
|
[Model] Add support for xverse (#3610)
Co-authored-by: willhe <hexin@xverse.cn>
Co-authored-by: root <root@localhost.localdomain>
|
2024-03-27 18:12:54 -07:00 |
Woosuk Kwon
|
6d9aa00fc4
|
[Docs] Add Command-R to supported models (#3669)
|
2024-03-27 15:20:00 -07:00 |
Megha Agarwal
|
e24336b5a7
|
[Model] Add support for DBRX (#3660)
|
2024-03-27 13:01:46 -07:00 |
Lalit Pradhan
|
4c07dd28c0
|
[🚀 Ready to be merged] Added support for Jais models (#3183)
|
2024-03-21 09:45:24 +00:00 |
Zhuohan Li
|
b30880a762
|
[Misc] Update README for the Third vLLM Meetup (#3479)
|
2024-03-18 15:58:38 -07:00 |
Seonghyeon
|
bfdcfa6a05
|
Support starcoder2 architecture (#3089)
|
2024-02-29 00:51:48 -08:00 |
张大成
|
48a8f4a7fd
|
Support Orion model (#2539)
Co-authored-by: zhangdacheng <zhangdacheng@ainirobot.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
|
2024-02-26 19:17:06 -08:00 |
Zhuohan Li
|
a9c8212895
|
[FIX] Add Gemma model to the doc (#2966)
|
2024-02-21 09:46:15 -08:00 |
Isotr0py
|
ab3a5a8259
|
Support OLMo models. (#2832)
|
2024-02-18 21:05:15 -08:00 |
Simon Mo
|
bb8c697ee0
|
Update README for meetup slides (#2718)
|
2024-02-01 14:56:53 -08:00 |
Fengzhe Zhou
|
cd9e60c76c
|
Add Internlm2 (#2666)
|
2024-02-01 09:27:40 -08:00 |
Zhuohan Li
|
1af090b57d
|
Bump up version to v0.3.0 (#2656)
|
2024-01-31 00:07:07 -08:00 |
Hongxia Yang
|
6b7de1a030
|
[ROCm] add support to ROCm 6.0 and MI300 (#2274)
|
2024-01-26 12:41:10 -08:00 |
Junyang Lin
|
94b5edeb53
|
Add qwen2 (#2495)
|
2024-01-22 14:34:21 -08:00 |
Hyunsung Lee
|
e1957c6ebd
|
Add StableLM3B model (#2372)
|
2024-01-16 20:32:40 -08:00 |
Woosuk Kwon
|
2a18da257c
|
Announce the second vLLM meetup (#2444)
|
2024-01-15 14:11:59 -08:00 |
blueceiling
|
face83c7ec
|
[Docs] Add "About" Heading to README.md (#2260)
|
2023-12-25 16:37:07 -08:00 |
avideci
|
de60a3fb93
|
Added DeciLM-7b and DeciLM-7b-instruct (#2062)
|
2023-12-19 02:29:33 -08:00 |
Woosuk Kwon
|
f8c688d746
|
[Minor] Add Phi 2 to supported models (#2159)
|
2023-12-17 02:54:57 -08:00 |
Woosuk Kwon
|
26c52a5ea6
|
[Docs] Add CUDA graph support to docs (#2148)
|
2023-12-17 01:49:20 -08:00 |
Woosuk Kwon
|
b81a6a6bb3
|
[Docs] Add supported quantization methods to docs (#2135)
|
2023-12-15 13:29:22 -08:00 |
Antoni Baum
|
21d93c140d
|
Optimize Mixtral with expert parallelism (#2090)
|
2023-12-13 23:55:07 -08:00 |
Woosuk Kwon
|
31d2ab4aff
|
Remove python 3.10 requirement (#2040)
|
2023-12-11 12:26:42 -08:00 |
Ram
|
2eaa81b236
|
Update README.md to add megablocks requirement for mixtral (#2033)
|
2023-12-11 11:37:34 -08:00 |
Pierre Stock
|
b5f882cc98
|
Mixtral 8x7B support (#2011)
Co-authored-by: Pierre Stock <p@mistral.ai>
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
|
2023-12-11 01:09:15 -08:00 |
TJian
|
6ccc0bfffb
|
Merge EmbeddedLLM/vllm-rocm into vLLM main (#1836)
Co-authored-by: Philipp Moritz <pcmoritz@gmail.com>
Co-authored-by: Amir Balwel <amoooori04@gmail.com>
Co-authored-by: root <kuanfu.liu@akirakan.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: kuanfu <kuanfu.liu@embeddedllm.com>
Co-authored-by: miloice <17350011+kliuae@users.noreply.github.com>
|
2023-12-07 23:16:52 -08:00 |
Woosuk Kwon
|
e5452ddfd6
|
Normalize head weights for Baichuan 2 (#1876)
|
2023-11-30 20:03:58 -08:00 |
Zhuohan Li
|
32c927b53f
|
[FIX] Update the doc link in README.md (#1730)
|
2023-11-20 12:46:24 -08:00 |
Zhuohan Li
|
415d109527
|
[Fix] Update Supported Models List (#1690)
|
2023-11-16 14:47:26 -08:00 |
maximzubkov
|
521b35f799
|
Support Microsoft Phi 1.5 (#1664)
|
2023-11-16 14:28:39 -08:00 |
ldwang
|
6368e777a8
|
Add Aquila2 to README (#1331)
Signed-off-by: ldwang <ftgreat@gmail.com>
Co-authored-by: ldwang <ftgreat@gmail.com>
|
2023-10-12 12:11:16 -07:00 |
Zhuohan Li
|
9eed4d1f3e
|
Update README.md (#1292)
|
2023-10-08 23:15:50 -07:00 |
Woosuk Kwon
|
202351d5bf
|
Add Mistral to supported model list (#1221)
|
2023-09-28 14:33:04 -07:00 |
Woosuk Kwon
|
8d926e91f1
|
Announce the First vLLM Meetup (#1148)
|
2023-09-22 11:37:14 -07:00 |
Zhuohan Li
|
c1026311b5
|
[Community] Add vLLM Discord server (#1086)
|
2023-09-18 12:23:35 -07:00 |
Woosuk Kwon
|
eda1a7cad3
|
Announce paper release (#1036)
|
2023-09-13 17:38:13 -07:00 |
Ikko Eltociear Ashimine
|
3272d7a0b7
|
Fix typo in README.md (#1033)
|
2023-09-13 12:55:23 -07:00 |
Zhuohan Li
|
c128d69856
|
Fix README.md Link (#927)
|
2023-08-31 17:18:34 -07:00 |
Zhuohan Li
|
0080d8329d
|
Add acknowledgement to a16z grant
|
2023-08-30 02:26:47 -07:00 |
ldwang
|
85ebcda94d
|
Fix typo of Aquila in README.md (#836)
|
2023-08-22 20:48:36 -07:00 |
Zhuohan Li
|
14f9c72bfd
|
Update Supported Model List (#825)
|
2023-08-22 11:51:44 -07:00 |
Zhuohan Li
|
f7389f4763
|
[Doc] Add Baichuan 13B to supported models (#656)
|
2023-08-02 16:45:12 -07:00 |
Zhuohan Li
|
1b0bd0fe8a
|
Add Falcon support (new) (#592)
|
2023-08-02 14:04:39 -07:00 |
Zhuohan Li
|
df5dd3c68e
|
Add Baichuan-7B to README (#494)
|
2023-07-25 15:25:12 -07:00 |
Zhuohan Li
|
6fc2a38b11
|
Add support for LLaMA-2 (#505)
|
2023-07-20 11:38:27 -07:00 |
Andre Slavescu
|
c894836108
|
[Model] Add support for GPT-J (#226)
Co-authored-by: woWoosuk Kwon <woosuk.kwon@berkeley.edu>
|
2023-07-08 17:55:16 -07:00 |
Woosuk Kwon
|
404422f42e
|
[Model] Add support for MPT (#334)
|
2023-07-03 16:47:53 -07:00 |
Woosuk Kwon
|
e41f06702c
|
Add support for BLOOM (#331)
|
2023-07-03 13:12:35 -07:00 |
Zhanghao Wu
|
f72297562f
|
Add news for the vllm+skypilot example (#314)
|
2023-06-29 12:32:37 -07:00 |
Zhuohan Li
|
2cf1a333b6
|
[Doc] Documentation for distributed inference (#261)
|
2023-06-26 11:34:23 -07:00 |
Lianmin Zheng
|
6214dd6ce9
|
Update README.md (#236)
|
2023-06-25 16:58:06 -07:00 |
Woosuk Kwon
|
665c48963b
|
[Docs] Add GPTBigCode to supported models (#213)
|
2023-06-22 15:05:11 -07:00 |
Zhuohan Li
|
033f5c78f5
|
Remove e.g. in README (#167)
|
2023-06-20 14:00:28 +08:00 |
Woosuk Kwon
|
794e578de0
|
[Minor] Fix URLs (#166)
|
2023-06-19 22:57:14 -07:00 |
Zhuohan Li
|
fc72e39de3
|
Change image urls (#164)
|
2023-06-20 11:15:15 +08:00 |
Woosuk Kwon
|
b7e62d3454
|
Fix repo & documentation URLs (#163)
|
2023-06-19 20:03:40 -07:00 |
Woosuk Kwon
|
364536acd1
|
[Docs] Minor fix (#162)
|
2023-06-19 19:58:23 -07:00 |
Zhuohan Li
|
0b32a987dd
|
Add and list supported models in README (#161)
|
2023-06-20 10:57:46 +08:00 |
Zhuohan Li
|
a255885f83
|
Add logo and polish readme (#156)
|
2023-06-19 16:31:13 +08:00 |
Woosuk Kwon
|
dcda03b4cb
|
Write README and front page of doc (#147)
|
2023-06-18 03:19:38 -07:00 |