Default Branch

9e0726e5bf · [Meta] Official Eagle mm support, first enablement on llama4 (#20788) · Updated 2025-08-01 01:35:07 +08:00

Branches

e17250f0d2 · fix precommit · Updated 2025-06-19 12:17:43 +08:00 · vLLM · 970 behind · 1 ahead

b6553be1bc · [Misc] Slight improvement of the BNB (#19418) · Updated 2025-06-10 21:51:49 +08:00 · vLLM · 1124 behind · 0 ahead · Included

ca15f0afe6 · ci(Mergify): configuration update · Updated 2025-06-09 15:44:44 +08:00 · vLLM · 1155 behind · 1 ahead

d3b51c9bba · fix build · Updated 2025-06-09 08:38:37 +08:00 · vLLM · 1400 behind · 10 ahead

9a76ef07b9 · Add pandas and datasets for benchmarks · Updated 2025-06-04 21:51:59 +08:00 · vLLM · 1230 behind · 1 ahead

1236aebf0e · Merge remote-tracking branch 'origin/main' into fp8_ep_dp · Updated 2025-06-03 02:53:27 +08:00 · vLLM · 1283 behind · 20 ahead

5fbbfe9a4c · [BugFix] FA2 MLA Accuracy Issue (#18807) · Updated 2025-05-30 23:50:58 +08:00 · vLLM · 1398 behind · 1 ahead

2e773e55b3 · docs: merge v1 architecture with class hierarchy · Updated 2025-05-18 14:48:12 +08:00 · vLLM · 1607 behind · 1 ahead

221118dc85 · [Bugfix] Use a different prompt for benchmark_serving.py test prompt · Updated 2025-05-18 02:36:31 +08:00 · vLLM · 1608 behind · 1 ahead

f96a3cc713 · test · Updated 2025-05-10 04:31:08 +08:00 · vLLM · 1792 behind · 2 ahead

79acf80471 · Fast decode prepare path for prepare_inputs logic · Updated 2025-05-09 01:26:00 +08:00 · vLLM · 2056 behind · 1 ahead

bcf3c8230d · Merge branch 'main' into woosuk-jf · Updated 2025-05-05 02:16:07 +08:00 · vLLM · 1900 behind · 3 ahead

b73fdb927a · draft · Updated 2025-05-04 01:50:34 +08:00 · vLLM · 2051 behind · 1 ahead

3015d5634e · [BugFix][Attention] Fix sliding window attention in V1 giving incorrect results (#17574) · Updated 2025-05-03 02:02:48 +08:00 · vLLM · 2047 behind · 3 ahead

3ed73a0fe5 · Bump actions/setup-python from 5.4.0 to 5.6.0 · Updated 2025-04-28 12:54:55 +08:00 · vLLM · 2064 behind · 1 ahead

a7b809e0f0 · Merge remote-tracking branch 'upstream/main' into benchmark-output · Updated 2025-04-23 22:55:50 +08:00 · vLLM · 2181 behind · 8 ahead

ec69124eb4 · [Misc] Improve readability of get_open_port function. (#17024) · Updated 2025-04-23 14:16:53 +08:00 · vLLM · 2189 behind · 0 ahead · Included

161010c384 · Initial stubs for P/D scheduling changes · Updated 2025-04-19 04:42:49 +08:00 · vLLM · 2267 behind · 1 ahead

dc1b4a6f13 · [Core][V0] Enable regex support with xgrammar (#13228) · Updated 2025-04-14 10:13:38 +08:00 · vLLM · 2356 behind · 0 ahead · Included

ccd21e1993 · [V1] Fix profiling.py · Updated 2025-04-12 02:36:37 +08:00 · vLLM · 2385 behind · 1 ahead