Default Branch

0e3fe896e2 · Support Llama 4 for fused_marlin_moe (#20457) · Updated 2025-07-04 15:55:10 +08:00

Branches

1244c25908 · minimize fill_ · Updated 2025-02-05 06:03:51 +08:00    vLLM

3066
13

0a02744dc8 · fix TP · Updated 2025-01-31 09:18:56 +08:00    vLLM

3112
12

0405645a6c · initial · Updated 2025-01-31 08:55:49 +08:00    vLLM

3110
1

39c4a4cdb5 · review comments · Updated 2025-01-29 07:08:50 +08:00    vLLM

3184
7

a7ca0cc47f · Merge branch 'main' into moondream2 · Updated 2025-01-20 16:10:52 +08:00    vLLM

3260
2

1aa5adb1f7 · cuda · Updated 2025-01-17 03:15:23 +08:00    vLLM

3300
1

7097f31955 · test · Updated 2025-01-15 19:22:32 +08:00    vLLM

3499
22

c1d1875ba3 · Updates docs with correction about default cuda version · Updated 2025-01-08 06:29:07 +08:00    vLLM

3430
1

617fb893d5 · add compile · Updated 2024-07-27 10:29:36 +08:00    vLLM

5385
1

d5bf492f16 · Merge branch 'main' into optimize-prefix-caching-scheduling · Updated 2024-06-04 08:20:15 +08:00    vLLM

6007
4

1936d7bab0 · format · Updated 2024-06-02 08:02:54 +08:00    vLLM

6025
2

c00ddd6834 · Add buffer donation to benchmark · Updated 2024-05-01 05:58:47 +08:00    vLLM

6360
75