vllm/csrc/rocm
Charlie Fu 306d60401d
[ROCm][Kernel] Add gfx950 support for skinny gemms (#18010)
Signed-off-by: charlifu <charlifu@amd.com>
2025-05-31 07:40:05 -07:00
attention.cu [ROCm][Kernel][V1] Enable AMD Radeon GPU Custom Paged Attention on v1 (#17004) 2025-05-21 08:35:00 -07:00
ops.h [Easy] Eliminate c10::optional usage in vllm/csrc (#17819) 2025-05-08 03:05:10 -07:00
skinny_gemms.cu [ROCm][Kernel] Add gfx950 support for skinny gemms (#18010) 2025-05-31 07:40:05 -07:00
torch_bindings.cpp [ROCm][FP8][Kernel] FP8 quantization fused into Custom Paged Attention (#17139) 2025-05-07 07:12:35 -07:00