vllm/csrc/quantization/cutlass_w8a8/moe
ElizaWszola 84166fee97
[Kernel] Integrate CUTLASS MoE kernel with PPLX (#18762)
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
2025-06-06 18:26:11 -07:00
get_group_starts.cuh  [Kernel] CUTLASS grouped gemm fp8 MoE kernel (#13972)     2025-03-27 00:54:44 +00:00
grouped_mm_c3x.cu     [Kernel] Integrate CUTLASS MoE kernel with PPLX (#18762)  2025-06-06 18:26:11 -07:00
grouped_mm_c3x.cuh    [Kernel] Integrate CUTLASS MoE kernel with PPLX (#18762)  2025-06-06 18:26:11 -07:00
moe_data.cu           [Kernel] Integrate CUTLASS MoE kernel with PPLX (#18762)  2025-06-06 18:26:11 -07:00