vllm/csrc/core
Jinzhen Lin d74e5f37bc
[Kernel] fp4 marlin kernel (#17687)
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
2025-05-10 19:58:49 -07:00
exception.hpp | [Bugfix] Fix Marlin MoE act order when is_k_full == False (#8741) | 2024-09-28 18:19:40 -07:00
math.hpp | [torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867) | 2025-05-01 07:59:28 -07:00
registration.h | [CI/Build] Per file CUDA Archs (improve wheel size and dev build times) (#8845) | 2024-10-03 22:55:25 -04:00
scalar_type.hpp | [Kernel] fp4 marlin kernel (#17687) | 2025-05-10 19:58:49 -07:00