vllm/csrc/quantization/fp4
Pavani Majety debd6bbf09
[Kernel] Add ModelOpt FP4 Checkpoint Support (#12520)
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
2025-03-12 05:13:11 +00:00
..
nvfp4_quant_entry.cu [NVIDIA] Support nvfp4 quantization (#12784) 2025-02-12 19:51:51 -08:00
nvfp4_quant_kernels.cu [NVIDIA] Fix an issue to use current stream for the nvfp4 quant (#13632) 2025-02-20 22:01:48 -08:00
nvfp4_scaled_mm_entry.cu [Kernel] Add ModelOpt FP4 Checkpoint Support (#12520) 2025-03-12 05:13:11 +00:00
nvfp4_scaled_mm_kernels.cu [Kernel] Add ModelOpt FP4 Checkpoint Support (#12520) 2025-03-12 05:13:11 +00:00