vllm/csrc/quantization/fused_kernels
Michael Goin e31446b6c8
[Perf] Tune `scaled_fp8_quant` by increasing vectorization (#18844)
Signed-off-by: mgoin <mgoin64@gmail.com>
2025-06-03 13:48:25 -07:00
..
fused_layernorm_dynamic_per_token_quant.cu [Bugfix] Fix `numel()` downcast in fused_layernorm_dynamic_per_token_quant.cu (#17316) 2025-04-28 19:23:18 -07:00
layernorm_utils.cuh [Perf] Tune `scaled_fp8_quant` by increasing vectorization (#18844) 2025-06-03 13:48:25 -07:00
quant_conversions.cuh [ROCm]: Fix build from source failure with gcc14 and ROCm 6.3 (#13779) 2025-05-12 20:36:33 -07:00