vllm/fused_kernels at add-utils - vllm - Gitea: Git with a cup of tea

History

Michael Goin e31446b6c8 [Perf] Tune `scaled_fp8_quant` by increasing vectorization (#18844 ) Signed-off-by: mgoin <mgoin64@gmail.com>		2025-06-03 13:48:25 -07:00
..
fused_layernorm_dynamic_per_token_quant.cu	[Bugfix] Fix `numel()` downcast in fused_layernorm_dynamic_per_token_quant.cu (#17316 )	2025-04-28 19:23:18 -07:00
layernorm_utils.cuh	[Perf] Tune `scaled_fp8_quant` by increasing vectorization (#18844 )	2025-06-03 13:48:25 -07:00
quant_conversions.cuh	[ROCm]: Fix build from source failure with gcc14 and ROCm 6.3 (#13779 )	2025-05-12 20:36:33 -07:00