vllm/kernels at lwilkinson/refactor-cmake - vllm

History

Feng XiaoLong 4fc1bf813a [Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (#18454 ) Signed-off-by: Crucifixion-Fxl <xmufxl@gmail.com> Co-authored-by: Crucifixion-Fxl <xmufxl@gmail.com>		2025-05-23 16:16:26 -07:00
..
deepgemm	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_aqlm.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_bitblas.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_cutlass_fp4_moe.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_grouped_gemm_cutlass.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_layernorm.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_lora.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_machete.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_marlin.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_moe.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_moe_permute_unpermute.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_paged_attention.py	[ROCm][Kernel][V1] Enable AMD Radeon GPU Custom Paged Attention on v1 (#17004 )	2025-05-21 08:35:00 -07:00
benchmark_quant.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_rmsnorm.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_rope.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
benchmark_shapes.py	[Kernel] CUTLASS grouped gemm fp8 MoE kernel (#13972 )	2025-03-27 00:54:44 +00:00
benchmark_w8a8_block_fp8.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
graph_machete_bench.py	[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (#18454 )	2025-05-23 16:16:26 -07:00
requirements.txt	[Kernel] (2/N) Machete - Integrate into CompressedTensorsWNA16 and GPTQMarlin (#7701 )	2024-09-23 13:46:26 -04:00
utils.py	Convert `benchmarks` to `ruff format` (#18068 )	2025-05-13 13:43:29 +00:00
weight_shapes.py	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00