vllm/benchmarks/kernels
Feng XiaoLong 4fc1bf813a
[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (#18454)
Signed-off-by: Crucifixion-Fxl <xmufxl@gmail.com>
Co-authored-by: Crucifixion-Fxl <xmufxl@gmail.com>
2025-05-23 16:16:26 -07:00
..
deepgemm Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_aqlm.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_bitblas.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_cutlass_fp4_moe.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_grouped_gemm_cutlass.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_layernorm.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_lora.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_machete.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_marlin.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_moe.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_moe_permute_unpermute.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_paged_attention.py [ROCm][Kernel][V1] Enable AMD Radeon GPU Custom Paged Attention on v1 (#17004) 2025-05-21 08:35:00 -07:00
benchmark_quant.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_rmsnorm.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_rope.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
benchmark_shapes.py [Kernel] CUTLASS grouped gemm fp8 MoE kernel (#13972) 2025-03-27 00:54:44 +00:00
benchmark_w8a8_block_fp8.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
graph_machete_bench.py [Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (#18454) 2025-05-23 16:16:26 -07:00
requirements.txt [Kernel] (2/N) Machete - Integrate into CompressedTensorsWNA16 and GPTQMarlin (#7701) 2024-09-23 13:46:26 -04:00
utils.py Convert `benchmarks` to `ruff format` (#18068) 2025-05-13 13:43:29 +00:00
weight_shapes.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00