vllm/benchmarks/kernels
Jee Jee Li 1caca5a589
[Misc] Add SPDX-FileCopyrightText (#20428)
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
2025-07-04 07:40:42 +00:00
..
deepgemm Revert "[Feature] Integrate new deepgemm (#19820)" (#20049) 2025-06-24 19:45:22 -07:00
bench_fp8_gemm.py [Misc] Add SPDX-FileCopyrightText (#20428) 2025-07-04 07:40:42 +00:00
bench_int8_gemm.py [Benchmark] Refactor benchmark script for fp8 & int8 (#19627) 2025-06-15 15:15:37 +08:00
benchmark_aqlm.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_bitblas.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_cutlass_fp4_moe.py [Hardware][NVIDIA] FP4 MoE kernel optimization (#19110) 2025-06-05 09:48:26 -07:00
benchmark_grouped_gemm_cutlass.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
benchmark_layernorm.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_lora.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_machete.py Enable ZP Support for Machete (#20268) 2025-07-01 07:12:20 +00:00
benchmark_marlin.py [Bugfix][Benchmark] Fix Marlin benchmark (#19929) 2025-06-24 07:25:12 +09:00
benchmark_moe.py Revert "[Feature] Integrate new deepgemm (#19820)" (#20049) 2025-06-24 19:45:22 -07:00
benchmark_moe_align_block_size.py [Frontend] Expose custom args in OpenAI APIs (#16862) 2025-06-18 17:41:11 -07:00
benchmark_moe_permute_unpermute.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_paged_attention.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_quant.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_rmsnorm.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_rope.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_shapes.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
benchmark_w8a8_block_fp8.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
graph_machete_bench.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
requirements.txt [Kernel] (2/N) Machete - Integrate into CompressedTensorsWNA16 and GPTQMarlin (#7701) 2024-09-23 13:46:26 -04:00
utils.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
weight_shapes.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00