vllm/tests/kernels/moe
Wentao Ye ffb2cd6b54
[Perf] Optimize `moe_align_block_size` CUDA kernel (#19572)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
2025-06-17 11:49:26 -07:00
..
__init__.py [Kernel] DeepEP dispatch-combine kernel integration (#18434) 2025-06-03 12:30:02 -07:00
deepep_utils.py [Kernel] Integrate batched/masked deepgemm kernel (#19111) 2025-06-04 21:59:18 +00:00
test_batched_moe.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_cutlass_moe.py [Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168) 2025-06-11 12:53:10 -04:00
test_deepep_deepgemm_moe.py [Kernel] Integrate batched/masked deepgemm kernel (#19111) 2025-06-04 21:59:18 +00:00
test_deepep_moe.py [Kernel] DeepEP dispatch-combine kernel integration (#18434) 2025-06-03 12:30:02 -07:00
test_moe.py [Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168) 2025-06-11 12:53:10 -04:00
test_moe_align_block_size.py [Perf] Optimize `moe_align_block_size` CUDA kernel (#19572) 2025-06-17 11:49:26 -07:00
test_moe_permute_unpermute.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_nvfp4_moe.py [Hardware][NVIDIA] FP4 MoE kernel optimization (#19110) 2025-06-05 09:48:26 -07:00
test_pplx_cutlass_moe.py [Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168) 2025-06-11 12:53:10 -04:00
test_pplx_moe.py [Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168) 2025-06-11 12:53:10 -04:00
test_rocm_aiter_topk.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_triton_moe_ptpc_fp8.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00