vllm/tests/kernels/moe
bnellnm c1909e7e8c
[Kernels] MoE refactor (#19636)
Signed-off-by: Bill Nell <bnell@redhat.com>
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Co-authored-by: ElizaWszola <ewszola@redhat.com>
2025-07-02 06:08:27 -07:00
..
__init__.py [Kernel] DeepEP dispatch-combine kernel integration (#18434) 2025-06-03 12:30:02 -07:00
parallel_utils.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_batched_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_block_fp8.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_block_int8.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_cutlass_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_deepep_deepgemm_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_deepep_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_deepgemm.py [Unit Test] Add unit test for deep gemm (#20090) 2025-06-30 10:26:42 -06:00
test_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_moe_align_block_size.py [Perf] Optimize `moe_align_block_size` CUDA kernel (#19572) 2025-06-17 11:49:26 -07:00
test_moe_permute_unpermute.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_nvfp4_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_pplx_cutlass_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_pplx_moe.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00
test_rocm_aiter_topk.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_silu_mul_fp8_quant_deep_gemm.py [EP+DP] Optimize the little operations in the DeepGEMM + DeepEP low latency case (#19885) 2025-06-23 11:07:47 -07:00
test_triton_moe_ptpc_fp8.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
utils.py [Kernels] MoE refactor (#19636) 2025-07-02 06:08:27 -07:00