vllm/moe at fix-precommit - vllm

Wentao Ye ffb2cd6b54 [Perf] Optimize `moe_align_block_size` CUDA kernel (#19572) Signed-off-by: yewentao256 <zhyanwentao@126.com> Co-authored-by: mgoin <mgoin64@gmail.com>	2025-06-17 11:49:26 -07:00
__init__.py	[Kernel] DeepEP dispatch-combine kernel integration (#18434)	2025-06-03 12:30:02 -07:00
deepep_utils.py	[Kernel] Integrate batched/masked deepgemm kernel (#19111)	2025-06-04 21:59:18 +00:00
test_batched_moe.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_cutlass_moe.py	[Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168)	2025-06-11 12:53:10 -04:00
test_deepep_deepgemm_moe.py	[Kernel] Integrate batched/masked deepgemm kernel (#19111)	2025-06-04 21:59:18 +00:00
test_deepep_moe.py	[Kernel] DeepEP dispatch-combine kernel integration (#18434)	2025-06-03 12:30:02 -07:00
test_moe.py	[Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168)	2025-06-11 12:53:10 -04:00
test_moe_align_block_size.py	[Perf] Optimize `moe_align_block_size` CUDA kernel (#19572)	2025-06-17 11:49:26 -07:00
test_moe_permute_unpermute.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_nvfp4_moe.py	[Hardware][NVIDIA] FP4 MoE kernel optimization (#19110)	2025-06-05 09:48:26 -07:00
test_pplx_cutlass_moe.py	[Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168)	2025-06-11 12:53:10 -04:00
test_pplx_moe.py	[Kernels] Add activation chunking logic to FusedMoEModularKernel (#19168)	2025-06-11 12:53:10 -04:00
test_rocm_aiter_topk.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_triton_moe_ptpc_fp8.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00