vllm/tests/kernels
Kaixi Hou 41aa578428
[NVIDIA] Add Cutlass MLA backend (#17625)
2025-06-03 21:40:26 -07:00
attention | [CPU] V1 support for the CPU backend (#16441) | 2025-06-03 18:43:01 -07:00
core | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
mamba | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
moe | [Kernel] DeepEP dispatch-combine kernel integration (#18434) | 2025-06-03 12:30:02 -07:00
quantization | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
__init__.py | [CI/Build] Move `test_utils.py` to `tests/utils.py` (#4425) | 2024-05-13 23:50:09 +09:00
allclose_default.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
quant_utils.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_apply_repetition_penalties.py | [KERNEL] Sampler. CUDA kernel for applying repetition penalty (#18437) | 2025-06-03 21:13:01 -07:00
test_cutlass_mla_decode.py | [NVIDIA] Add Cutlass MLA backend (#17625) | 2025-06-03 21:40:26 -07:00
test_fused_quant_activation.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_triton_flash_attention.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
utils.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00