vllm/cmake/external_projects
yexin(叶鑫) b22980a1dc
[Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457)
Signed-off-by: cynthieye <yexin93@qq.com>
Co-authored-by: MagnetoWang <magnetowang@outlook.com>
2025-04-25 14:52:28 +08:00
flashmla.cmake | [Kernel] FlashMLA integration (#13747) | 2025-02-27 10:35:08 +08:00
vllm_flash_attn.cmake | [Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457) | 2025-04-25 14:52:28 +08:00