vllm/external_projects at 55f1a468d97fbf9387e577e901b3f290ed8aa15b - vllm

mirror of https://github.com/vllm-project/vllm.git

History

yexin(叶鑫) b22980a1dc [Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457 ) Signed-off-by: cynthieye <yexin93@qq.com> Co-authored-by: MagnetoWang <magnetowang@outlook.com>		2025-04-25 14:52:28 +08:00
..
flashmla.cmake	[Kernel] FlashMLA integration (#13747 )	2025-02-27 10:35:08 +08:00
vllm_flash_attn.cmake	[Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457 )	2025-04-25 14:52:28 +08:00