vllm/compile at add-utils - vllm

Boyuan Feng c01d1c5aba use .dev for version comparison with pytorch nightly release (#20031) Signed-off-by: Boyuan Feng <boyuan@meta.com>		2025-06-24 21:52:16 +00:00
piecewise	[Feature][ROCm] Add full graph capture support for TritonAttentionBackend (#19158)	2025-06-17 17:03:06 -04:00
__init__.py	[torch.compile] register allreduce operations as custom ops (#8526)	2024-09-16 22:57:57 -07:00
backend.py	[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756)	2025-06-12 08:31:04 -07:00
test_async_tp.py	[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756)	2025-06-12 08:31:04 -07:00
test_basic_correctness.py	Support embedding models in V1 (#16188)	2025-06-18 21:36:33 -07:00
test_config.py	use .dev for version comparison with pytorch nightly release (#20031)	2025-06-24 21:52:16 +00:00
test_full_graph.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_functionalization.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_fusion.py	[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756)	2025-06-12 08:31:04 -07:00
test_fusion_attn.py	[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756)	2025-06-12 08:31:04 -07:00
test_pass_manager.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_sequence_parallelism.py	[Feature] Support sequence parallelism for static fp8 quantization (#19181)	2025-06-23 16:09:02 -04:00
test_silu_mul_quant_fusion.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00
test_wrapper.py	[Misc] Add SPDX-FileCopyrightText (#19100)	2025-06-03 11:20:17 -07:00