vllm/tests at 55f1a468d97fbf9387e577e901b3f290ed8aa15b - vllm

Jinzhen Lin e73b7dfd69 [Bugfix] fix `an illegal memory access was encountered` of marlin kernel + act_order (#18245 )	2025-05-16 16:02:44 -07:00
..
async_engine	[V1] DP scale-out (2/N): Decouple engine process management and comms (#15977 )	2025-05-13 10:48:21 -07:00
basic_correctness	[Core][Feature] Input metadata dump on crash (#13407 )	2025-05-07 22:15:09 +00:00
benchmarks	Add `vllm bench [latency, throughput]` CLI commands (#16508 )	2025-04-14 23:10:35 -07:00
compile	[AMD][torch.compile] Enable silu+fp8_quant fusion for rocm (#18082 )	2025-05-13 22:13:56 -07:00
config	[Feature] specify model in config.yaml (#15798 )	2025-04-01 01:20:06 -07:00
core	[Core] [Bugfix] Add Input Embeddings (#15428 )	2025-05-02 01:06:39 -07:00
detokenizer	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
distributed	[Feature] Support Pipeline Parallism in torchrun SPMD offline inference for V1 (#17827 )	2025-05-15 22:28:27 -07:00
encoder_decoder	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
engine	Allow users to pass arbitrary JSON keys from CLI (#18208 )	2025-05-15 21:05:34 -07:00
entrypoints	add tools into TokenizeChatRequest (#18187 )	2025-05-15 04:01:49 -07:00
fastsafetensors_loader	[Core] Integrate `fastsafetensors` loader for loading model weights (#10647 )	2025-03-24 08:08:02 -07:00
kernels	[Bugfix] fix rotary embedding test for _get_padded_tensor_shape (#18229 )	2025-05-16 01:32:45 +00:00
kv_transfer	[CI] Actually run tests/kv_transfer/test_disagg.py in CI (#17555 )	2025-05-02 04:05:04 +00:00
lora	fix: typos (#18151 )	2025-05-15 02:16:15 -07:00
metrics	[V1][Spec Decode] Remove deprecated spec decode config params (#15466 )	2025-03-31 09:19:35 -07:00
mistral_tool_use	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
model_executor	fix: typos (#18151 )	2025-05-15 02:16:15 -07:00
models	[Misc] Consolidate Audio tests into multimodal common generation tests (#18214 )	2025-05-16 09:18:08 +00:00
mq_llm_engine	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
multi_step	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
multimodal	Support custom implementations of VideoLoader backends. (#18091 )	2025-05-15 13:26:49 +08:00
neuron	Make key optional for rotary embedding (#17566 )	2025-05-07 00:11:46 -07:00
plugins	[Lora][Frontend]Add default local directory LoRA resolver plugin. (#16855 )	2025-05-12 10:39:10 -07:00
plugins_tests	[V1] Scheduler Refactoring [1/N] - Add Scheduler Interface (#15250 )	2025-03-20 17:50:43 -07:00
prefix_caching	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
prompt_adapter	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00
prompts	[BugFix] Fix input positions for long context with sliding window (#2088 )	2023-12-13 12:28:13 -08:00
quantization	Add support for loading torchao models with `AOPerModuleConfig` (#17826 )	2025-05-14 16:24:59 -07:00
reasoning	[Bugfix] add qwen3 reasoning-parser fix content is None when disable … (#17369 )	2025-04-29 16:32:40 +00:00
runai_model_streamer_test	[Misc] Split model loader (#17712 )	2025-05-07 12:42:26 +08:00
samplers	[Sampler] Adapt to FlashInfer 0.2.3 sampler API (#15777 )	2025-05-16 15:14:03 -07:00
spec_decode	[CI] Disable Failing Tests (#18165 )	2025-05-14 13:49:56 -07:00
standalone_tests	[Build] Make sure local main branch is synced when VLLM_USE_PRECOMPILED=1 (#13921 )	2025-03-03 16:43:14 +08:00
system_messages	[V1] Implement Cascade Attention (#11635 )	2025-01-01 21:56:46 +09:00
tensorizer_loader	[CI/Build] Automatically retry flaky tests (#17856 )	2025-05-09 09:55:17 -06:00
tokenization	Add full API docs and improve the UX of navigating them (#17485 )	2025-05-03 19:42:43 -07:00
tool_use	Add chat template for Llama 4 models (#16428 )	2025-04-24 20:19:36 +00:00
tpu	[Hardware][TPU][V1] Multi-LoRA implementation for the V1 TPU backend (#14238 )	2025-05-07 16:28:47 -04:00
tracing	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
v1	[Sampler] Adapt to FlashInfer 0.2.3 sampler API (#15777 )	2025-05-16 15:14:03 -07:00
vllm_test_utils	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
weight_loading	[Bugfix] fix `an illegal memory access was encountered` of marlin kernel + act_order (#18245 )	2025-05-16 16:02:44 -07:00
worker	[Core] Gate `prompt_embeds` behind a feature flag (#17607 )	2025-05-04 00:19:20 +08:00
__init__.py	[Small] Formatter only checks lints in changed files (#1528 )	2023-10-31 15:39:38 -07:00
build_cython.py	[Build] Cython compilation support fix (#14296 )	2025-03-24 23:37:54 +00:00
conftest.py	[Model] Broadcast Ovis2 implementation to fit Ovis1.6 (#17861 )	2025-05-11 17:56:30 -07:00
test_cache_block_hashing.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_config.py	Add `pt_load_map_location` to allow loading to cuda (#16869 )	2025-05-01 23:23:42 -07:00
test_embedded_commit.py	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00
test_inputs.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_logger.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_logits_processor.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_regression.py	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
test_sampling_params.py	[Bugfix][Frontend] respect provided default guided decoding backend (#15476 )	2025-04-09 05:11:10 -07:00
test_scalartype.py	[Misc] Fix ScalarType float4 naming (#17690 )	2025-05-06 01:07:15 -07:00
test_seed_behavior.py	[Bugfix] fix flaky test (#13089 )	2025-02-11 14:41:20 +00:00
test_sequence.py	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00
test_sharded_state_loader.py	[Misc] Split model loader (#17712 )	2025-05-07 12:42:26 +08:00
test_triton_utils.py	[Bugfix] Fix triton import with local TritonPlaceholder (#17446 )	2025-05-06 17:53:09 +08:00
test_utils.py	Allow users to pass arbitrary JSON keys from CLI (#18208 )	2025-05-15 21:05:34 -07:00
test_version.py	[Metrics] Add `--show-hidden-metrics-for-version` CLI arg (#13295 )	2025-02-22 00:20:45 -08:00
test_vllm_port.py	Throw better error for when running into k8s service discovery issue (#18209 )	2025-05-15 21:07:28 -07:00
utils.py	[Misc] Split model loader (#17712 )	2025-05-07 12:42:26 +08:00