vllm/tests
Richard Zou ed33349738
[BugFix] Fix use_cudagraph=False (#19612)
Signed-off-by: Richard Zou <zou3519@gmail.com>
2025-06-19 08:23:12 +08:00
..
async_engine [Frontend] Make TIMEOUT_KEEP_ALIVE configurable through env var (#18472) 2025-06-09 21:41:21 +00:00
basic_correctness [MISC][Bugfix] Use less CPU when message queue has been empty for some time (#16226) 2025-06-05 16:53:08 +00:00
benchmarks [Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets (#18847) 2025-06-12 10:30:56 -07:00
compile [BugFix] Fix use_cudagraph=False (#19612) 2025-06-19 08:23:12 +08:00
config [Feature] specify model in config.yaml (#15798) 2025-04-01 01:20:06 -07:00
core [Bugfix][Core] Prefix caching causes incorrect outputs due to outdated ComputedBlocksTracker (#18957) 2025-06-15 21:56:37 -07:00
detokenizer [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
distributed feat: add data parallel rank to KVEventBatch (#18925) 2025-06-03 17:14:20 -07:00
encoder_decoder [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
engine Fix interaction between `Optional` and `Annotated` in CLI typing (#19093) 2025-06-03 21:07:19 +00:00
entrypoints [Frontend] add chunking audio for > 30s audio (#19597) 2025-06-17 11:34:00 +08:00
fastsafetensors_loader [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
kernels [Perf] Optimize `moe_align_block_size` CUDA kernel (#19572) 2025-06-17 11:49:26 -07:00
kv_transfer [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
lora [CI] change spell checker from codespell to typos (#18711) 2025-06-11 19:57:10 -07:00
metrics [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
mistral_tool_use [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
model_executor [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
models [v1] Support mamba2 (#19327) 2025-06-18 20:34:15 +00:00
mq_llm_engine [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
multi_step Adding "AMD: Multi-step Tests" to amdproduction. (#19508) 2025-06-13 17:08:51 -07:00
multimodal [Core] Do not copy array during hashing (#19484) 2025-06-18 15:36:55 -07:00
neuron [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
plugins [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
plugins_tests [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
prefix_caching [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
prompt_adapter [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
prompts [BugFix] Fix input positions for long context with sliding window (#2088) 2023-12-13 12:28:13 -08:00
quantization [Quantization] Remove FP4 emulation; Fall-back to marlin for device < 100 (#19563) 2025-06-16 17:33:51 -04:00
reasoning [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
runai_model_streamer_test [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
samplers [CI] change spell checker from codespell to typos (#18711) 2025-06-11 19:57:10 -07:00
spec_decode [Bugfix][2/n] Fix speculative decoding CI - Fix test_ngram_e2e_greedy_correctness (#19644) 2025-06-14 21:15:41 -07:00
standalone_tests [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
system_messages [V1] Implement Cascade Attention (#11635) 2025-01-01 21:56:46 +09:00
tensorizer_loader [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
tokenization Allow AsyncLLMEngine.generate to target a specific DP rank (#19102) 2025-06-04 08:26:47 -07:00
tool_use [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
tpu [TPU] update torch_xla pin (#19231) 2025-06-06 04:27:38 +00:00
tracing [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
v1 [v1] Support mamba2 (#19327) 2025-06-18 20:34:15 +00:00
vllm_test_utils [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
weight_loading [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
worker [CI] change spell checker from codespell to typos (#18711) 2025-06-11 19:57:10 -07:00
__init__.py [Small] Formatter only checks lints in changed files (#1528) 2023-10-31 15:39:38 -07:00
build_cython.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
conftest.py [CI] Add mteb testing for rerank models (#19344) 2025-06-16 01:36:43 -07:00
test_cache_block_hashing.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_config.py [Misc] Fix skipped max-model-len validation when deriving max model length from tokenizer config (#19660) 2025-06-16 06:30:29 +00:00
test_embedded_commit.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_inputs.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_logger.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_outputs.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_regression.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_sampling_params.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_scalartype.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_seed_behavior.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_sequence.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_sharded_state_loader.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_triton_utils.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_utils.py Support non-string values in JSON keys from CLI (#19471) 2025-06-11 09:34:04 +00:00
test_version.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
test_vllm_port.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
utils.py [CUDA] Enable full cudagraph for FlashMLA (#18581) 2025-06-13 18:12:26 +00:00