vllm/tests
Harry Mellor 0fa939e2d1
Improve configs - `LoRAConfig` + `PromptAdapterConfig` (#16980)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-24 10:29:34 -07:00
async_engine [V1] V1 Enablement Oracle (#13726) 2025-03-14 22:02:20 -07:00
basic_correctness [core] Add tags parameter to wake_up() (#15500) 2025-04-02 01:59:27 -07:00
benchmarks Add `vllm bench [latency, throughput]` CLI commands (#16508) 2025-04-14 23:10:35 -07:00
compile [BugFix] Fix fusion test and add them to CI (#16287) 2025-04-08 23:46:45 -07:00
config [Feature] specify model in config.yaml (#15798) 2025-04-01 01:20:06 -07:00
core [Kernel][ROCM] Upstream prefix prefill speed up for vLLM V1 (#13305) 2025-04-22 19:11:56 -07:00
detokenizer [V1] V1 Enablement Oracle (#13726) 2025-03-14 22:02:20 -07:00
distributed [Distributed] Add custom allreduce support for ROCM (#14125) 2025-03-31 22:49:12 -07:00
encoder_decoder [V1] V1 Enablement Oracle (#13726) 2025-03-14 22:02:20 -07:00
engine Fix `nullable_kvs` fallback (#16837) 2025-04-18 05:58:39 -07:00
entrypoints [Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970) 2025-04-24 07:06:28 -07:00
fastsafetensors_loader [Core] Integrate `fastsafetensors` loader for loading model weights (#10647) 2025-03-24 08:08:02 -07:00
kernels Add missing rocm_skinny_gemms kernel test to CI (#17060) 2025-04-24 07:49:37 -07:00
kv_transfer [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
lora Improve configs - `LoRAConfig` + `PromptAdapterConfig` (#16980) 2025-04-24 10:29:34 -07:00
metrics [V1][Spec Decode] Remove deprecated spec decode config params (#15466) 2025-03-31 09:19:35 -07:00
mistral_tool_use Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
model_executor [FEAT][ROCm] Integrate Fused MoE Kernels from AITER (#14967) 2025-03-26 16:30:30 +08:00
models [Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970) 2025-04-24 07:06:28 -07:00
mq_llm_engine [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
multi_step [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
multimodal [V1] Enable multi-input by default (#15799) 2025-04-12 08:52:39 +00:00
neuron [Neuron][kernel] Fuse kv cache into a single tensor (#15911) 2025-04-03 09:51:32 -07:00
plugins Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
plugins_tests [V1] Scheduler Refactoring [1/N] - Add Scheduler Interface (#15250) 2025-03-20 17:50:43 -07:00
prefix_caching [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
prompt_adapter [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
prompts
quantization [Misc] Update `compressed-tensors` WNA16 to support zero-points (#14211) 2025-04-15 07:33:51 -06:00
reasoning [Bugfix][Frontend] Eliminate regex based check in reasoning full generator (#14821) 2025-03-28 11:20:35 +00:00
runai_model_streamer_test [1/n][CI] Load models in CI from S3 instead of HF (#13205) 2025-02-19 07:34:59 +00:00
samplers [Core][Bugfix] Fix Offline MM Beam Search (#16390) 2025-04-15 10:33:02 +08:00
spec_decode [Chore] Remove Sampler from Model Code (#17084) 2025-04-24 02:49:33 -07:00
standalone_tests [Build] Make sure local main branch is synced when VLLM_USE_PRECOMPILED=1 (#13921) 2025-03-03 16:43:14 +08:00
system_messages [V1] Implement Cascade Attention (#11635) 2025-01-01 21:56:46 +09:00
tensorizer_loader [Core] Update dtype detection and defaults (#14858) 2025-03-19 13:49:33 +08:00
tokenization Simplify `TokenizerGroup` (#16790) 2025-04-24 04:43:56 -07:00
tool_use [Frontend] Added chat templates for LLaMa4 pythonic tool calling (#16463) 2025-04-12 06:26:17 +08:00
tpu [TPU][V1] Refine tpu_model_runner to mitigate future recompilation issues (#16275) 2025-04-09 18:51:51 -06:00
tracing [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
v1 [V1][Spec Decoding] Add num_drafts and num_accepted_tokens_per_position metrics (#16665) 2025-04-24 08:57:40 -07:00
vllm_test_utils Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
weight_loading [V1] V1 Enablement Oracle (#13726) 2025-03-14 22:02:20 -07:00
worker [V1] V1 Enablement Oracle (#13726) 2025-03-14 22:02:20 -07:00
__init__.py
build_cython.py [Build] Cython compilation support fix (#14296) 2025-03-24 23:37:54 +00:00
conftest.py Simplify `TokenizerGroup` (#16790) 2025-04-24 04:43:56 -07:00
test_cache_block_hashing.py Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
test_config.py Improve configs - `TokenizerPoolConfig` + `DeviceConfig` (#16603) 2025-04-17 11:19:42 +00:00
test_embedded_commit.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
test_inputs.py Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
test_logger.py Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
test_logits_processor.py Update deprecated Python 3.8 typing (#13971) 2025-03-02 17:34:51 -08:00
test_regression.py [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
test_sampling_params.py [Bugfix][Frontend] respect provided default guided decoding backend (#15476) 2025-04-09 05:11:10 -07:00
test_scalartype.py [NVIDIA] Support nvfp4 quantization (#12784) 2025-02-12 19:51:51 -08:00
test_seed_behavior.py [Bugfix] fix flaky test (#13089) 2025-02-11 14:41:20 +00:00
test_sequence.py [Misc] Add SPDX-License-Identifier headers to python source files (#12628) 2025-02-02 11:58:18 -08:00
test_sharded_state_loader.py [Misc] Fix test_sharded_state_loader.py(#16004) (#16005) 2025-04-09 14:47:30 +08:00
test_utils.py [Bugfix] Multi-modal caches not acting like LRU caches (#16593) 2025-04-14 09:24:16 -07:00
test_version.py [Metrics] Add `--show-hidden-metrics-for-version` CLI arg (#13295) 2025-02-22 00:20:45 -08:00
utils.py [Metrics] Hide deprecated metrics (#15458) 2025-04-02 07:37:19 -07:00