vllm/tests at 6d0df0ebebd4e347e1ebcdea4be010a4b54b901b - vllm

Harry Mellor 0fa939e2d1 Improve configs - `LoRAConfig` + `PromptAdapterConfig` (#16980 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-24 10:29:34 -07:00
..
async_engine	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
basic_correctness	[core] Add tags parameter to wake_up() (#15500 )	2025-04-02 01:59:27 -07:00
benchmarks	Add `vllm bench [latency, throughput]` CLI commands (#16508 )	2025-04-14 23:10:35 -07:00
compile	[BugFix] Fix fusion test and add them to CI (#16287 )	2025-04-08 23:46:45 -07:00
config	[Feature] specify model in config.yaml (#15798 )	2025-04-01 01:20:06 -07:00
core	[Kernel][ROCM] Upstream prefix prefill speed up for vLLM V1 (#13305 )	2025-04-22 19:11:56 -07:00
detokenizer	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
distributed	[Distributed] Add custom allreduce support for ROCM (#14125 )	2025-03-31 22:49:12 -07:00
encoder_decoder	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
engine	Fix `nullable_kvs` fallback (#16837 )	2025-04-18 05:58:39 -07:00
entrypoints	[Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970 )	2025-04-24 07:06:28 -07:00
fastsafetensors_loader	[Core] Integrate `fastsafetensors` loader for loading model weights (#10647 )	2025-03-24 08:08:02 -07:00
kernels	Add missing rocm_skinny_gemms kernel test to CI (#17060 )	2025-04-24 07:49:37 -07:00
kv_transfer	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
lora	Improve configs - `LoRAConfig` + `PromptAdapterConfig` (#16980 )	2025-04-24 10:29:34 -07:00
metrics	[V1][Spec Decode] Remove deprecated spec decode config params (#15466 )	2025-03-31 09:19:35 -07:00
mistral_tool_use	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
model_executor	[FEAT][ROCm] Integrate Fused MoE Kernels from AITER (#14967 )	2025-03-26 16:30:30 +08:00
models	[Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970 )	2025-04-24 07:06:28 -07:00
mq_llm_engine	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
multi_step	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
multimodal	[V1] Enable multi-input by default (#15799 )	2025-04-12 08:52:39 +00:00
neuron	[Neuron][kernel] Fuse kv cache into a single tensor (#15911 )	2025-04-03 09:51:32 -07:00
plugins	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
plugins_tests	[V1] Scheduler Refactoring [1/N] - Add Scheduler Interface (#15250 )	2025-03-20 17:50:43 -07:00
prefix_caching	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
prompt_adapter	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00
prompts	…
quantization	[Misc] Update `compressed-tensors` WNA16 to support zero-points (#14211 )	2025-04-15 07:33:51 -06:00
reasoning	[Bugfix][Frontend] Eliminate regex based check in reasoning full generator (#14821 )	2025-03-28 11:20:35 +00:00
runai_model_streamer_test	[1/n][CI] Load models in CI from S3 instead of HF (#13205 )	2025-02-19 07:34:59 +00:00
samplers	[Core][Bugfix] Fix Offline MM Beam Search (#16390 )	2025-04-15 10:33:02 +08:00
spec_decode	[Chore] Remove Sampler from Model Code (#17084 )	2025-04-24 02:49:33 -07:00
standalone_tests	[Build] Make sure local main branch is synced when VLLM_USE_PRECOMPILED=1 (#13921 )	2025-03-03 16:43:14 +08:00
system_messages	[V1] Implement Cascade Attention (#11635 )	2025-01-01 21:56:46 +09:00
tensorizer_loader	[Core] Update dtype detection and defaults (#14858 )	2025-03-19 13:49:33 +08:00
tokenization	Simplify `TokenizerGroup` (#16790 )	2025-04-24 04:43:56 -07:00
tool_use	[Frontend] Added chat templates for LLaMa4 pythonic tool calling (#16463 )	2025-04-12 06:26:17 +08:00
tpu	[TPU][V1] Refine tpu_model_runner to mitigate future recompilation issues (#16275 )	2025-04-09 18:51:51 -06:00
tracing	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
v1	[V1][Spec Decoding] Add num_drafts and num_accepted_tokens_per_position metrics (#16665 )	2025-04-24 08:57:40 -07:00
vllm_test_utils	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
weight_loading	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
worker	[V1] V1 Enablement Oracle (#13726 )	2025-03-14 22:02:20 -07:00
__init__.py	…
build_cython.py	[Build] Cython compilation support fix (#14296 )	2025-03-24 23:37:54 +00:00
conftest.py	Simplify `TokenizerGroup` (#16790 )	2025-04-24 04:43:56 -07:00
test_cache_block_hashing.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_config.py	Improve configs - `TokenizerPoolConfig` + `DeviceConfig` (#16603 )	2025-04-17 11:19:42 +00:00
test_embedded_commit.py	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00
test_inputs.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_logger.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_logits_processor.py	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
test_regression.py	[Misc] Replace os environ to monkeypatch in test suite (#14516 )	2025-03-16 20:35:57 -07:00
test_sampling_params.py	[Bugfix][Frontend] respect provided default guided decoding backend (#15476 )	2025-04-09 05:11:10 -07:00
test_scalartype.py	[NVIDIA] Support nvfp4 quantization (#12784 )	2025-02-12 19:51:51 -08:00
test_seed_behavior.py	[Bugfix] fix flaky test (#13089 )	2025-02-11 14:41:20 +00:00
test_sequence.py	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )	2025-02-02 11:58:18 -08:00
test_sharded_state_loader.py	[Misc] Fix test_sharded_state_loader.py(#16004 ) (#16005 )	2025-04-09 14:47:30 +08:00
test_utils.py	[Bugfix] Multi-modal caches not acting like LRU caches (#16593 )	2025-04-14 09:24:16 -07:00
test_version.py	[Metrics] Add `--show-hidden-metrics-for-version` CLI arg (#13295 )	2025-02-22 00:20:45 -08:00
utils.py	[Metrics] Hide deprecated metrics (#15458 )	2025-04-02 07:37:19 -07:00