async_engine | [V1] V1 Enablement Oracle (#13726) | 2025-03-14 22:02:20 -07:00
basic_correctness | [core] Add tags parameter to wake_up() (#15500) | 2025-04-02 01:59:27 -07:00
benchmarks | Add `vllm bench [latency, throughput]` CLI commands (#16508) | 2025-04-14 23:10:35 -07:00
compile | [BugFix] Fix fusion test and add them to CI (#16287) | 2025-04-08 23:46:45 -07:00
config | [Feature] specify model in config.yaml (#15798) | 2025-04-01 01:20:06 -07:00
core | [Kernel][ROCM] Upstream prefix prefill speed up for vLLM V1 (#13305) | 2025-04-22 19:11:56 -07:00
detokenizer | [V1] V1 Enablement Oracle (#13726) | 2025-03-14 22:02:20 -07:00
distributed | [Distributed] Add custom allreduce support for ROCM (#14125) | 2025-03-31 22:49:12 -07:00
encoder_decoder | [V1] V1 Enablement Oracle (#13726) | 2025-03-14 22:02:20 -07:00
engine | Fix `nullable_kvs` fallback (#16837) | 2025-04-18 05:58:39 -07:00
entrypoints | [Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970) | 2025-04-24 07:06:28 -07:00
fastsafetensors_loader | [Core] Integrate `fastsafetensors` loader for loading model weights (#10647) | 2025-03-24 08:08:02 -07:00
kernels | Add missing rocm_skinny_gemms kernel test to CI (#17060) | 2025-04-24 07:49:37 -07:00
kv_transfer | [Misc] Replace os environ to monkeypatch in test suite (#14516) | 2025-03-16 20:35:57 -07:00
lora | Improve configs - `LoRAConfig` + `PromptAdapterConfig` (#16980) | 2025-04-24 10:29:34 -07:00
metrics | [V1][Spec Decode] Remove deprecated spec decode config params (#15466) | 2025-03-31 09:19:35 -07:00
mistral_tool_use | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
model_executor | [FEAT][ROCm] Integrate Fused MoE Kernels from AITER (#14967) | 2025-03-26 16:30:30 +08:00
models | [Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970) | 2025-04-24 07:06:28 -07:00
mq_llm_engine | [Misc] Replace os environ to monkeypatch in test suite (#14516) | 2025-03-16 20:35:57 -07:00
multi_step | [Misc] Replace os environ to monkeypatch in test suite (#14516) | 2025-03-16 20:35:57 -07:00
multimodal | [V1] Enable multi-input by default (#15799) | 2025-04-12 08:52:39 +00:00
neuron | [Neuron][kernel] Fuse kv cache into a single tensor (#15911) | 2025-04-03 09:51:32 -07:00
plugins | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
plugins_tests | [V1] Scheduler Refactoring [1/N] - Add Scheduler Interface (#15250) | 2025-03-20 17:50:43 -07:00
prefix_caching | [Misc] Replace os environ to monkeypatch in test suite (#14516) | 2025-03-16 20:35:57 -07:00
prompt_adapter | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00
prompts | … |
quantization | [Misc] Update `compressed-tensors` WNA16 to support zero-points (#14211) | 2025-04-15 07:33:51 -06:00
reasoning | [Bugfix][Frontend] Eliminate regex based check in reasoning full generator (#14821) | 2025-03-28 11:20:35 +00:00
runai_model_streamer_test | [1/n][CI] Load models in CI from S3 instead of HF (#13205) | 2025-02-19 07:34:59 +00:00
samplers | [Core][Bugfix] Fix Offline MM Beam Search (#16390) | 2025-04-15 10:33:02 +08:00
spec_decode | [Chore] Remove Sampler from Model Code (#17084) | 2025-04-24 02:49:33 -07:00
standalone_tests | [Build] Make sure local main branch is synced when VLLM_USE_PRECOMPILED=1 (#13921) | 2025-03-03 16:43:14 +08:00
system_messages | [V1] Implement Cascade Attention (#11635) | 2025-01-01 21:56:46 +09:00
tensorizer_loader | [Core] Update dtype detection and defaults (#14858) | 2025-03-19 13:49:33 +08:00
tokenization | Simplify `TokenizerGroup` (#16790) | 2025-04-24 04:43:56 -07:00
tool_use | [Frontend] Added chat templates for LLaMa4 pythonic tool calling (#16463) | 2025-04-12 06:26:17 +08:00
tpu | [TPU][V1] Refine tpu_model_runner to mitigate future recompilation issues (#16275) | 2025-04-09 18:51:51 -06:00
tracing | [Misc] Replace os environ to monkeypatch in test suite (#14516) | 2025-03-16 20:35:57 -07:00
v1 | [V1][Spec Decoding] Add num_drafts and num_accepted_tokens_per_position metrics (#16665) | 2025-04-24 08:57:40 -07:00
vllm_test_utils | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
weight_loading | [V1] V1 Enablement Oracle (#13726) | 2025-03-14 22:02:20 -07:00
worker | [V1] V1 Enablement Oracle (#13726) | 2025-03-14 22:02:20 -07:00
__init__.py | … |
build_cython.py | [Build] Cython compilation support fix (#14296) | 2025-03-24 23:37:54 +00:00
conftest.py | Simplify `TokenizerGroup` (#16790) | 2025-04-24 04:43:56 -07:00
test_cache_block_hashing.py | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
test_config.py | Improve configs - `TokenizerPoolConfig` + `DeviceConfig` (#16603) | 2025-04-17 11:19:42 +00:00
test_embedded_commit.py | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00
test_inputs.py | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
test_logger.py | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
test_logits_processor.py | Update deprecated Python 3.8 typing (#13971) | 2025-03-02 17:34:51 -08:00
test_regression.py | [Misc] Replace os environ to monkeypatch in test suite (#14516) | 2025-03-16 20:35:57 -07:00
test_sampling_params.py | [Bugfix][Frontend] respect provided default guided decoding backend (#15476) | 2025-04-09 05:11:10 -07:00
test_scalartype.py | [NVIDIA] Support nvfp4 quantization (#12784) | 2025-02-12 19:51:51 -08:00
test_seed_behavior.py | [Bugfix] fix flaky test (#13089) | 2025-02-11 14:41:20 +00:00
test_sequence.py | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00
test_sharded_state_loader.py | [Misc] Fix test_sharded_state_loader.py(#16004) (#16005) | 2025-04-09 14:47:30 +08:00
test_utils.py | [Bugfix] Multi-modal caches not acting like LRU caches (#16593) | 2025-04-14 09:24:16 -07:00
test_version.py | [Metrics] Add `--show-hidden-metrics-for-version` CLI arg (#13295) | 2025-02-22 00:20:45 -08:00
utils.py | [Metrics] Hide deprecated metrics (#15458) | 2025-04-02 07:37:19 -07:00