async_engine | [Frontend] Make TIMEOUT_KEEP_ALIVE configurable through env var (#18472) | 2025-06-09 21:41:21 +00:00
basic_correctness | [MISC][Bugfix] Use less CPU when message queue has been empty for some time (#16226) | 2025-06-05 16:53:08 +00:00
benchmarks | [Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets (#18847) | 2025-06-12 10:30:56 -07:00
compile | Support embedding models in V1 (#16188) | 2025-06-18 21:36:33 -07:00
config | [Feature] specify model in config.yaml (#15798) | 2025-04-01 01:20:06 -07:00
core | [Bugfix][Core] Prefix caching causes incorrect outputs due to outdated ComputedBlocksTracker (#18957) | 2025-06-15 21:56:37 -07:00
detokenizer | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
distributed | feat: add data parallel rank to KVEventBatch (#18925) | 2025-06-03 17:14:20 -07:00
encoder_decoder | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
engine | Fix interaction between `Optional` and `Annotated` in CLI typing (#19093) | 2025-06-03 21:07:19 +00:00
entrypoints | Support embedding models in V1 (#16188) | 2025-06-18 21:36:33 -07:00
fastsafetensors_loader | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
kernels | [Perf] Optimize `moe_align_block_size` CUDA kernel (#19572) | 2025-06-17 11:49:26 -07:00
kv_transfer | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
lora | [CI] change spell checker from codespell to typos (#18711) | 2025-06-11 19:57:10 -07:00
metrics | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
mistral_tool_use | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
model_executor | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
models | Support embedding models in V1 (#16188) | 2025-06-18 21:36:33 -07:00
mq_llm_engine | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
multi_step | Adding "AMD: Multi-step Tests" to amdproduction. (#19508) | 2025-06-13 17:08:51 -07:00
multimodal | [Core] Do not copy array during hashing (#19484) | 2025-06-18 15:36:55 -07:00
neuron | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
plugins | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
plugins_tests | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
prefix_caching | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
prompt_adapter | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
prompts | [BugFix] Fix input positions for long context with sliding window (#2088) | 2023-12-13 12:28:13 -08:00
quantization | [Quantization] Remove FP4 emulation; Fall-back to marlin for device < 100 (#19563) | 2025-06-16 17:33:51 -04:00
reasoning | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
runai_model_streamer_test | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
samplers | [CI] change spell checker from codespell to typos (#18711) | 2025-06-11 19:57:10 -07:00
spec_decode | [Bugfix][2/n] Fix speculative decoding CI - Fix test_ngram_e2e_greedy_correctness (#19644) | 2025-06-14 21:15:41 -07:00
standalone_tests | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
system_messages | [V1] Implement Cascade Attention (#11635) | 2025-01-01 21:56:46 +09:00
tensorizer_loader | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
tokenization | Support embedding models in V1 (#16188) | 2025-06-18 21:36:33 -07:00
tool_use | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
tpu | [TPU] update torch_xla pin (#19231) | 2025-06-06 04:27:38 +00:00
tracing | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
v1 | [Bugfix] Add check_health to v1 async client. (#19821) | 2025-06-18 21:47:01 -07:00
vllm_test_utils | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
weight_loading | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
worker | [CI] change spell checker from codespell to typos (#18711) | 2025-06-11 19:57:10 -07:00
__init__.py | [Small] Formatter only checks lints in changed files (#1528) | 2023-10-31 15:39:38 -07:00
build_cython.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
conftest.py | Support embedding models in V1 (#16188) | 2025-06-18 21:36:33 -07:00
test_cache_block_hashing.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_config.py | [Misc] Fix skipped max-model-len validation when deriving max model length from tokenizer config (#19660) | 2025-06-16 06:30:29 +00:00
test_embedded_commit.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_inputs.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_logger.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_outputs.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_regression.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_sampling_params.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_scalartype.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_seed_behavior.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_sequence.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_sharded_state_loader.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_triton_utils.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_utils.py | Support non-string values in JSON keys from CLI (#19471) | 2025-06-11 09:34:04 +00:00
test_version.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
test_vllm_port.py | [Misc] Add SPDX-FileCopyrightText (#19100) | 2025-06-03 11:20:17 -07:00
utils.py | [CUDA] Enable full cudagraph for FlashMLA (#18581) | 2025-06-13 18:12:26 +00:00