vllm/offline_inference at 6d0df0ebebd4e347e1ebcdea4be010a4b54b901b - vllm

Cyrus Leung 205d84aaa9 [VLM] Clean up models (#16873) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-19 12:13:06 +00:00
basic	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
disaggregated-prefill-v1	[P/D][V1] KV Connector API V1 (#15960)	2025-04-17 13:22:40 -07:00
openai	[CI/Build] Auto-fix Markdown files (#12941)	2025-02-08 04:25:15 -08:00
profiling_tpu	Update deprecated Python 3.8 typing (#13971)	2025-03-02 17:34:51 -08:00
qwen2_5_omni	[Model][VLM] Add Qwen2.5-Omni model support (thinker only) (#15130)	2025-04-18 23:14:36 -07:00
audio_language.py	[Model] Refactor Phi-4-multimodal to use merged processor and support V1 (#15477)	2025-04-19 02:26:11 -07:00
batch_llm_inference.py	[Ray] Improve documentation on batch inference (#16609)	2025-04-16 22:19:26 -07:00
chat_with_tools.py	[Misc] Add SPDX-License-Identifier headers to python source files (#12628)	2025-02-02 11:58:18 -08:00
cpu_offload_lmcache.py	[Misc] Update outdated note: LMCache now supports chunked prefill (#16697)	2025-04-18 05:12:42 +00:00
data_parallel.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
disaggregated_prefill.py	[Misc] refactor examples (#16563)	2025-04-14 09:59:15 +00:00
disaggregated_prefill_lmcache.py	[Misc] refactor examples (#16563)	2025-04-14 09:59:15 +00:00
eagle.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
embed_jina_embeddings_v3.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
embed_matryoshka_fy.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
encoder_decoder.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
encoder_decoder_multimodal.py	[Bugfix] Update Florence-2 tokenizer to make grounding tasks work (#16734)	2025-04-17 04:17:39 +00:00
llm_engine_example.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
load_sharded_state.py	[Bugfix][V1] Fix bug from putting llm_engine.model_executor in a background process (#15367)	2025-04-03 07:32:10 +00:00
lora_with_quantization_inference.py	[Misc] Clean up the BitsAndBytes arguments (#15140)	2025-03-20 19:17:12 -07:00
mistral-small.py	[VLM] Clean up models (#16873)	2025-04-19 12:13:06 +00:00
mlpspeculator.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
multilora_inference.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
neuron.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
neuron_int8_quantization.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
prefix_caching.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
prithvi_geospatial_mae.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
profiling.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
reproduciblity.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
rlhf.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
rlhf_colocate.py	[RLHF] use worker_extension_cls for compatibility with V0 and V1 (#14185)	2025-03-07 00:32:46 +08:00
rlhf_utils.py	[RLHF] use worker_extension_cls for compatibility with V0 and V1 (#14185)	2025-03-07 00:32:46 +08:00
save_sharded_state.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
simple_profiling.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
structured_outputs.py	[Misc] refactor Structured Outputs example (#16322)	2025-04-09 23:32:42 +00:00
torchrun_example.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
tpu.py	[Misc] format and refactor some examples (#16252)	2025-04-08 10:42:32 +00:00
vision_language.py	[VLM] Clean up models (#16873)	2025-04-19 12:13:06 +00:00
vision_language_embedding.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
vision_language_multi_image.py	[Model] Refactor Phi-4-multimodal to use merged processor and support V1 (#15477)	2025-04-19 02:26:11 -07:00