vllm/examples/offline_inference
Jee Jee Li 1caca5a589
[Misc] Add SPDX-FileCopyrightText (#20428)
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
2025-07-04 07:40:42 +00:00
..
basic Support embedding models in V1 (#16188) 2025-06-18 21:36:33 -07:00
disaggregated-prefill-v1 [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
openai_batch [Docs] Fix syntax highlighting of shell commands (#19870) 2025-06-23 17:59:09 +00:00
profiling_tpu [Misc] small update (#20462) 2025-07-03 20:33:44 -07:00
qwen2_5_omni [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
audio_language.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
automatic_prefix_caching.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
batch_llm_inference.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
chat_with_tools.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
context_extension.py [Misc] refactor context extension (#19246) 2025-06-07 05:13:21 +00:00
data_parallel.py fix ci issue distributed 4 gpu test (#20204) 2025-06-27 22:50:00 -07:00
disaggregated_prefill.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
embed_jina_embeddings_v3.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
embed_matryoshka_fy.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
encoder_decoder.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
encoder_decoder_multimodal.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
llm_engine_example.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
load_sharded_state.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
lora_with_quantization_inference.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
metrics.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
mistral-small.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
mlpspeculator.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
multilora_inference.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
neuron.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
neuron_eagle.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
neuron_int8_quantization.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
neuron_multimodal.py [Misc] refactor neuron_multimodal and profiling (#19397) 2025-06-10 06:12:42 +00:00
neuron_speculation.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
prefix_caching.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
prithvi_geospatial_mae.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
profiling.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
prompt_embed_inference.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
qwen3_reranker.py refactor example - qwen3_reranker (#19847) 2025-06-24 14:03:20 +00:00
qwen_1m.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
reproducibility.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
rlhf.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
rlhf_colocate.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
rlhf_utils.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
save_sharded_state.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
simple_profiling.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
spec_decode.py [Misc] Add SPDX-FileCopyrightText (#20428) 2025-07-04 07:40:42 +00:00
structured_outputs.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
torchrun_example.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
tpu.py [Misc] Add SPDX-FileCopyrightText (#19100) 2025-06-03 11:20:17 -07:00
vision_language.py Add ignore consolidated file in mistral example code (#20420) 2025-07-04 02:55:07 +00:00
vision_language_embedding.py Support embedding models in V1 (#16188) 2025-06-18 21:36:33 -07:00
vision_language_multi_image.py Add ignore consolidated file in mistral example code (#20420) 2025-07-04 02:55:07 +00:00