vllm/examples/online_serving
David Xia 5c04bb8b86
[doc] fix multimodal example script (#18089)
Signed-off-by: David Xia <david@davidxia.com>
2025-05-16 06:05:34 +00:00
chart-helm Update PyTorch to 2.7.0 (#16859) 2025-04-29 19:08:04 -07:00
disaggregated_serving Improve examples rendering in docs and GitHub (#18203) 2025-05-15 15:57:49 +00:00
opentelemetry Improve examples rendering in docs and GitHub (#18203) 2025-05-15 15:57:49 +00:00
prometheus_grafana fix typo of grafana dashboard, with correct datasource (#13668) 2025-02-21 18:21:05 +00:00
api_client.py [Misc] refactor argument parsing in examples (#16635) 2025-04-15 08:05:30 +00:00
cohere_rerank_client.py [Misc] refactor examples (#16563) 2025-04-14 09:59:15 +00:00
disaggregated_prefill.sh [Frontend][Bugfix] support prefill decode disaggregation on deepseek (#14824) 2025-03-20 00:00:33 -07:00
gradio_openai_chatbot_webserver.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
gradio_webserver.py [Misc] refactor argument parsing in examples (#16635) 2025-04-15 08:05:30 +00:00
jinaai_rerank_client.py [Misc] refactor examples (#16563) 2025-04-14 09:59:15 +00:00
kv_events_subscriber.py [V1][Metrics] add support for kv event publishing (#16750) 2025-04-30 07:44:45 -07:00
multi-node-serving.sh [Misc] Adding script to setup ray for multi-node vllm deployments (#12913) 2025-02-20 21:16:40 -08:00
openai_chat_completion_client.py [Misc] refactor examples (#16563) 2025-04-14 09:59:15 +00:00
openai_chat_completion_client_for_multimodal.py [doc] fix multimodal example script (#18089) 2025-05-16 06:05:34 +00:00
openai_chat_completion_client_with_tools.py [Misc] remove --model from vllm serve usage (#17944) 2025-05-10 13:23:31 +00:00
openai_chat_completion_client_with_tools_required.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
openai_chat_completion_structured_outputs.py [Chore][Doc] uses model id determined from OpenAI client (#17815) 2025-05-08 01:48:57 +00:00
openai_chat_completion_structured_outputs_structural_tag.py [Chore][Doc] uses model id determined from OpenAI client (#17815) 2025-05-08 01:48:57 +00:00
openai_chat_completion_structured_outputs_with_reasoning.py [Chore][Doc] uses model id determined from OpenAI client (#17815) 2025-05-08 01:48:57 +00:00
openai_chat_completion_tool_calls_with_reasoning.py [Feature][Frontend]: Deprecate --enable-reasoning (#17452) 2025-05-01 06:46:16 -07:00
openai_chat_completion_with_reasoning.py [Feature][Frontend]: Deprecate --enable-reasoning (#17452) 2025-05-01 06:46:16 -07:00
openai_chat_completion_with_reasoning_streaming.py [Feature][Frontend]: Deprecate --enable-reasoning (#17452) 2025-05-01 06:46:16 -07:00
openai_chat_embedding_client_for_multimodal.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
openai_classification_client.py [Frontend] Add /classify endpoint (#17032) 2025-05-11 07:57:07 +00:00
openai_completion_client.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
openai_cross_encoder_score.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
openai_embedding_client.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
openai_embedding_matryoshka_fy.py [Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970) 2025-04-24 07:06:28 -07:00
openai_pooling_client.py [Misc] refactor examples series (#16708) 2025-04-16 10:16:36 +00:00
openai_transcription_client.py [BugFix] Fix authorization of openai_transcription_client.py (#17321) 2025-04-30 09:51:05 -07:00
ray_serve_deepseek.py [Misc] Add references in ray_serve_deepseek example (#17907) 2025-05-09 16:59:36 +00:00
retrieval_augmented_generation_with_langchain.py [doc] Add RAG Integration example (#17692) 2025-05-06 16:10:23 +00:00
retrieval_augmented_generation_with_llamaindex.py [doc] Add RAG Integration example (#17692) 2025-05-06 16:10:23 +00:00
run_cluster.sh [Doc] Move examples into categories (#11840) 2025-01-08 13:09:53 +00:00
sagemaker-entrypoint.sh [Doc] Move examples into categories (#11840) 2025-01-08 13:09:53 +00:00
streamlit_openai_chatbot_webserver.py [doc] add streamlit integration (#17522) 2025-05-01 13:34:02 +00:00
utils.py [doc] fix multimodal example script (#18089) 2025-05-16 06:05:34 +00:00