vllm/online_serving at 6d0df0ebebd4e347e1ebcdea4be010a4b54b901b - vllm

Reid 1bcbcbf574 [Misc] refactor example series - structured outputs (#17040) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>		2025-04-24 07:49:48 -07:00
chart-helm	[CI/Build] Auto-fix Markdown files (#12941)	2025-02-08 04:25:15 -08:00
disagg_examples	[Feature][Disaggregated] Support XpYd disaggregated prefill with MooncakeStore (#12957)	2025-03-29 04:01:46 -07:00
opentelemetry	Deprecate `best_of` Sampling Parameter in anticipation for vLLM V1 (#13997)	2025-03-05 20:22:43 +00:00
prometheus_grafana	fix typo of grafana dashboard, with correct datasource (#13668)	2025-02-21 18:21:05 +00:00
api_client.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
cohere_rerank_client.py	[Misc] refactor examples (#16563)	2025-04-14 09:59:15 +00:00
disaggregated_prefill.sh	[Frontend][Bugfix] support prefill decode disaggregation on deepseek (#14824)	2025-03-20 00:00:33 -07:00
gradio_openai_chatbot_webserver.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
gradio_webserver.py	[Misc] refactor argument parsing in examples (#16635)	2025-04-15 08:05:30 +00:00
jinaai_rerank_client.py	[Misc] refactor examples (#16563)	2025-04-14 09:59:15 +00:00
multi-node-serving.sh	[Misc] Adding script to setup ray for multi-node vllm deployments (#12913)	2025-02-20 21:16:40 -08:00
openai_chat_completion_client.py	[Misc] refactor examples (#16563)	2025-04-14 09:59:15 +00:00
openai_chat_completion_client_for_multimodal.py	Improve-mm-and-pooler-and-decoding-configs (#16789)	2025-04-17 22:13:32 -07:00
openai_chat_completion_client_with_tools.py	[Misc] refactor examples series - Chat Completion Client With Tools (#16829)	2025-04-18 23:24:42 +00:00
openai_chat_completion_client_with_tools_required.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_chat_completion_structured_outputs.py	[Misc] refactor example series - structured outputs (#17040)	2025-04-24 07:49:48 -07:00
openai_chat_completion_structured_outputs_with_reasoning.py	[Misc] refactor example series - structured outputs (#17040)	2025-04-24 07:49:48 -07:00
openai_chat_completion_tool_calls_with_reasoning.py	[Misc] refactor example series (#16972)	2025-04-22 11:44:21 +00:00
openai_chat_completion_with_reasoning.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_chat_completion_with_reasoning_streaming.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_chat_embedding_client_for_multimodal.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_completion_client.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_cross_encoder_score.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_embedding_client.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_embedding_matryoshka_fy.py	[Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970)	2025-04-24 07:06:28 -07:00
openai_pooling_client.py	[Misc] refactor examples series (#16708)	2025-04-16 10:16:36 +00:00
openai_transcription_client.py	[Frontend] Add sampling params to `v1/audio/transcriptions` endpoint (#16591)	2025-04-19 07:03:54 +00:00
run_cluster.sh	[Doc] Move examples into categories (#11840)	2025-01-08 13:09:53 +00:00
sagemaker-entrypoint.sh	[Doc] Move examples into categories (#11840)	2025-01-08 13:09:53 +00:00