..
basic
Support embedding models in V1 ( #16188 )
2025-06-18 21:36:33 -07:00
disaggregated-prefill-v1
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
openai_batch
[Frontend] add run batch to CLI ( #18804 )
2025-05-28 07:08:57 -07:00
profiling_tpu
[Misc] refactor neuron_multimodal and profiling ( #19397 )
2025-06-10 06:12:42 +00:00
qwen2_5_omni
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
audio_language.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
automatic_prefix_caching.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
batch_llm_inference.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
chat_with_tools.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
context_extension.py
[Misc] refactor context extension ( #19246 )
2025-06-07 05:13:21 +00:00
data_parallel.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
disaggregated_prefill.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
eagle.py
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets ( #18847 )
2025-06-12 10:30:56 -07:00
embed_jina_embeddings_v3.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
embed_matryoshka_fy.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
encoder_decoder.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
encoder_decoder_multimodal.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
llm_engine_example.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
load_sharded_state.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
lora_with_quantization_inference.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
metrics.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
mistral-small.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
mlpspeculator.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
multilora_inference.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
neuron.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
neuron_eagle.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
neuron_int8_quantization.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
neuron_multimodal.py
[Misc] refactor neuron_multimodal and profiling ( #19397 )
2025-06-10 06:12:42 +00:00
neuron_speculation.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
prefix_caching.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
prithvi_geospatial_mae.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
profiling.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
prompt_embed_inference.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
qwen3_reranker.py
[New Model]: Support Qwen3 Embedding & Reranker ( #19260 )
2025-06-10 20:07:30 -07:00
qwen_1m.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
reproducibility.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
rlhf.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
rlhf_colocate.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
rlhf_utils.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
save_sharded_state.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
simple_profiling.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
spec_decode.py
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets ( #18847 )
2025-06-12 10:30:56 -07:00
structured_outputs.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
torchrun_example.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
tpu.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
vision_language.py
[Misc] Add SPDX-FileCopyrightText ( #19100 )
2025-06-03 11:20:17 -07:00
vision_language_embedding.py
Support embedding models in V1 ( #16188 )
2025-06-18 21:36:33 -07:00
vision_language_multi_image.py
[Doc] Add missing llava family multi-image examples ( #19698 )
2025-06-17 07:05:21 +00:00