vllm/1_core at 55f1a468d97fbf9387e577e901b3f290ed8aa15b - vllm

Yong Hoon Shin 98c89e16ff Make key optional for rotary embedding (#17566) Signed-off-by: Yong Hoon Shin <yhshin@meta.com>	2025-05-07 00:11:46 -07:00
test_activation.py	[Neuron] flatten test parameterization for neuron attention kernels (#14712)	2025-03-13 20:46:56 -07:00
test_block_table.py	[Misc] Replace os environ to monkeypatch in test suite (#14516)	2025-03-16 20:35:57 -07:00
test_cache.py	[Neuron][kernel] Fuse kv cache into a single tensor (#15911)	2025-04-03 09:51:32 -07:00
test_layernorm.py	[Neuron] flatten test parameterization for neuron attention kernels (#14712)	2025-03-13 20:46:56 -07:00
test_logits_processor.py	[Neuron] flatten test parameterization for neuron attention kernels (#14712)	2025-03-13 20:46:56 -07:00
test_neuron_model_runner.py	Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling (#16357)	2025-05-07 00:07:30 -07:00
test_prefix_prefill.py	[Neuron][kernel] Fuse kv cache into a single tensor (#15911)	2025-04-03 09:51:32 -07:00
test_rotary_embedding.py	Make key optional for rotary embedding (#17566)	2025-05-07 00:11:46 -07:00