vllm/tests/neuron/1_core
Yong Hoon Shin 98c89e16ff
Make key optional for rotary embedding (#17566)
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
2025-05-07 00:11:46 -07:00
test_activation.py [Neuron] flatten test parameterization for neuron attention kernels (#14712) 2025-03-13 20:46:56 -07:00
test_block_table.py [Misc] Replace os environ to monkeypatch in test suite (#14516) 2025-03-16 20:35:57 -07:00
test_cache.py [Neuron][kernel] Fuse kv cache into a single tensor (#15911) 2025-04-03 09:51:32 -07:00
test_layernorm.py [Neuron] flatten test parameterization for neuron attention kernels (#14712) 2025-03-13 20:46:56 -07:00
test_logits_processor.py [Neuron] flatten test parameterization for neuron attention kernels (#14712) 2025-03-13 20:46:56 -07:00
test_neuron_model_runner.py Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling (#16357) 2025-05-07 00:07:30 -07:00
test_prefix_prefill.py [Neuron][kernel] Fuse kv cache into a single tensor (#15911) 2025-04-03 09:51:32 -07:00
test_rotary_embedding.py Make key optional for rotary embedding (#17566) 2025-05-07 00:11:46 -07:00