vllm/csrc at woosuk/async-sched

Li, Jiang 6cc1e7d96d [CPU] Update custom ops for the CPU backend (#20255) Signed-off-by: jiang1.li <jiang1.li@intel.com>	2025-07-01 07:25:03 +00:00
attention	…
core	…
cpu	…
cutlass_extensions	…
mamba	…
moe	…
prepare_inputs	…
quantization	…
quickreduce	…
rocm	…
sparse/cutlass	…
activation_kernels.cu	…
cache.h	…
cache_kernels.cu	…
cuda_compat.h	…
cuda_utils.h	…
cuda_utils_kernels.cu	…
cuda_view.cu	…
cumem_allocator.cpp	…
custom_all_reduce.cu	…
custom_all_reduce.cuh	…
custom_all_reduce_test.cu	…
custom_quickreduce.cu	…
dispatch_utils.h	…
layernorm_kernels.cu	…
layernorm_quant_kernels.cu	…
ops.h	…
permute_cols.cu	…
pos_encoding_kernels.cu	…
sampler.cu	…
torch_bindings.cpp	…
type_convert.cuh	…