vllm/csrc/quantization/gptq
CHU Tianxiang 01a5d18a53
Add Support for 2/3/8-bit GPTQ Quantization Models (#2330)
2024-02-28 21:52:23 -08:00
..
compat.cuh Add GPTQ support (#916) 2023-12-15 03:04:22 -08:00
matrix_view.cuh Add Support for 2/3/8-bit GPTQ Quantization Models (#2330) 2024-02-28 21:52:23 -08:00
q_gemm.cu Add Support for 2/3/8-bit GPTQ Quantization Models (#2330) 2024-02-28 21:52:23 -08:00
qdq_2.cuh Add Support for 2/3/8-bit GPTQ Quantization Models (#2330) 2024-02-28 21:52:23 -08:00
qdq_3.cuh Add Support for 2/3/8-bit GPTQ Quantization Models (#2330) 2024-02-28 21:52:23 -08:00
qdq_4.cuh Add Support for 2/3/8-bit GPTQ Quantization Models (#2330) 2024-02-28 21:52:23 -08:00
qdq_8.cuh Add Support for 2/3/8-bit GPTQ Quantization Models (#2330) 2024-02-28 21:52:23 -08:00
qdq_util.cuh Add GPTQ support (#916) 2023-12-15 03:04:22 -08:00