vllm/docs/source/features/quantization
Latest commit: 9c1244de57 by Reid — [doc] update to hyperlink (#17096)
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
2025-04-24 00:58:08 -07:00
| File | Last commit | Date |
| --- | --- | --- |
| auto_awq.md | [doc] update to hyperlink (#17096) | 2025-04-24 00:58:08 -07:00 |
| bitblas.md | [Doc] Add top anchor and a note to quantization/bitblas.md (#17042) | 2025-04-23 07:32:16 -07:00 |
| bnb.md | [Misc] Auto detect bitsandbytes pre-quantized models (#16027) | 2025-04-04 23:30:45 -07:00 |
| fp8.md | [Doc] Convert docs to use colon fences (#12471) | 2025-01-29 11:38:29 +08:00 |
| gguf.md | doc: fix some typos in doc (#16154) | 2025-04-07 05:32:06 +00:00 |
| gptqmodel.md | [Docs] Add GPTQModel (#14056) | 2025-03-03 21:59:09 +00:00 |
| index.md | [Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036) | 2025-04-22 09:01:36 +01:00 |
| int4.md | [Doc] int4 w4a16 example (#12585) | 2025-01-31 15:38:48 -08:00 |
| int8.md | [Doc] int4 w4a16 example (#12585) | 2025-01-31 15:38:48 -08:00 |
| quantized_kvcache.md | [FP8][Kernel] Dynamic kv cache scaling factors computation (#11906) | 2025-01-23 18:04:03 +00:00 |
| quark.md | [Doc] Quark quantization documentation (#15861) | 2025-04-01 08:32:45 -07:00 |
| supported_hardware.md | [Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036) | 2025-04-22 09:01:36 +01:00 |
| torchao.md | Torchao (#14231) | 2025-04-07 19:39:28 -04:00 |