Enum KVCacheQuantizationMode
- Namespace
- AiDotNet.Configuration
- Assembly
- AiDotNet.dll
Controls optional KV-cache quantization for inference.
public enum KVCacheQuantizationMode
Fields
Int8 = 1Signed int8 quantization with scaling (advanced, opt-in).
None = 0No quantization (default).