scaledown.quantization.weight_quantization
class scaledown.quantization.weight_quantization(level='float16')
Parameters
- level: str, ['float16', 'int8'] Used for setting the level of quantization
Attributes
- quantized_model: Returns the quantized model.
Methods
- scaledown