Skip to content

scaledown.quantization.weight_quantization

class scaledown.quantization.weight_quantization(level='float16')

Parameters

  • level: str, ['float16', 'int8'] Used for setting the level of quantization

Attributes

  • quantized_model: Returns the quantized model.

Methods

  • scaledown