pymllm.quantize.cast2fp32_pass

Classes

Cast2Fp32QuantizePass

Helper class that provides a standard way to create an ABC using inheritance.

Module Contents

class pymllm.quantize.cast2fp32_pass.Cast2Fp32QuantizePass

Bases: pymllm.quantize.quantize_pass.QuantizeBasePass

Helper class that provides a standard way to create an ABC using inheritance.

prepare(quantize_config, tensor_dict, **kwargs)
Parameters:

tensor_dict (Dict)

Return type:

pymllm.quantize.quantize_pass.QuantizePlanPayload

match(quantize_config, tensor_dict, **kwargs)
Parameters:

tensor_dict (Dict)

Return type:

bool

run(quantize_config, tensor_dict, **kwargs)
Parameters:

tensor_dict (Dict)

Return type:

Dict
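
The reference above lists only signatures, so the following is a minimal sketch of how a pass exposing prepare, match, and run might behave. It does not use pymllm: SketchBasePass and SketchCast2Fp32Pass are hypothetical stand-ins, the plan returned by prepare is a plain dict rather than a QuantizePlanPayload, tensors are modelled with the standard-library array type, and the call order (prepare, then match, then run) is an assumption rather than documented behaviour.

```python
from abc import ABC, abstractmethod
from array import array
from typing import Any, Dict


class SketchBasePass(ABC):
    """Hypothetical stand-in for QuantizeBasePass (not the pymllm class)."""

    @abstractmethod
    def prepare(self, quantize_config: Any, tensor_dict: Dict, **kwargs) -> Any:
        ...

    @abstractmethod
    def match(self, quantize_config: Any, tensor_dict: Dict, **kwargs) -> bool:
        ...

    @abstractmethod
    def run(self, quantize_config: Any, tensor_dict: Dict, **kwargs) -> Dict:
        ...


class SketchCast2Fp32Pass(SketchBasePass):
    """Hypothetical pass that casts every tensor to 32-bit floats."""

    def prepare(self, quantize_config, tensor_dict, **kwargs):
        # Assumption: the plan simply names the tensors that need casting.
        targets = [n for n, t in tensor_dict.items() if t.typecode != "f"]
        return {"targets": targets}

    def match(self, quantize_config, tensor_dict, **kwargs):
        # The pass applies if at least one tensor is not already fp32.
        return any(t.typecode != "f" for t in tensor_dict.values())

    def run(self, quantize_config, tensor_dict, **kwargs):
        # Build a new dict; typecode "f" stores each value as a C float.
        return {n: array("f", t.tolist()) for n, t in tensor_dict.items()}


# Usage: "w" is stored as doubles, "b" is already fp32.
tensors = {"w": array("d", [0.5, 1.5]), "b": array("f", [0.25])}
config = {}  # placeholder; the real quantize_config type is not documented here

cast_pass = SketchCast2Fp32Pass()
plan = cast_pass.prepare(config, tensors)
if cast_pass.match(config, tensors):
    tensors_fp32 = cast_pass.run(config, tensors)
else:
    tensors_fp32 = tensors
```

In this sketch, run returns a new dictionary instead of mutating its input, which keeps the original tensors available for comparison. Whether the real pass works in place is not stated in the reference.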