pymllm.quantize.solver
Classes
Module Contents
- class pymllm.quantize.solver.QuantizeSolver
- passes: List[pymllm.quantize.quantize_pass.QuantizeBasePass] = []
- register_pass(pass_)
- Parameters:
pass_
- stream_quantize_params_size(quant_cfg, tensor_dict, **kwargs)
- Parameters:
tensor_dict (Dict)
- Return type:
int
- stream_quantize(quant_cfg, tensor_dict, writer, **kwargs)
- Parameters:
tensor_dict (Dict)
writer (pymllm.convertor.ModelFileV2)
- Return type:
bool
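The signatures above suggest a pass-registry pattern: passes are registered on the solver, one method reports how many bytes the quantized parameters would occupy, and another streams them to a writer. The following is a minimal sketch of that pattern only, not pymllm's implementation. `SketchPass`, `SketchSolver`, the `match`/`run` methods, the toy int8 scheme, the float32 fallback for unmatched tensors, and the plain `dict` used in place of `pymllm.convertor.ModelFileV2` are all hypothetical stand-ins; `quant_cfg` is accepted but unused because its structure is not documented here.

```python
import struct
from typing import Any, Dict, List


class SketchPass:
    """Hypothetical stand-in for a quantize pass (not pymllm's QuantizeBasePass)."""

    def match(self, name: str) -> bool:
        # Assumption: this toy pass only handles tensors named "*.weight".
        return name.endswith(".weight")

    def run(self, name: str, values: List[float]) -> bytes:
        # Toy symmetric int8 quantization: scale by the largest magnitude.
        scale = max((abs(v) for v in values), default=0.0) / 127 or 1.0
        return bytes(round(v / scale) & 0xFF for v in values)


class SketchSolver:
    """Hypothetical solver mirroring the documented method names."""

    def __init__(self) -> None:
        # Created per instance so that solvers do not share registered passes.
        self.passes: List[SketchPass] = []

    def register_pass(self, pass_: SketchPass) -> None:
        self.passes.append(pass_)

    def _quantize_one(self, name: str, values: List[float]) -> bytes:
        # The first registered pass that matches the tensor name wins.
        for p in self.passes:
            if p.match(name):
                return p.run(name, values)
        # Assumption: unmatched tensors are kept as little-endian float32.
        return struct.pack(f"<{len(values)}f", *values)

    def stream_quantize_params_size(
        self, quant_cfg: Any, tensor_dict: Dict[str, List[float]], **kwargs: Any
    ) -> int:
        # Total payload size in bytes after quantization.
        return sum(len(self._quantize_one(n, v)) for n, v in tensor_dict.items())

    def stream_quantize(
        self,
        quant_cfg: Any,
        tensor_dict: Dict[str, List[float]],
        writer: Dict[str, bytes],
        **kwargs: Any,
    ) -> bool:
        # Quantize one tensor at a time and hand it to the writer immediately.
        for name, values in tensor_dict.items():
            writer[name] = self._quantize_one(name, values)
        return True


solver = SketchSolver()
solver.register_pass(SketchPass())
tensors = {"fc.weight": [0.5, -1.0, 0.25], "fc.bias": [0.1]}
size = solver.stream_quantize_params_size({}, tensors)
out: Dict[str, bytes] = {}
ok = solver.stream_quantize({}, tensors, out)
```

One design note: the reference lists `passes` with a default of `[]` at class level. In Python, a list defined on the class is shared by every instance unless it is reassigned, so the sketch creates the list in `__init__` instead. Whether the real class does the same is not stated in this page.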