当前位置：首页 > news >正文

TensorRT量化工具pytorch_quantization代码解析(二）

news 2025/10/16 2:25:56

有些地方看的不是透彻，后续继续补充！

继续看张量量化函数，代码位于：tools\pytorch-quantization\pytorch_quantization\tensor_quant.py

ScaledQuantDescriptor

量化的支持描述符:描述张量应该如何量化。QuantDescriptor和张量定义了量化张量。

class ScaledQuantDescriptor():def __init__(self, num_bits=8, name=None, **kwargs):if not isinstance(num_bits, int):raise TypeError("num_bits must be an integer, not {}.".format(type(num_bits)))if num_bits < 0:raise ValueError("num_bits must be >= 0, not {}.".format(num_bits))if num_bits == 0:logging.error("num_bits is 0. This will result in the tensor being quantized to all zeros."" This mode should only be used for debugging purposes.")self._num_bits = num_bitsif not isinstance(name, str) and name is not None:raise TypeError("name must be a string or None, not {}.".format(type(name)))self._name = nameself._fake_quant = kwargs.pop('fake_quant', True)self._axis = kwargs.pop('axis', None)if self._axis is not None:logging.debug("Meaning of axis has changed since v2.0. Make sure to update.")self._learn_amax = kwargs.pop('learn_amax', False)if self._learn_amax and self._axis is not None:raise TypeError("axis is ignored and must be None when learn_amax is true, got {}.".format(type(self._axis)))amax = kwargs.pop('amax', None)if amax is not None:if not isinstance(amax, float) and not isinstance(amax, list) and not isinstance(amax, np.ndarray):raise TypeError("amax must be float, list or ndarray, not {}".format(type(amax)))# Make it single precision arrayself._amax = np.array(amax, dtype=np.float32)else:self._amax = amaxself._scale_amax = kwargs.pop('scale_amax', None)self._calib_method = kwargs.pop('calib_method', "max")self._unsigned = kwargs.pop('unsigned', False)self._narrow_range = kwargs.pop('narrow_range', False)if kwargs:raise TypeError("Unused keys: {}".format(kwargs.keys()))

参数：

num_bits:int,量化位数,用于计算比例因子。默认值8。
name:看起来很不错

关键字参数：

fake_quant：布尔值。如果为True，则使用fake量化模式。默认为True
axis：None, int或整数的tuple，轴将利用自己的最大值以计算缩放因子，默认None。
- 如果None（默认值），则使用per tensor scale。
  确保在范围[-rank（input_tensor），rank（输入_tensor））内。
  例如，对于KCRS权重张量，quant_axis=（0）将产生per channel scaling。
amax：用户指定的绝对最大范围的float或list/ndarray。如果提供，忽略quant_axis并使用它进行量化。如果learn_amax为True，将用于初始化可学习的amax。默认None
learn_amax：boolean，如果为True，学习amax。默认为False。
scale_amax：float，如果提供，将amax乘以scale_amax，默认无。
calib_method：string，[“max”，“histogram”]中的一个校准要使用的指标。除了
max calibration，其他都是基于hisogram的。默认值“max”。
unsigned：Boolean，如果为True，则使用无符号。默认为False。

Raises:

TypeError：如果传入了不支持的类型。

Read-only properties:

fake_quant:
name:
learn_amax:
scale_amax:
axis:
calib_method:
num_bits:
amax:
unsigned:

QuantDescriptor定义了张量应该如何量化。预定义的QuantDescriptor张量描述符如下：

QuantDescriptor = ScaledQuantDescriptor# Predefined descriptors
QUANT_DESC_8BIT_PER_TENSOR = QuantDescriptor(num_bits=8)
QUANT_DESC_UNSIGNED_8BIT_PER_TENSOR = QuantDescriptor(num_bits=8, unsigned=True)
QUANT_DESC_8BIT_CONV1D_WEIGHT_PER_CHANNEL = QuantDescriptor(num_bits=8, axis=(0))
QUANT_DESC_8BIT_CONV2D_WEIGHT_PER_CHANNEL = QuantDescriptor(num_bits=8, axis=(0))
QUANT_DESC_8BIT_CONV3D_WEIGHT_PER_CHANNEL = QuantDescriptor(num_bits=8, axis=(0))
QUANT_DESC_8BIT_LINEAR_WEIGHT_PER_ROW = QuantDescriptor(num_bits=8, axis=(0))
QUANT_DESC_8BIT_CONVTRANSPOSE1D_WEIGHT_PER_CHANNEL = QuantDescriptor(num_bits=8, axis=(0))
QUANT_DESC_8BIT_CONVTRANSPOSE2D_WEIGHT_PER_CHANNEL = QuantDescriptor(num_bits=8, axis=(0))
QUANT_DESC_8BIT_CONVTRANSPOSE3D_WEIGHT_PER_CHANNEL = QuantDescriptor(num_bits=8, axis=(0))

如果在QuantDescriptor中给出最amax，TensorQuantizer将使用它进行量化。否则，TensorQuantizer将计算amax，然后进行量化。amax被计算通过指定的axis轴。注意QuantDescriptor将剩余轴指定与max()轴相反。

例子：

from pytorch_quantization.tensor_quant import QuantDescriptor
from pytorch_quantization.nn.modules.tensor_quantizer import TensorQuantizerquant_desc = QuantDescriptor(num_bits=4, fake_quant=False, axis=(0), unsigned=True)

接下来看量化函数：pytorch_quantization提供3个自定义的张量量化函数算子，继承torch.autograd.function，实现函数的前向传播、反向传播

TensorQuantFunction

通用的张量量化函数TensorQuantFunction

class TensorQuantFunction(Function):"""一个输入张量，输出一个量化张量。`scale`的粒度可以从amax的形状来解释"""

forward

在前向过程中，对浮点权重和激活进行伪量化，并使用这些伪量化的权重和激活来执行层的操作

    @staticmethoddef forward(ctx, inputs, amax, num_bits=8, unsigned=False, narrow_range=True):ctx.save_for_backward(inputs, amax)outputs, scale = _tensor_quant(inputs, amax, num_bits, unsigned, narrow_range)# Check if scale overflows FP16if outputs.dtype == torch.half and scale.max() > 65504:raise ValueError("scale is too large for FP16 with amax={}".format(amax))return outputs, scale.to(inputs.dtype)

output_dtype指示量化值是以整数还是浮点形式存储。希望将其存储在浮点中的原因是pytorch函数接受量化值，它可能不接受整数输入，例如Conv2D。
它使用 $2^{num\_bits-1}$ 值，例如，对于num_bits＝8，使用[-127，127]

遵循tensorflow约定，传入最大值并用于确定比例，而不是直接输入比例。尽管直接输入比例可能更自然。

参数：

ctx：一个用于向后存储张量的Context对象。
inputs：float32型张量。
amax：float32型张量。输入将在[-amax，amax]范围内量化，amax将广播到inputs tensor。
num_bits：用于计算缩放因子的整数， $scale=(2^{num\_bits-1}-1)/max$ 。默认值8。
output_dtype：张量的一种类型。torch.int32或torch.float32。希望存储为float，pytorch函数接受float量化值，它可能不接受整数输入。
unsigned：boolean，使用无符号整数范围。例如，对于num_bits=8，[0，255]。默认为False。
narrow_range：布尔值。使用对称整数范围进行有符号量化
例如，对于num_bits=8，用[-127，127]代替[-128，127]。默认为True。

Returns:

outputs：output_dtype类型的张量。
scale：float32型张量。outputs / scale将对输出张量进行反量化。

Raises:

ValueError:

backward

通过clipping实现直通估计。对于-amax<=input<=amax，梯度直接通过，否则梯度为零。
参数：

ctx：一个上下文对象，其中保存了来自forward的张量。
grad_outputs：outputs梯度张量。
grad_scale：scale梯度张量。

Returns:

grad_inputs：梯度张量。

    @staticmethoddef backward(ctx, grad_outputs, grad_scale):"""Implements straight through estimation with clipping. For -amax <= input <= amaxthe gradient passes straight through, otherwise the gradient is zero.Args:ctx: A Context object with saved tensors from forward.grad_outputs: A tensor of gradient of outputs.grad_scale: A tensor of gradient of scale.Returns:grad_inputs: A tensor of gradient."""inputs, amax = ctx.saved_tensorszero = grad_outputs.new_zeros(1) # create a zero tensor with the same type and devicegrad_inputs = torch.where(inputs.abs() <= amax, grad_outputs, zero)return grad_inputs, None, None, None, None

tensor_quant = TensorQuantFunction.apply

给TensorQuantFunction.apply赋予一个别名tensor_quant，这样可以直接调用tensor_quant进行量化，例如：

from pytorch_quantization import tensor_quant# Generate random input. With fixed seed 12345, x should be
# tensor([0.9817, 0.8796, 0.9921, 0.4611, 0.0832, 0.1784, 0.3674, 0.5676, 0.3376, 0.2119])
torch.manual_seed(12345)
x = torch.rand(10)# quantize tensor x. quant_x will be
# tensor([126., 113., 127.,  59.,  11.,  23.,  47.,  73.,  43.,  27.])
# with scale=128.0057
quant_x, scale = tensor_quant.tensor_quant(x, x.abs().max())

FakeTensorQuantFunction

class FakeTensorQuantFunction(Function):"""Fake version of TensorQuantFunctionSee comments of TensorQuantFunction, arguments are the same."""@staticmethoddef forward(ctx, inputs, amax, num_bits=8, unsigned=False, narrow_range=True):ctx.save_for_backward(inputs, amax)outputs, scale = _tensor_quant(inputs, amax, num_bits, unsigned, narrow_range)return outputs / scale.to(inputs.dtype)@staticmethoddef backward(ctx, grad_outputs):inputs, amax = ctx.saved_tensorszero = grad_outputs.new_zeros(1)grad_inputs = torch.where(inputs.abs() <= amax, grad_outputs, zero)return grad_inputs, None, None, None, None

在向后过程中，使用权重的渐变来更新浮点权重。为了处理量化梯度，除了未定义的点之外，几乎所有地方都是零，可以使用直通估计器（ STE ），它通过伪量化操作符传递梯度。

fake_tensor_quant = FakeTensorQuantFunction.apply

给TensorQuantFunction.apply赋予一个别名fake_tensor_quant，这样可以直接调用fake_tensor_quant进行量化，例如：

from pytorch_quantization import tensor_quant# Generate random input. With fixed seed 12345, x should be
# tensor([0.9817, 0.8796, 0.9921, 0.4611, 0.0832, 0.1784, 0.3674, 0.5676, 0.3376, 0.2119])
torch.manual_seed(12345)
x = torch.rand(10)# fake quantize tensor x. fake_quant_x will be
# tensor([0.9843, 0.8828, 0.9921, 0.4609, 0.0859, 0.1797, 0.3672, 0.5703, 0.3359, 0.2109])
fake_quant_x = tensor_quant.fake_tensor_quant(x, x.abs().max())

_tensor_quant

def _tensor_quant(inputs, amax, num_bits=8, unsigned=False, narrow_range=True):"""Shared function body between TensorQuantFunction and FakeTensorQuantFunction"""# Fine scale, per channel scale will be handled by broadcasting, which could be tricky. Pop a warning.if isinstance(amax, torch.Tensor) and inputs.dim() != amax.dim():logging.debug("amax %s has different shape than inputs %s. Make sure broadcast works as expected!",amax.size(), inputs.size())logging.debug("{} bits quantization on shape {} tensor.".format(num_bits, inputs.size()))if unsigned:if inputs.min() < 0.:raise TypeError("Negative values encountered in unsigned quantization.")# Computation must be in FP32 to prevent potential over flow.input_dtype = inputs.dtypeif inputs.dtype == torch.half:inputs = inputs.float()if amax.dtype == torch.half:amax = amax.float()min_amax = amax.min()if min_amax < 0:raise ValueError("Negative values in amax")max_bound = torch.tensor((2.0**(num_bits - 1 + int(unsigned))) - 1.0, device=amax.device)if unsigned:min_bound = 0elif narrow_range:min_bound = -max_boundelse:min_bound = -max_bound - 1scale = max_bound / amaxepsilon = 1. / (1<<24)if min_amax <= epsilon:  # Treat amax smaller than minimum representable of fp16 0zero_amax_mask = (amax <= epsilon)scale[zero_amax_mask] = 0  # Value quantized with amax=0 should all be 0outputs = torch.clamp((inputs * scale).round_(), min_bound, max_bound)if min_amax <= epsilon:scale[zero_amax_mask] = 1.  # Return 1 makes more sense for values quantized to 0 with amax=0if input_dtype == torch.half:outputs = outputs.half()return outputs, scale

待梳理！！！

TensorRT量化工具pytorch_quantization代码解析(二）

ScaledQuantDescriptor

TensorQuantFunction

FakeTensorQuantFunction

_tensor_quant

相关文章：

TensorRT量化工具pytorch_quantization代码解析(二）

buu [BJDCTF2020]easyrsa 1

taobao.user.openuid.getbyorder( 根据订单获取买家openuid )

Mac iTerm2 rz sz

高通平台开发系列讲解（Sensor篇）Gsensor基础知识

图像处理实战--Opencv实现人像迁移

OnlyOffice验证（二）在Centos7上部署OnlyOffice编译结果

6.补充和总结【Java面试第三季】

基于ssm框架大学生社团管理系统(源码+数据库+文档)

vulnhub靶场NAPPING: 1.0.1教程

Docker基本介绍

可用于标记蛋白质216699-36-4，6-ROX，SE，6-羧基-X-罗丹明琥珀酰亚胺酯

高数：极限的定义

大数据技术之Hadoop

一文带你搞懂Go语言函数选项模式，Go函数一等公民。

Window.location 详细介绍

js侧滑显示删除按钮

Python - DIY - 使用dump取json某些键值对合成新的json文件

深度剖析指针（中）——“C”

论文阅读 | Video Frame Synthesis using Deep Voxel Flow

网络编程（Modbus进阶）

LBE-LEX系列工业语音播放器|预警播报器|喇叭蜂鸣器的上位机配置操作说明

AI-调查研究-01-正念冥想有用吗？对健康的影响及科学指南

使用VSCode开发Django指南

基于FPGA的PID算法学习———实现PID比例控制算法

简易版抽奖活动的设计技术方案

visual studio 2022更改主题为深色

React Native在HarmonyOS 5.0阅读类应用开发中的实践

电脑插入多块移动硬盘后经常出现卡顿和蓝屏

【Web 进阶篇】优雅的接口设计：统一响应、全局异常处理与参数校验