Yuantao Feng

Add tools for quantization and quantized models (#36)

55654c5 over 3 years ago

999 Bytes

	# Quantization with ONNXRUNTIME

	ONNXRUNTIME is used for quantization in the Zoo.

	Install dependencies before trying quantization:
	```shell
	pip install -r requirements.txt
	```

	## Usage

	Quantize all models in the Zoo:
	```shell
	python quantize.py
	```

	Quantize one of the models in the Zoo:
	```shell
	# python quantize.py <key_in_models>
	python quantize.py yunet
	```

	Customizing quantization configs:
	```python
	# add model into `models` dict in quantize.py
	models = dict(
	# ...
	model1=Quantize(model_path='/path/to/model1.onnx'
	calibration_image_dir='/path/to/images',
	transforms=Compose([''' transforms ''']), # transforms can be found in transforms.py
	per_channel=False, # set False to quantize in per-tensor style
	act_type='int8', # available types: 'int8', 'uint8'
	wt_type='int8' # available types: 'int8', 'uint8'
	)
	)
	# quantize the added models
	python quantize.py model1
	```