# Inference with existing models
MMDetection provides hundreds of pre-trained detection models in [Model Zoo](https://mmdetection.readthedocs.io/en/latest/model_zoo.html).
This note shows how to run inference, that is, how to use trained models to detect objects in images.
In MMDetection, a model is defined by a [configuration file](https://mmdetection.readthedocs.io/en/latest/user_guides/config.html) and existing model parameters are saved in a checkpoint file.
To start with, we recommend [RTMDet](https://github.com/open-mmlab/mmdetection/tree/main/configs/rtmdet) with this [configuration file](https://github.com/open-mmlab/mmdetection/blob/main/configs/rtmdet/rtmdet_l_8xb32-300e_coco.py) and this [checkpoint file](https://download.openmmlab.com/mmdetection/v3.0/rtmdet/rtmdet_l_8xb32-300e_coco/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth). We recommend downloading the checkpoint file to the `checkpoints` directory.
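For convenience, the checkpoint can be fetched from the command line. This is a minimal sketch assuming `wget` is available and the command is run from the repository root (the config file already ships with the repository):
```shell
# Create the directory this guide expects and download the checkpoint into it
mkdir -p checkpoints
wget -P checkpoints \
    https://download.openmmlab.com/mmdetection/v3.0/rtmdet/rtmdet_l_8xb32-300e_coco/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth
```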
## High-level APIs for inference
MMDetection provides high-level Python APIs for inference on images. Here is an example of building the model and running inference on given images or videos.
```python
import cv2
import mmcv
from mmcv.transforms import Compose
from mmengine.utils import track_iter_progress

from mmdet.registry import VISUALIZERS
from mmdet.apis import init_detector, inference_detector

# Specify the path to the model config and checkpoint file
config_file = 'configs/rtmdet/rtmdet_l_8xb32-300e_coco.py'
checkpoint_file = 'checkpoints/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth'

# Build the model from a config file and a checkpoint file
model = init_detector(config_file, checkpoint_file, device='cuda:0')

# Init the visualizer
visualizer = VISUALIZERS.build(model.cfg.visualizer)
# The dataset_meta is loaded from the checkpoint and
# then passed to the model in init_detector
visualizer.dataset_meta = model.dataset_meta

# Test a single image and show the results
img = 'test.jpg'  # or img = mmcv.imread(img), which will only load it once
result = inference_detector(model, img)

# Show the results
img = mmcv.imread(img)
img = mmcv.imconvert(img, 'bgr', 'rgb')
visualizer.add_datasample(
    'result',
    img,
    data_sample=result,
    draw_gt=False,
    show=True)

# Test a video and show the results
# Build the test pipeline, replacing the file loader with an ndarray
# loader since video frames arrive as in-memory arrays
model.cfg.test_dataloader.dataset.pipeline[0].type = 'LoadImageFromNDArray'
test_pipeline = Compose(model.cfg.test_dataloader.dataset.pipeline)

# The visualizer has already been created above; if you run this demo
# in one notebook, you do not need to build it again.
visualizer = VISUALIZERS.build(model.cfg.visualizer)
# The dataset_meta is loaded from the checkpoint and
# then passed to the model in init_detector
visualizer.dataset_meta = model.dataset_meta

# Display interval in milliseconds; 0 blocks until a key is pressed
wait_time = 1

video_reader = mmcv.VideoReader('video.mp4')

cv2.namedWindow('video', 0)
for frame in track_iter_progress(video_reader):
    result = inference_detector(model, frame, test_pipeline=test_pipeline)
    visualizer.add_datasample(
        name='video',
        image=frame,
        data_sample=result,
        draw_gt=False,
        show=False)
    frame = visualizer.get_image()
    mmcv.imshow(frame, 'video', wait_time)

cv2.destroyAllWindows()
```
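On a headless server the `show=True` window is unavailable. As a sketch, the visualizer can write the rendered image to disk instead, using the `out_file` argument of `add_datasample` (the output path `result.jpg` here is just illustrative):
```python
# Sketch: save the visualization to disk instead of opening a window.
# 'result.jpg' is an arbitrary output path chosen for this example.
visualizer.add_datasample(
    'result',
    img,
    data_sample=result,
    draw_gt=False,
    show=False,
    out_file='result.jpg')
```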
A notebook demo can be found in [demo/inference_demo.ipynb](https://github.com/open-mmlab/mmdetection/blob/main/demo/inference_demo.ipynb).
Note: `inference_detector` only supports single-image inference for now.
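Since `inference_detector` processes one image at a time, multiple images can be handled with a plain loop. The sketch below also shows how predictions can be read off the returned `DetDataSample`; the file names and the 0.3 score threshold are illustrative assumptions, not values from this guide.
```python
# A minimal sketch: single-image inference over a list of files,
# reading predictions from each returned DetDataSample.
for path in ['demo/demo.jpg', 'test.jpg']:  # illustrative file names
    result = inference_detector(model, path)
    instances = result.pred_instances
    keep = instances.scores > 0.3  # filter low-confidence detections
    print(path, int(keep.sum()), 'detections above threshold')
```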
## Demos
We also provide four demo scripts, implemented with the high-level APIs and supporting functionality code.
Source codes are available [here](https://github.com/open-mmlab/mmdetection/blob/main/demo).
### Image demo
This script performs inference on a single image.
```shell
python demo/image_demo.py \
${IMAGE_FILE} \
${CONFIG_FILE} \
[--weights ${WEIGHTS}] \
[--device ${GPU_ID}] \
[--pred-score-thr ${SCORE_THR}]
```
Examples:
```shell
python demo/image_demo.py demo/demo.jpg \
configs/rtmdet/rtmdet_l_8xb32-300e_coco.py \
--weights checkpoints/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth \
--device cpu
```
### Webcam demo
This is a live demo from a webcam.
```shell
python demo/webcam_demo.py \
${CONFIG_FILE} \
${CHECKPOINT_FILE} \
[--device ${GPU_ID}] \
[--camera-id ${CAMERA_ID}] \
[--score-thr ${SCORE_THR}]
```
Examples:
```shell
python demo/webcam_demo.py \
configs/rtmdet/rtmdet_l_8xb32-300e_coco.py \
checkpoints/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth
```
### Video demo
This script performs inference on a video.
```shell
python demo/video_demo.py \
${VIDEO_FILE} \
${CONFIG_FILE} \
${CHECKPOINT_FILE} \
[--device ${GPU_ID}] \
[--score-thr ${SCORE_THR}] \
[--out ${OUT_FILE}] \
[--show] \
[--wait-time ${WAIT_TIME}]
```
Examples:
```shell
python demo/video_demo.py demo/demo.mp4 \
configs/rtmdet/rtmdet_l_8xb32-300e_coco.py \
checkpoints/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth \
--out result.mp4
```
### Video demo with GPU acceleration
This script performs inference on a video with GPU acceleration.
```shell
python demo/video_gpuaccel_demo.py \
${VIDEO_FILE} \
${CONFIG_FILE} \
${CHECKPOINT_FILE} \
[--device ${GPU_ID}] \
[--score-thr ${SCORE_THR}] \
[--nvdecode] \
[--out ${OUT_FILE}] \
[--show] \
[--wait-time ${WAIT_TIME}]
```
Examples:
```shell
python demo/video_gpuaccel_demo.py demo/demo.mp4 \
configs/rtmdet/rtmdet_l_8xb32-300e_coco.py \
checkpoints/rtmdet_l_8xb32-300e_coco_20220719_112030-5a0be7c4.pth \
--nvdecode --out result.mp4
```