Q-YOLO: Efficient Inference for Real-time Object Detection

Wang, Mingze; Sun, Huixin; Shi, Jun; Liu, Xuhui; Zhang, Baochang; Cao, Xianbin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2307.04816 (cs)

[Submitted on 1 Jul 2023]

Title:Q-YOLO: Efficient Inference for Real-time Object Detection

Authors:Mingze Wang, Huixin Sun, Jun Shi, Xuhui Liu, Baochang Zhang, Xianbin Cao

View PDF

Abstract:Real-time object detection plays a vital role in various computer vision applications. However, deploying real-time object detectors on resource-constrained platforms poses challenges due to high computational and memory requirements. This paper describes a low-bit quantization method to build a highly efficient one-stage detector, dubbed as Q-YOLO, which can effectively address the performance degradation problem caused by activation distribution imbalance in traditional quantized YOLO models. Q-YOLO introduces a fully end-to-end Post-Training Quantization (PTQ) pipeline with a well-designed Unilateral Histogram-based (UH) activation quantization scheme, which determines the maximum truncation values through histogram analysis by minimizing the Mean Squared Error (MSE) quantization errors. Extensive experiments on the COCO dataset demonstrate the effectiveness of Q-YOLO, outperforming other PTQ methods while achieving a more favorable balance between accuracy and computational cost. This research contributes to advancing the efficient deployment of object detection models on resource-limited edge devices, enabling real-time detection with reduced computational and memory overhead.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.04816 [cs.CV]
	(or arXiv:2307.04816v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.04816

Submission history

From: Huixin Sun [view email]
[v1] Sat, 1 Jul 2023 03:50:32 UTC (532 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Q-YOLO: Efficient Inference for Real-time Object Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Q-YOLO: Efficient Inference for Real-time Object Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators