A Benchmark Review of YOLO Algorithm Developments for Object Detection

Hua, Zhengmao; Aranganadin, Kaviya; Yeh, Cheng-Cheng; Hai, Xinhe; Huang, Chen-Yun; Leung, Tsan-Chuen; Hsu, Hua-Yi; Lan, Yung-Chiang; Lin, Ming-Chieh

doi:10.1109/ACCESS.2025.3586673

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A Benchmark Review of YOLO Algorithm Developments for Object Detectionopen access

Authors: Hua, Zhengmao; Aranganadin, Kaviya; Yeh, Cheng-Cheng; Hai, Xinhe; Huang, Chen-Yun; Leung, Tsan-Chuen; Hsu, Hua-Yi; Lan, Yung-Chiang; Lin, Ming-Chieh

Issue Date: Jul-2025

Publisher: Institute of Electrical and Electronics Engineers Inc.

Keywords: YOLO; Feature extraction; Accuracy; Detectors; Deep learning; Benchmark testing; Computer vision; Reviews; Convolutional neural networks; Classification algorithms; COCO2017; computer vision; object detection; mAP; VOC07+12

Citation: IEEE Access, v.13, pp 123515 - 123545

Pages: 31

Indexed: SCIE
SCOPUS

Journal Title: IEEE Access

Volume: 13

Start Page: 123515

End Page: 123545

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208424

DOI: 10.1109/ACCESS.2025.3586673

ISSN: 2169-3536
2169-3536

Abstract: You Only Look Once (YOLO) has established itself as a prominent object detection framework due to its excellent balance between speed and accuracy. This article provides a thorough review of the YOLO series, from YOLOv1 to YOLOv10, including YOLOX, emphasizing their architectural advancements, loss function improvements, and performance enhancements. We have benchmarked the officially released versions from YOLOv3 to YOLOv10 and YOLOX, using widely recognized datasets VOC07+12 and COCO2017, on diverse hardware platforms: NVIDIA GTX Titan X, RTX 3060, and Tesla V100. The benchmark provides significant insights, such as YOLOv9-E achieving the highest mean average precision (mAP) of 76.0% on VOC07+12 and also showing superior detection accuracy on COCO2017 with an mAP of 56.6% which is 1.2% higher than that of the latest YOLOv10-X. YOLOv9-E stands out for its superior detection accuracy making it more suitable for detection that needs high accuracy such as analysis of medical images, while some lightweight versions like YOLOv5-S, YOLOv7-S, YOLOv8-S, and YOLOv10-S offer the great balance of accuracy and speed, making them ideal for real-time applications. Among them, YOLOv7-S has the highest mAP value among these lightweight models. Inference benchmarks highlight lightweight YOLO models such as YOLOv10-S for their exceptional inference speed on all GPUs and results of training time also indicate YOLOv9-E would take the longest time to converge among all versions using both datasets. This study would provide researchers and developers with some strategies in choosing appropriate YOLO models based on accuracy, resource availability, and application-specific needs.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Lin, Ming Chieh photo

Lin, Ming Chieh: COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE