Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A Benchmark Review of YOLO Algorithm Developments for Object Detectionopen access

Authors
Hua, ZhengmaoAranganadin, KaviyaYeh, Cheng-ChengHai, XinheHuang, Chen-YunLeung, Tsan-ChuenHsu, Hua-YiLan, Yung-ChiangLin, Ming-Chieh
Issue Date
Jul-2025
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
YOLO; Feature extraction; Accuracy; Detectors; Deep learning; Benchmark testing; Computer vision; Reviews; Convolutional neural networks; Classification algorithms; COCO2017; computer vision; object detection; mAP; VOC07+12
Citation
IEEE Access, v.13, pp 123515 - 123545
Pages
31
Indexed
SCIE
SCOPUS
Journal Title
IEEE Access
Volume
13
Start Page
123515
End Page
123545
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208424
DOI
10.1109/ACCESS.2025.3586673
ISSN
2169-3536
2169-3536
Abstract
You Only Look Once (YOLO) has established itself as a prominent object detection framework due to its excellent balance between speed and accuracy. This article provides a thorough review of the YOLO series, from YOLOv1 to YOLOv10, including YOLOX, emphasizing their architectural advancements, loss function improvements, and performance enhancements. We have benchmarked the officially released versions from YOLOv3 to YOLOv10 and YOLOX, using widely recognized datasets VOC07+12 and COCO2017, on diverse hardware platforms: NVIDIA GTX Titan X, RTX 3060, and Tesla V100. The benchmark provides significant insights, such as YOLOv9-E achieving the highest mean average precision (mAP) of 76.0% on VOC07+12 and also showing superior detection accuracy on COCO2017 with an mAP of 56.6% which is 1.2% higher than that of the latest YOLOv10-X. YOLOv9-E stands out for its superior detection accuracy making it more suitable for detection that needs high accuracy such as analysis of medical images, while some lightweight versions like YOLOv5-S, YOLOv7-S, YOLOv8-S, and YOLOv10-S offer the great balance of accuracy and speed, making them ideal for real-time applications. Among them, YOLOv7-S has the highest mAP value among these lightweight models. Inference benchmarks highlight lightweight YOLO models such as YOLOv10-S for their exceptional inference speed on all GPUs and results of training time also indicate YOLOv9-E would take the longest time to converge among all versions using both datasets. This study would provide researchers and developers with some strategies in choosing appropriate YOLO models based on accuracy, resource availability, and application-specific needs.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lin, Ming Chieh photo

Lin, Ming Chieh
COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE