DW-YOLO: An Efficient Object Detector for Drones and Self-driving Vehicles
- Authors
- Chen, Yunfan; Zheng, Wenqi; Zhao, Yangyi; Song, Tae Hun; Shin, Hyunchul
- Issue Date
- Feb-2023
- Publisher
- SPRINGER HEIDELBERG
- Keywords
- Object detection; Self-driving; Drone vision; Deep learning; Optimization
- Citation
- Arabian Journal For Science and Engineering, v.48, no.2, pp 1 - 10
- Pages
- 10
- Indexed
- SCIE
- Journal Title
- Arabian Journal For Science and Engineering
- Volume
- 48
- Number
- 2
- Start Page
- 1
- End Page
- 10
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/111274
- DOI
- 10.1007/s13369-022-06874-7
- ISSN
- 2193-567X; 2191-4281
- Abstract
- Object detection is often challenging because objects in an image offer poor visual cues. In this paper, a new efficient deep learning-based detection method, named deeper and wider YOLO (DW-YOLO), is proposed for detecting objects of various sizes viewed from various perspectives. DW-YOLO is based on YOLOv5, and two enhancements have been developed to make the entire network deeper and wider. First, the residual blocks in each cross stage partial structure are optimized to strengthen feature extraction in high-resolution drone images. Second, the entire network is made wider by increasing the number of convolution kernels, with the aim of obtaining more discriminative features to fit complex data. The learning ability of a CNN model is related to its complexity. Making the network deeper increases its complexity, so that feature extraction improves and the relationships between high-dimensional features can be learned more easily. Increasing the network width lets each layer learn richer features in different directions and frequencies. Furthermore, a new large and diverse drone dataset named HDrone is introduced for object detection in real drone-view scenarios. This dataset involves six types of annotations across a wide range of scenarios and is not limited to traffic scenes. Experiments were run on three datasets: HDrone and VisDrone for drone vision, and KITTI for self-driving. The results show that the proposed DW-YOLO achieves state-of-the-art results and detects small-scale objects well along with large-scale objects.
- Appears in Collections
- COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles
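The abstract's "deeper and wider" idea can be illustrated with the depth and width multipliers that the public YOLOv5 code base applies when it builds a model. The sketch below is only an illustration under assumptions: the stage list and the multiplier values are hypothetical, the function names are invented for this example, and the abstract does not state the settings DW-YOLO actually uses or how its residual blocks are optimized.

```python
import math


def scale_depth(repeats, depth_multiple):
    """Scale the number of residual blocks in a stage (YOLOv5-style rule).

    Stages with a single block are left unchanged; others are scaled
    and never drop below one block.
    """
    if repeats <= 1:
        return repeats
    return max(round(repeats * depth_multiple), 1)


def scale_width(channels, width_multiple, divisor=8):
    """Scale the number of convolution kernels and round up to a multiple of `divisor`."""
    return math.ceil(channels * width_multiple / divisor) * divisor


def scale_stages(stages, depth_multiple, width_multiple):
    """Apply both multipliers to a list of (repeats, channels) stages."""
    return [
        (scale_depth(r, depth_multiple), scale_width(c, width_multiple))
        for r, c in stages
    ]


# Hypothetical backbone stages: (residual-block repeats, output channels).
BASE_STAGES = [(3, 128), (6, 256), (9, 512), (3, 1024)]

# Illustrative multipliers only; not the values reported for DW-YOLO.
small = scale_stages(BASE_STAGES, depth_multiple=0.33, width_multiple=0.50)
large = scale_stages(BASE_STAGES, depth_multiple=1.33, width_multiple=1.25)

print("smaller:", small)
print("deeper and wider:", large)
```

Raising the depth multiplier adds residual blocks per stage, and raising the width multiplier adds kernels per convolution layer, which is the trade of extra computation for capacity that the abstract describes.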