Influence of data source and volume on CNN applications in construction
- Authors
- Rafieizonooz, Mahdi; Pham, Hieu T.T.L.; Han, Sanguk; Seo, JoonOh; Khankhaje, Elnaz
- Issue Date
- Nov-2025
- Publisher
- Elsevier BV
- Keywords
- Construction Safety; Convolutional Neural Networks; Data Source; Data Volume; Image Classification; Image Segmentation; Object Detection; Structural Health Monitoring
- Citation
- Automation in Construction, v.179, pp 1 - 17
- Pages
- 17
- Indexed
- SCIE
SCOPUS
- Journal Title
- Automation in Construction
- Volume
- 179
- Start Page
- 1
- End Page
- 17
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209267
- DOI
- 10.1016/j.autcon.2025.106476
- ISSN
- 0926-5805
1872-7891
- Abstract
- Convolutional Neural Networks (CNNs) are widely used in construction. However, the impact of data characteristics on their performance remains underexplored. This review aims to summarize previous papers from the perspectives of data source and volume and to understand their relationship with accuracy. A literature review of 162 papers revealed that 75% of the papers utilized data from a single source, often resulting in higher performance than multiple and public sources. The mean sample numbers of models with and without pre-training were 9,307 and 554,305, respectively. The review results indicated that the relationship between the number of samples and accuracy was moderately positive, and pre-training may allow for performance improvement even with fewer samples. This review highlights efforts toward improving publicly available data and pre-trained models in the construction community and using diverse data sources for validation to ensure the generalization of CNN algorithms for practical applications.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 건설환경공학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.