Cited 0 time in
Toward construction-specialized, small language models: The interplay of domain adaptation, model scale and data volume
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Wang, Shuyi | - |
| dc.contributor.author | Fu, Yuguang | - |
| dc.contributor.author | Kim, Jinwoo | - |
| dc.date.accessioned | 2025-12-02T05:00:19Z | - |
| dc.date.available | 2025-12-02T05:00:19Z | - |
| dc.date.issued | 2026-01 | - |
| dc.identifier.issn | 1474-0346 | - |
| dc.identifier.issn | 1873-5320 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209431 | - |
| dc.description.abstract | While language models (LMs) are central to construction digitalization and automation, existing general-purpose LMs struggle with complex engineering contexts and domain-aligned responses. This study presents construction-specialized LMs at large, medium and small scales using four representative domain adaptation strategies: prompt engineering, retrieval-augmented generation, task-specific fine-tuning and pretraining-and-fine-tuning. Evaluated on a construction-specific question answering (QA) dataset, we show that a small-scale LM adapted via pretraining-and-fine-tuning achieves the best performance, improving F1-score by 14.6 %, S-BERT by 10.2 % and inference speed fourfold over larger-scale counterparts. Further evaluation across data regimes-from zero-shot to many-shot-reveals that training-free adaptations (prompt engineering and retrieval-augmented generation) on large-scale models excels in data-scarce settings, whereas training-required strategies (task-specific fine-tuning and pretraining-and-fine-tuning) unlock the potential of smaller models under sufficient supervision. These findings illuminate the interplay among domain adaptation strategies, model scale and data volume, providing a roadmap for developing more scalable, construction-specialized LMs in diverse field conditions. | - |
| dc.format.extent | 16 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Pergamon Press Ltd. | - |
| dc.title | Toward construction-specialized, small language models: The interplay of domain adaptation, model scale and data volume | - |
| dc.type | Article | - |
| dc.publisher.location | 영국 | - |
| dc.identifier.doi | 10.1016/j.aei.2025.104035 | - |
| dc.identifier.scopusid | 2-s2.0-105022207656 | - |
| dc.identifier.wosid | 001612271900003 | - |
| dc.identifier.bibliographicCitation | Advanced Engineering Informatics, v.69, pp 1 - 16 | - |
| dc.citation.title | Advanced Engineering Informatics | - |
| dc.citation.volume | 69 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 16 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | - |
| dc.subject.keywordPlus | Artificial intelligence | - |
| dc.subject.keywordPlus | Digital storage | - |
| dc.subject.keywordPlus | Question answering | - |
| dc.subject.keywordAuthor | Language model | - |
| dc.subject.keywordAuthor | Construction-specialized | - |
| dc.subject.keywordAuthor | Question answering (QA) | - |
| dc.subject.keywordAuthor | Domain adaptation | - |
| dc.subject.keywordAuthor | Model scale | - |
| dc.subject.keywordAuthor | Data volume | - |
| dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S1474034625009280?via%3Dihub | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
