Enhancing Maritime Data Integration for Platform Services with Sequence-to-Sequence Models and Statistical Refinementopen access
- Authors
- Hwang, Hyoseong; Wong, Richard; Lim, Ducsun; Kang, Jonggu; Joe, Inwhee
- Issue Date
- Mar-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Marine vehicles; Data models; Data integration; DSL; Biological system modeling; Accuracy; Transformers; Seaports; Interoperability; Computational modeling; Data collection; domain specific languages (DSL); sequence to sequence model; data platform
- Citation
- IEEE Access, v.13, pp 58636 - 58648
- Pages
- 13
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 13
- Start Page
- 58636
- End Page
- 58648
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207289
- DOI
- 10.1109/ACCESS.2025.3555272
- ISSN
- 2169-3536
2169-3536
- Abstract
- The increasing adoption of IoT devices on ships and the expansion of platform services present a critical challenge in integrating heterogeneous ship data models into a unified platform data model. The task involves mapping ship Domain-Specific Language (DSL) descriptions to platform indices, complicated by variability and class imbalance. To address these challenges, this paper proposes a framework that combines a sequence-to-sequence model with statistical vectorization techniques. The model generates structured mapping classes, offering flexibility to accommodate diverse equipment and attributes, while training exclusively on connected data mitigates class imbalance. Subsequently, statistical vectorization techniques are applied to identify the correct match among the classified candidates, while ensuring that unconnected data is excluded. This two-step approach enhances recall and guarantees accurate relationships between ship DSLs and platform data indices. The proposed framework is validated using real-world data from 52 ships. Experimental results demonstrate that the sequence-to-sequence model with statistical refinement outperformed single-step and discriminative methods in handling class imbalance and variations when mapping ship DSLs to a unified platform data model. Our method achieved a recall of 89.14 and an Fβ-Score of 87.12, which are 4.15 and 1.91 points higher, respectively, than the reference classification method.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.