Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Enhancing Genomic Data Representation through BERT-LSTM Hybrid Architecture

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Kyeong Ho-
dc.contributor.authorKim, Minji-
dc.contributor.authorKim, Sohui-
dc.contributor.authorLee, Jong-Min-
dc.date.accessioned2026-05-27T00:30:38Z-
dc.date.available2026-05-27T00:30:38Z-
dc.date.issued2025-04-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212861-
dc.description.abstractThis study proposes a novel approach for effective genetic sequence representation, focusing on the challenges of compressing and analyzing complex genomic data. We introduce a hybrid architecture that combines Bidirectional Encoder Representations from Transformers (BERT) with Long Short-Term Memory (LSTM) networks to generate comprehensive and compact gene embeddings. Our method processes genetic sequence data through k-mer tokenization and employs BERT to capture complex patterns, followed by LSTM to preserve essential sequential information while creating fixed-size representations. Using data from 623 participants from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database, we analyzed genetic sequences across 10 genes to evaluate our approach. The effectiveness of our method is demonstrated through both visualization and quantitative evaluation. The t-distributed stochastic neighbor embedding (t-SNE) visualization revealed improved clustering of gene embeddings compared to traditional approaches, while our model achieved 82% accuracy in gene classification tasks. Our findings indicate that the combination of BERT and LSTM effectively captures both local and global genetic patterns while creating meaningful compressed representations, providing a promising framework for genetic sequence analysis.-
dc.format.extent11-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleEnhancing Genomic Data Representation through BERT-LSTM Hybrid Architecture-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ACCESS.2025.3560282-
dc.identifier.scopusid2-s2.0-105003188956-
dc.identifier.wosid001483881100024-
dc.identifier.bibliographicCitationIEEE ACCESS, v.13, pp 76497 - 76507-
dc.citation.titleIEEE ACCESS-
dc.citation.volume13-
dc.citation.startPage76497-
dc.citation.endPage76507-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusGENETICS-
dc.subject.keywordAuthorGenetics-
dc.subject.keywordAuthorGenomics-
dc.subject.keywordAuthorBioinformatics-
dc.subject.keywordAuthorTransformers-
dc.subject.keywordAuthorLong short term memory-
dc.subject.keywordAuthorTokenization-
dc.subject.keywordAuthorEncoding-
dc.subject.keywordAuthorBidirectional control-
dc.subject.keywordAuthorData models-
dc.subject.keywordAuthorSequences-
dc.subject.keywordAuthorBERT-
dc.subject.keywordAuthorgene embedding-
dc.subject.keywordAuthorLSTM-
dc.subject.keywordAuthorrepresentation learning-
dc.subject.keywordAuthorSNP-
dc.subject.keywordAuthortokenization-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10964250-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Jong Min photo

Lee, Jong Min
COLLEGE OF ENGINEERING (서울 바이오메디컬공학전공)
Read more

Altmetrics

Total Views & Downloads

BROWSE