DeepCOI: a large language model-driven framework for fast and accurate taxonomic assignment in animal metabarcoding
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Gwak, Ho-Jin | - |
| dc.contributor.author | Rho, Mina | - |
| dc.date.accessioned | 2025-12-01T07:01:30Z | - |
| dc.date.available | 2025-12-01T07:01:30Z | - |
| dc.date.issued | 2026-03 | - |
| dc.identifier.issn | 1474-7596 | - |
| dc.identifier.issn | 1474-760X | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209403 | - |
| dc.description.abstract | Metabarcoding remains challenging due to incomplete taxonomic annotations and computationally intensive processes. We present DeepCOI, a large language model-based classifier pre-trained on seven million cytochrome c oxidase I gene sequences. DeepCOI enables fast and accurate taxonomic assignment across eight major phyla, achieving an AU-ROC of 0.958 and AU-PR of 0.897, outperforming existing methods while significantly reducing inference time. Additionally, DeepCOI demonstrates interpretability by identifying taxonomically informative sequence positions. By integrating large-scale datasets and self-supervised learning, DeepCOI enhances both the accuracy and efficiency of metabarcoding processes, providing a scalable solution for biodiversity assessment and environmental monitoring. | - |
| dc.format.extent | 20 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | BioMed Central | - |
| dc.title | DeepCOI: a large language model-driven framework for fast and accurate taxonomic assignment in animal metabarcoding | - |
| dc.type | Article | - |
| dc.publisher.location | United Kingdom | - |
| dc.identifier.doi | 10.1186/s13059-025-03861-7 | - |
| dc.identifier.scopusid | 2-s2.0-105022133918 | - |
| dc.identifier.wosid | 001617745400003 | - |
| dc.identifier.bibliographicCitation | Genome Biology, v.26, no.1, pp 1 - 20 | - |
| dc.citation.title | Genome Biology | - |
| dc.citation.volume | 26 | - |
| dc.citation.number | 1 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 20 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Biotechnology & Applied Microbiology | - |
| dc.relation.journalResearchArea | Genetics & Heredity | - |
| dc.relation.journalWebOfScienceCategory | Biotechnology & Applied Microbiology | - |
| dc.relation.journalWebOfScienceCategory | Genetics & Heredity | - |
| dc.subject.keywordPlus | BARCODE | - |
| dc.subject.keywordPlus | SYSTEM | - |
| dc.subject.keywordAuthor | Metabarcoding | - |
| dc.subject.keywordAuthor | Metagenomics | - |
| dc.subject.keywordAuthor | COI genes | - |
| dc.subject.keywordAuthor | Language model | - |
| dc.subject.keywordAuthor | Self-supervised learning | - |
| dc.subject.keywordAuthor | Explainable AI | - |
| dc.identifier.url | https://link.springer.com/article/10.1186/s13059-025-03861-7 | - |
