NovoRank: Refinement for De Novo Peptide Sequencing Based on Spectral Clustering and Deep Learning
- Authors
- Seo, Jangho; Choi, Seunghyuk; Paek, Eunok
- Issue Date
- Feb-2025
- Publisher
- American Chemical Society
- Keywords
- bioinformatics; proteomics; peptide identification; <italic>de novo</italic> peptide sequencing; spectralclustering; deep learning
- Citation
- Journal of Proteome Research, v.24, no.2, pp 903 - 910
- Pages
- 8
- Indexed
- SCIE
SCOPUS
- Journal Title
- Journal of Proteome Research
- Volume
- 24
- Number
- 2
- Start Page
- 903
- End Page
- 910
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212200
- DOI
- 10.1021/acs.jproteome.4c00300
- ISSN
- 1535-3893
1535-3907
- Abstract
- De novo peptide sequencing is a valuable technique in mass-spectrometry-based proteomics, as it deduces peptide sequences directly from tandem mass spectra without relying on sequence databases. This database-independent method, however, relies solely on imperfect scoring functions that often lead to erroneous peptide identifications. To boost correct identification, we present NovoRank, a postprocessing tool that employs spectral clustering and machine learning to assign more plausible peptide sequences to spectra. Prior to de novo peptide sequencing, spectral clustering is applied to group similar spectra under the assumption that they originated from the same peptide species. NovoRank then employs a deep learning model, incorporating both cluster-derived proteomic features and individual spectrum characteristics, to rerank the candidate peptides produced by de novo peptide sequencing. Our results show that NovoRank significantly enhances the performance of various de novo peptide sequencing tools, increasing both recall and precision by 0.020 to 0.080 at the peptide-spectrum match (PSM) level. Notably, NovoRank achieves a recall as high as 0.830 for Casanovo at the PSM level. The source code of NovoRank is freely available at https://github.com/HanyangBISLab/NovoRank and is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.