NysAct: A Scalable Preconditioned Gradient Descent using Nyström Approximation
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 고현석 | - |
dc.date.accessioned | 2025-01-13T06:00:20Z | - |
dc.date.available | 2025-01-13T06:00:20Z | - |
dc.date.issued | 2024-12 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/121997 | - |
dc.description.abstract | Adaptive gradient methods are computationally efficient and converge quickly, but they often suffer from poor generalization. In contrast, second-order methods enhance convergence and generalization but typically incur high computational and memory costs. In this work, we introduce NYSACT, a scalable first-order gradient preconditioning method that strikes a balance between state-of-the-art first-order and second-order optimization methods. NYSACT leverages an eigenvalue-shifted Nyström method to approximate the activation covariance matrix, which is used as a preconditioning matrix, significantly reducing time and memory complexities with minimal impact on test accuracy. Our experiments show that NYSACT not only achieves improved test accuracy compared to both first-order and second-order methods but also demands considerably less computational resources than existing second-order methods. | - |
dc.format.extent | 8 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE | - |
dc.title | NysAct: A Scalable Preconditioned Gradient Descent using Nyström Approximation | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/BigData62323.2024.10825352 | - |
dc.identifier.scopusid | 2-s2.0-85217998577 | - |
dc.identifier.bibliographicCitation | IEEE International Conference on BigData, pp 1442 - 1449 | - |
dc.citation.title | IEEE International Conference on BigData | - |
dc.citation.startPage | 1442 | - |
dc.citation.endPage | 1449 | - |
dc.type.docType | Proceeding | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordAuthor | Deep learning optimization | - |
dc.subject.keywordAuthor | Gradient preconditioning | - |
dc.subject.keywordAuthor | Nyström approximation | - |
dc.identifier.url | https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10825352 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.