Cited 0 time in
Bootstrap Your Own PLM: Boosting Semantic Features of PLMs for Unsuperivsed Contrastive Learning
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jeong, Yoo Hyun | - |
| dc.contributor.author | Han, Myeongsoo | - |
| dc.contributor.author | Chae, Dong-Kyu | - |
| dc.date.accessioned | 2025-03-18T04:30:18Z | - |
| dc.date.available | 2025-03-18T04:30:18Z | - |
| dc.date.issued | 2024-03 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206807 | - |
| dc.description.abstract | This paper aims to investigate the possibility of exploiting original semantic features of PLMs (pre-trained language models) during contrastive learning in the context of SRL (sentence representation learning). In the context of feature modification, we identified a method called IFM (implicit feature modification), which reduces the tendency of contrastive models for VRL (visual representation learning) to rely on feature-suppressing short-cut solutions. We observed that IFM did not work well for SRL, which may be due to differences between the nature of VRL and SRL. We propose BYOP, which boosts well-represented features, taking the opposite idea of IFM, under the assumption that SimCSE's dropout-noise-based augmentation may be too simple to modify high-level semantic features, and that the features learned by PLMs are semantically meaningful and should be boosted, rather than removed. Extensive experiments lend credence to the logic of BYOP, which considers the nature of SRL. Our code is publicly available at https://github.com/myngsooo/BYOP. | - |
| dc.format.extent | 10 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | ASSOC COMPUTATIONAL LINGUISTICS-ACL | - |
| dc.title | Bootstrap Your Own PLM: Boosting Semantic Features of PLMs for Unsuperivsed Contrastive Learning | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.scopusid | 2-s2.0-85188729087 | - |
| dc.identifier.wosid | 001356735800038 | - |
| dc.identifier.bibliographicCitation | FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: EACL 2024, pp 560 - 569 | - |
| dc.citation.title | FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: EACL 2024 | - |
| dc.citation.startPage | 560 | - |
| dc.citation.endPage | 569 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordPlus | Computational linguistics | - |
| dc.subject.keywordPlus | Learning systems | - |
| dc.identifier.url | https://aclanthology.org/2024.findings-eacl.38/ | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
