Cited 0 time in
Simple Temperature Cool-down in Contrastive Framework for Unsupervised Sentence Representation Learning
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jeong, Yoo Hyun | - |
| dc.contributor.author | Han, Myeongsoo | - |
| dc.contributor.author | Chae, Dong-Kyu | - |
| dc.date.accessioned | 2025-03-18T04:30:16Z | - |
| dc.date.available | 2025-03-18T04:30:16Z | - |
| dc.date.issued | 2024-03 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206806 | - |
| dc.description.abstract | In this paper, we propose a simple, tricky method to improve sentence representation of unsupervised contrastive learning. Even though contrastive learning has achieved great performances in both visual representation learning (VRL) and sentence representation learning (SRL) fields, we focus on the fact that there is a gap between the characteristics and training dynamics of VRL and SRL. We first examine the role of temperature to bridge the gap between VRL and SRL, and find some temperature-dependent elements in SRL; i.e., a higher temperature causes overfitting of the uniformity while improving the alignment in the earlier phase of training. Then, we design a temperature cool-down technique based on this observation, which helps PLMs to be more suitable for contrastive learning via the preparation of uniform representation space. Our experimental results on widely-utilized benchmarks demonstrate the effectiveness and an extensibility of our method. Our code is publicly available at https://github.com/myngsooo/Cooldown. | - |
| dc.format.extent | 10 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | ASSOC COMPUTATIONAL LINGUISTICS-ACL | - |
| dc.title | Simple Temperature Cool-down in Contrastive Framework for Unsupervised Sentence Representation Learning | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.scopusid | 2-s2.0-85188706415 | - |
| dc.identifier.wosid | 001356735800037 | - |
| dc.identifier.bibliographicCitation | FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: EACL 2024, pp 550 - 559 | - |
| dc.citation.title | FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: EACL 2024 | - |
| dc.citation.startPage | 550 | - |
| dc.citation.endPage | 559 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordPlus | Learning systems | - |
| dc.identifier.url | https://aclanthology.org/2024.findings-eacl.37/ | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
