Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Semantic Alignment with Calibrated Similarity for Multilingual Sentence Embedding

Full metadata record
DC Field Value Language
dc.contributor.authorHam, Jiyeon-
dc.contributor.authorKim, Eun-Sol-
dc.date.accessioned2024-12-05T00:00:14Z-
dc.date.available2024-12-05T00:00:14Z-
dc.date.issued2021-11-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/200317-
dc.description.abstractMeasuring the similarity score between a pair of sentences in different languages is the essential requisite for multilingual sentence embedding methods. Predicting the similarity score consists of two sub-tasks, which are monolingual similarity evaluation and multilingual sentence retrieval. However, conventional methods have mainly tackled only one of the sub-tasks and therefore showed biased performances. In this paper, we suggest a novel and strong method for multilingual sentence embedding, which shows performance improvement on both sub-tasks, consequently resulting in robust predictions of multilingual similarity scores. The suggested method consists of two parts: to learn semantic similarity of sentences in the pivot language and then to extend the learned semantic structure to different languages. To align semantic structures across different languages, we introduce a teacher-student network. The teacher network distills the knowledge of the pivot language to different languages of the student network. During the distillation, the parameters of the teacher network are updated with the slow-moving average. Together with the distillation and the parameter updating, the semantic structure of the student network can be directly aligned across different languages while preserving the ability to measure the semantic similarity. Thus, the multilingual training method drives performance improvement on multilingual similarity evaluation. The suggested model achieves the state-of-the-art performance on extended STS 2017 multilingual similarity evaluation as well as two sub-tasks, which are extended STS 2017 monolingual similarity evaluation and Tatoeba multilingual retrieval in 14 languages.-
dc.format.extent11-
dc.language영어-
dc.language.isoENG-
dc.publisherASSOC COMPUTATIONAL LINGUISTICS-ACL-
dc.titleSemantic Alignment with Calibrated Similarity for Multilingual Sentence Embedding-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.18653/v1/2021.findings-emnlp.153-
dc.identifier.scopusid2-s2.0-85129003504-
dc.identifier.wosid001181828800061-
dc.identifier.bibliographicCitationFINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, pp 1781 - 1791-
dc.citation.titleFINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021-
dc.citation.startPage1781-
dc.citation.endPage1791-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordPlusComputational linguistics-
dc.subject.keywordPlusEmbeddings-
dc.subject.keywordPlusNatural language processing systems-
dc.subject.keywordPlusSemantics-
dc.subject.keywordPlusStudents-
dc.identifier.urlhttps://aclanthology.org/2021.findings-emnlp.153/-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Eun Sol photo

Kim, Eun Sol
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE