Cited 0 time in
Does Localization Inform Unlearning? A Rigorous Examination of Local Parameter Attribution for Knowledge Unlearning in Language Models
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Hwiyeong | - |
| dc.contributor.author | Hwang, Uiji | - |
| dc.contributor.author | Lim, Hyelim | - |
| dc.contributor.author | Kim, Taeuk | - |
| dc.date.accessioned | 2026-06-16T04:30:30Z | - |
| dc.date.available | 2026-06-16T04:30:30Z | - |
| dc.date.issued | 2025-11 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213282 | - |
| dc.description.abstract | Large language models often retain unintended content, prompting growing interest in knowledge unlearning.Recent approaches emphasize localized unlearning, restricting parameter updates to specific regions in an effort to remove target knowledge while preserving unrelated general knowledge. However, their effectiveness remains uncertain due to the lack of robust and thorough evaluation of the trade-off between the competing goals of unlearning.In this paper, we begin by revisiting existing localized unlearning approaches. We then conduct controlled experiments to rigorously evaluate whether local parameter updates causally contribute to unlearning.Our findings reveal that the set of parameters that must be modified for effective unlearning is not strictly determined, challenging the core assumption of localized unlearning that parameter locality is inherently indicative of effective knowledge removal. | - |
| dc.format.extent | 13 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Association for Computational Linguistics | - |
| dc.title | Does Localization Inform Unlearning? A Rigorous Examination of Local Parameter Attribution for Knowledge Unlearning in Language Models | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.18653/v1/2025.emnlp-main.1109 | - |
| dc.identifier.scopusid | 2-s2.0-105040218112 | - |
| dc.identifier.bibliographicCitation | EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp 21857 - 21869 | - |
| dc.citation.title | EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference | - |
| dc.citation.startPage | 21857 | - |
| dc.citation.endPage | 21869 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.identifier.url | https://aclanthology.org/2025.emnlp-main.1109/ | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
