Does Localization Inform Unlearning? A Rigorous Examination of Local Parameter Attribution for Knowledge Unlearning in Language Models

Lee, Hwiyeong; Hwang, Uiji; Lim, Hyelim; Kim, Taeuk

doi:10.18653/v1/2025.emnlp-main.1109

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Does Localization Inform Unlearning? A Rigorous Examination of Local Parameter Attribution for Knowledge Unlearning in Language Models

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Hwiyeong	-
dc.contributor.author	Hwang, Uiji	-
dc.contributor.author	Lim, Hyelim	-
dc.contributor.author	Kim, Taeuk	-
dc.date.accessioned	2026-06-16T04:30:30Z	-
dc.date.available	2026-06-16T04:30:30Z	-
dc.date.issued	2025-11	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213282	-
dc.description.abstract	Large language models often retain unintended content, prompting growing interest in knowledge unlearning.Recent approaches emphasize localized unlearning, restricting parameter updates to specific regions in an effort to remove target knowledge while preserving unrelated general knowledge. However, their effectiveness remains uncertain due to the lack of robust and thorough evaluation of the trade-off between the competing goals of unlearning.In this paper, we begin by revisiting existing localized unlearning approaches. We then conduct controlled experiments to rigorously evaluate whether local parameter updates causally contribute to unlearning.Our findings reveal that the set of parameters that must be modified for effective unlearning is not strictly determined, challenging the core assumption of localized unlearning that parameter locality is inherently indicative of effective knowledge removal.	-
dc.format.extent	13	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Association for Computational Linguistics	-
dc.title	Does Localization Inform Unlearning? A Rigorous Examination of Local Parameter Attribution for Knowledge Unlearning in Language Models	-
dc.type	Article	-
dc.identifier.doi	10.18653/v1/2025.emnlp-main.1109	-
dc.identifier.scopusid	2-s2.0-105040218112	-
dc.identifier.bibliographicCitation	EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp 21857 - 21869	-
dc.citation.title	EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference	-
dc.citation.startPage	21857	-
dc.citation.endPage	21869	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scopus	-
dc.identifier.url	https://aclanthology.org/2025.emnlp-main.1109/	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Taeuk photo

Kim, Taeuk: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE