PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection

Park, Someen; Kim, Jaehoon; Jin, Seungwan; Park, Sohyun; Han, Kyungsik

doi:10.18653/v1/2024.emnlp-main.1166

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection

Full metadata record

DC Field	Value	Language
dc.contributor.author	Park, Someen	-
dc.contributor.author	Kim, Jaehoon	-
dc.contributor.author	Jin, Seungwan	-
dc.contributor.author	Park, Sohyun	-
dc.contributor.author	Han, Kyungsik	-
dc.date.accessioned	2025-03-11T02:30:18Z	-
dc.date.available	2025-03-11T02:30:18Z	-
dc.date.issued	2024-11	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206736	-
dc.description.abstract	While a few public benchmarks have been proposed for training hate speech detection models, the differences in labeling criteria between these benchmarks pose challenges for generalized learning, limiting the applicability of the models. Previous research has presented methods to generalize models through data integration or augmentation, but overcoming the differences in labeling criteria between datasets remains a limitation. To address these challenges, we propose PREDICT, a novel framework that uses the notion of multi-agent for hate speech detection. PREDICT consists of two phases: (1) PRE (Perspective-based REasoning): Multiple agents are created based on the induced labeling criteria of given datasets, and each agent generates stances and reasons; (2) DICT (Debate using InCongruenT references): Agents representing hate and non-hate stances conduct the debate, and a judge agent classifies hate or non-hate and provides a balanced reason. Experiments on five representative public benchmarks show that PREDICT achieves superior cross-evaluation performance compared to methods that focus on specific labeling criteria or majority voting methods. Furthermore, we validate that PREDICT effectively mediates differences between agents' opinions and appropriately incorporates minority opinions to reach a consensus. Our code is available at https://github.com/Hanyang-HCCLab/PREDICT.	-
dc.format.extent	25	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Association for Computational Linguistics (ACL)	-
dc.title	PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection	-
dc.type	Article	-
dc.identifier.doi	10.18653/v1/2024.emnlp-main.1166	-
dc.identifier.scopusid	2-s2.0-85217792624	-
dc.identifier.bibliographicCitation	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp 20963 - 20987	-
dc.citation.title	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference	-
dc.citation.startPage	20963	-
dc.citation.endPage	20987	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	Benchmarking	-
dc.subject.keywordPlus	Labeled data	-
dc.subject.keywordPlus	Prediction models	-
dc.subject.keywordPlus	Speech recognition	-
dc.identifier.url	https://aclanthology.org/2024.emnlp-main.1166/	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Han, Kyungsik photo

Han, Kyungsik: COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE