Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection

Authors
Park, SomeenKim, JaehoonJin, SeungwanPark, SohyunHan, Kyungsik
Issue Date
Nov-2024
Publisher
Association for Computational Linguistics (ACL)
Citation
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp 20963 - 20987
Pages
25
Indexed
SCOPUS
Journal Title
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
Start Page
20963
End Page
20987
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206736
DOI
10.18653/v1/2024.emnlp-main.1166
Abstract
While a few public benchmarks have been proposed for training hate speech detection models, the differences in labeling criteria between these benchmarks pose challenges for generalized learning, limiting the applicability of the models. Previous research has presented methods to generalize models through data integration or augmentation, but overcoming the differences in labeling criteria between datasets remains a limitation. To address these challenges, we propose PREDICT, a novel framework that uses the notion of multi-agent for hate speech detection. PREDICT consists of two phases: (1) PRE (Perspective-based REasoning): Multiple agents are created based on the induced labeling criteria of given datasets, and each agent generates stances and reasons; (2) DICT (Debate using InCongruenT references): Agents representing hate and non-hate stances conduct the debate, and a judge agent classifies hate or non-hate and provides a balanced reason. Experiments on five representative public benchmarks show that PREDICT achieves superior cross-evaluation performance compared to methods that focus on specific labeling criteria or majority voting methods. Furthermore, we validate that PREDICT effectively mediates differences between agents' opinions and appropriately incorporates minority opinions to reach a consensus. Our code is available at https://github.com/Hanyang-HCCLab/PREDICT.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Han, Kyungsik photo

Han, Kyungsik
COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)
Read more

Altmetrics

Total Views & Downloads

BROWSE