사전 학습된 언어 모델에서의 성별 편향 측정Measuring Gender Bias in Pre-trained Language Models
- Other Titles
- Measuring Gender Bias in Pre-trained Language Models
- Authors
- 김용우; 우상미; 김영민
- Issue Date
- Mar-2025
- Publisher
- 한국경영공학회
- Keywords
- Gender bias; Stereotypes; Pre-trained Language Model; WinoBias
- Citation
- 한국경영공학회지, v.30, no.1, pp 41 - 54
- Pages
- 14
- Indexed
- KCI
- Journal Title
- 한국경영공학회지
- Volume
- 30
- Number
- 1
- Start Page
- 41
- End Page
- 54
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207111
- ISSN
- 2005-7776
2713-573X
- Abstract
- Purpose This study aims to analyze and quantify gender bias in pre-trained language models by examining their behavior and proposing new metrics for bias assessment.
Methods In this study, we utilized the WinoBias benchmark dataset to measure gender stereotype and skew through tasks evaluating model outputs in pro- and anti-stereotypic scenarios. Unlike traditional methods that measure genter stereotype and skew using model outputs, this study proposes a new bias measurement approach based on the model's probability values.
Results Unlike previous studies that showed a conflicting relationship between gender stereotype and skew, measurements using the proposed metric revealed a positive correlation between the two, diverging from the results of conventional metrics.
Conclusion The study highlights the need to adopt and improve alternative metrics to better assess and address bias in language models.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 서울 산업융합학부 > 서울 산업융합학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.