Computer Science Faculty Publications

KoSBI: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Applications

Hwaran Lee
Seokhee Hong
Joonsuk Park, University of RichmondFollow
Takyoung Kim
Gunhee Kim
Jung-Woo Ha

DOI

10.18653/v1/2023.acl-industry.21

Abstract

Large language models (LLMs) not only learn natural text generation abilities but also social biases against different demographic groups from real-world data. This poses a critical risk when deploying LLM-based applications. Existing research and resources are not readily applicable in South Korea due to the differences in language and culture, both of which significantly affect the biases and targeted demographic groups. This limitation requires localized social bias datasets to ensure the safe and effective deployment of LLMs. To this end, we present KosBi, a new social bias dataset of 34k pairs of contexts and sentences in Korean covering 72 demographic groups in 15 categories. We find that through filtering-based moderation, social biases in generated content can be reduced by 16.47%p on average for HyperClova (30B and 82B), and GPT-3.

Document Type

Conference Proceeding

Publication Date

7-2023

Publisher Statement

ACL materials are Copyright © 1963–2024 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.

Recommended Citation

Hwaran Lee, Seokhee Hong, Joonsuk Park, Takyoung Kim, Gunhee Kim, and Jung-woo Ha. 2023. KoSBI: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Applications. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), pages 208–224, Toronto, Canada. Association for Computational Linguistics. DOI: 10.18653/v1/2023.acl-industry.21

Link to Full Text

COinS

UR Scholarship Repository

Computer Science Faculty Publications

KoSBI: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Applications

DOI

Abstract

Document Type

Publication Date

Publisher Statement

Recommended Citation

Search

Browse

Author Corner

UR Scholarship Repository

Computer Science Faculty Publications

KoSBI: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Applications

Authors

DOI

Abstract

Document Type

Publication Date

Publisher Statement

Recommended Citation

Share

Search

Browse

Author Corner