ESG-Kor: A Korean Dataset for ESG-related Information Extraction and Practical Use Cases

Lee, Jaeyoung; Son, Geonyeong; Kim, Misuk

doi:10.18653/v1/2024.findings-emnlp.387

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Jaeyoung	-
dc.contributor.author	Son, Geonyeong	-
dc.contributor.author	Kim, Misuk	-
dc.date.accessioned	2025-03-11T00:30:13Z	-
dc.date.available	2025-03-11T00:30:13Z	-
dc.date.issued	2024-11	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206721	-
dc.description.abstract	With the growing use of pre-trained language models in recent years, datasets for tasks in specialized domains have become significantly more important. We therefore built ESG-Kor, a Korean dataset for automatically extracting Environmental, Social, and Governance (ESG) information, which has recently gained importance. ESG-Kor consists of 118,946 sentences covering each ESG component, extracted from Korean companies' sustainability reports and manually labeled according to objective rules provided by ESG evaluation agencies. To verify the effectiveness and applicability of ESG-Kor, we evaluated classification performance with several Korean pre-trained language models and obtained significant performance. Additionally, by extending the ESG classification model to documents of small and medium enterprises and extracting information based on ESG key issues and in-depth analysis, we demonstrate potential and practical use cases in the ESG field.	-
dc.format.extent	17	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Association for Computational Linguistics (ACL)	-
dc.title	ESG-Kor: A Korean Dataset for ESG-related Information Extraction and Practical Use Cases	-
dc.type	Article	-
dc.identifier.doi	10.18653/v1/2024.findings-emnlp.387	-
dc.identifier.scopusid	2-s2.0-85217623106	-
dc.identifier.bibliographicCitation	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024, pp 6627 - 6643	-
dc.citation.title	EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2024	-
dc.citation.startPage	6627	-
dc.citation.endPage	6643	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	Classification (of information)	-
dc.subject.keywordPlus	Computational linguistics	-
dc.subject.keywordPlus	Data mining	-
dc.subject.keywordPlus	Modeling languages	-
dc.identifier.url	https://aclanthology.org/2024.findings-emnlp.387/	-

Appears in Collections: Seoul College of Engineering > ETC > 1. Journal Articles

Related Researcher

MISUK, KIM: COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)
