다중 기계학습 방법을 이용한 한국어 커뮤니티 기반 질의-응답 시스템

권순재; 김주애; 강상우; 서정연

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

다중 기계학습 방법을 이용한 한국어 커뮤니티 기반 질의-응답 시스템

Full metadata record

DC Field	Value	Language
dc.contributor.author	권순재	-
dc.contributor.author	김주애	-
dc.contributor.author	강상우	-
dc.contributor.author	서정연	-
dc.date.available	2020-02-28T04:43:34Z	-
dc.date.created	2020-02-12	-
dc.date.issued	2016	-
dc.identifier.issn	2383-630X	-
dc.identifier.uri	https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/9148	-
dc.description.abstract	커뮤니티 기반 질의 응답 시스템은 사용자 질의에 대한 정답을 인터넷 커뮤니티에 사용자들이게시했던 문서 중에서 선택하여 제공하는 시스템이다. 기존 방법들은 질의 분석의 성능 향상을 위하여 목적 영역에 적합한 규칙을 구축하거나 일부 처리 과정에 기계 학습을 적용하였다. 하지만 기존 방법들은적용 영역을 확장하거나 수정하는 경우 많은 비용이 소요되며 경우에 따라서는 시스템이 특정 영역에 과적합되는 경우가 발생한다. 본 논문에서는 커뮤니티 기반 질의-응답 시스템의 효과적인 처리를 위해서 시스템의 각 과정에 적합한 기계 학습 방법을 적용하여 전체 과정을 자동화하는 다중 기계학습 방법을 제안한다. 제안 시스템은 사용자 질의를 분석하는 부분과 정답 문서를 선택하는 부분으로 나눌 수 있다. 질의분석 과정은 질의의 초점 구문을 분석하는 질의 핵심부 추출기와 질의의 주제를 분류하는 질의 유형 분류기로 구성하였으며, 전자는 조건부 무작위장을 사용하고 후자는 지지 벡터 기계를 사용한다. 정답 문서 선택에서는 유사도 측정에서 사용하는 가중치를 인공 신경망으로 학습한다. 또한 인터넷에 커뮤니티에 게시된데이터는 형태소 분석 결과를 신뢰할 수 없는 경우가 많이 발생한다. 따라서 음절 자질을 사용하여 질의를 분석 단계에서 형태소 분석의 영향을 최소화하는 방법을 제안한다. 제안하는 시스템은 Mean Average Precision 기준으로 0.765, R-Precision 기준으로 0.872의 성능을 보여 기존 시스템보다 성능이 우수하다.	-
dc.language	한국어	-
dc.language.iso	ko	-
dc.publisher	한국정보과학회	-
dc.relation.isPartOf	정보과학회논문지	-
dc.title	다중 기계학습 방법을 이용한 한국어 커뮤니티 기반 질의-응답 시스템	-
dc.title.alternative	A Korean Community-based Question Answering System Using Multiple Machine Learning Methods	-
dc.type	Article	-
dc.type.rims	ART	-
dc.description.journalClass	2	-
dc.identifier.bibliographicCitation	정보과학회논문지, v.43, no.10, pp.1085 - 1093	-
dc.identifier.kciid	ART002156002	-
dc.citation.endPage	1093	-
dc.citation.startPage	1085	-
dc.citation.title	정보과학회논문지	-
dc.citation.volume	43	-
dc.citation.number	10	-
dc.contributor.affiliatedAuthor	강상우	-
dc.subject.keywordAuthor	community-based question answering	-
dc.subject.keywordAuthor	related document retrieval	-
dc.subject.keywordAuthor	model ensemble	-
dc.subject.keywordAuthor	document type classification	-
dc.subject.keywordAuthor	focus construction analysis	-
dc.subject.keywordAuthor	natural language processing	-
dc.subject.keywordAuthor	커뮤니티 기반 질의-응답 시스템	-
dc.subject.keywordAuthor	관련 문서 검색	-
dc.subject.keywordAuthor	모델 앙상블	-
dc.subject.keywordAuthor	문서 유형 분류	-
dc.subject.keywordAuthor	초점 구문 분석	-
dc.subject.keywordAuthor	자연어처리	-
dc.description.journalRegisteredClass	kci	-

Files in This Item: There are no files associated with this item.

Appears in Collections: IT융합대학 > 소프트웨어학과 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kang, Sang Woo photo

Kang, Sang Woo: College of IT Convergence (Department of Software)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :4,170,066; Today View :31,884

RSS_1.0 RSS_2.0 ATOM_1.0

1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE