SANTM: Efficient Self-attention-driven Network for Text Matching

Tiwari, Prayag; Jaiswal, Amit Kumar; Garg, Sahil; You, Ilsun

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

SANTM: Efficient Self-attention-driven Network for Text Matching

Full metadata record

DC Field	Value	Language
dc.contributor.author	Tiwari, Prayag	-
dc.contributor.author	Jaiswal, Amit Kumar	-
dc.contributor.author	Garg, Sahil	-
dc.contributor.author	You, Ilsun	-
dc.date.accessioned	2022-11-29T04:40:51Z	-
dc.date.available	2022-11-29T04:40:51Z	-
dc.date.issued	2022-08	-
dc.identifier.issn	1533-5399	-
dc.identifier.issn	1557-6051	-
dc.identifier.uri	https://scholarworks.bwise.kr/sch/handle/2021.sw.sch/21741	-
dc.description.abstract	Self-attention mechanisms have recently been embraced for a broad range of text-matching applications. Self-attention model takes only one sentence as an input with no extra information, i.e., one can utilize the final hidden state or pooling. However, text-matching problems can be interpreted either in symmetrical or asymmetrical scopes. For instance, paraphrase detection is an asymmetrical task, while textual entailment classification and question-answer matching are considered asymmetrical tasks. In this article, we leverage attractive properties of self-attention mechanism and proposes an attention-based network that incorporates three key components for inter-sequence attention: global pointwise features, preceding attentive features, and contextual features while updating the rest of the components. Our model follows evaluation on two benchmark datasets cover tasks of textual entailment and question-answer matching. The proposed efficient Self-attention-driven Network for Text Matching outperforms the state of the art on the Stanford Natural Language Inference and WikiQA datasets with much fewer parameters.	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Association for Computing Machinary, Inc.	-
dc.title	SANTM: Efficient Self-attention-driven Network for Text Matching	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1145/3426971	-
dc.identifier.wosid	000844323400002	-
dc.identifier.bibliographicCitation	ACM Transactions on Internet Technology, v.22, no.3	-
dc.citation.title	ACM Transactions on Internet Technology	-
dc.citation.volume	22	-
dc.citation.number	3	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Computer Science, Software Engineering	-
dc.subject.keywordPlus	ENVIRONMENT	-
dc.subject.keywordAuthor	Text matching	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	attention mechanism	-

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show simple item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :1,423,001; Today View :562

RSS_1.0 RSS_2.0 ATOM_1.0

(31538) 22, Soonchunhyang-ro, Asan-si, Chungcheongnam-do, Republic of Korea+82-41-530-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE