Cited 0 time in
The Antecedents of Transformer Models
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Dennis, Simon | - |
| dc.contributor.author | Shabahang, Kevin | - |
| dc.contributor.author | Yim, Hyungwook | - |
| dc.date.accessioned | 2026-04-29T01:00:09Z | - |
| dc.date.available | 2026-04-29T01:00:09Z | - |
| dc.date.issued | 2025-02 | - |
| dc.identifier.issn | 0963-7214 | - |
| dc.identifier.issn | 1467-8721 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212436 | - |
| dc.description.abstract | Transformer models of language represent a step change in our ability to account for cognitive phenomena. Although the specific architecture that has garnered recent interest is quite young, many of its components have antecedents in the cognitive science literature. In this article, we start by providing an introduction to large language models aimed at a general psychological audience. We then highlight some of the antecedents, including the importance of scale, instance-based memory models, paradigmatic association and systematicity, positional encodings of serial order, and the learning of control processes. This article offers an exploration of the relationship between transformer models and their precursors, showing how they can be understood as a next phase in our understanding of cognitive processes. | - |
| dc.format.extent | 9 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | SAGE Publications | - |
| dc.title | The Antecedents of Transformer Models | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1177/09637214241279504 | - |
| dc.identifier.scopusid | 2-s2.0-86000789554 | - |
| dc.identifier.wosid | 001356691600001 | - |
| dc.identifier.bibliographicCitation | Current Directions in Psychological Science, v.34, no.1, pp 3 - 11 | - |
| dc.citation.title | Current Directions in Psychological Science | - |
| dc.citation.volume | 34 | - |
| dc.citation.number | 1 | - |
| dc.citation.startPage | 3 | - |
| dc.citation.endPage | 11 | - |
| dc.type.docType | Article; Early Access | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | ssci | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Psychology | - |
| dc.relation.journalWebOfScienceCategory | Psychology, Multidisciplinary | - |
| dc.subject.keywordPlus | RECOGNITION MEMORY | - |
| dc.subject.keywordPlus | SERIAL ORDER | - |
| dc.subject.keywordPlus | ACQUISITION | - |
| dc.subject.keywordPlus | RETRIEVAL | - |
| dc.subject.keywordAuthor | large language models | - |
| dc.subject.keywordAuthor | cognitive psychology | - |
| dc.identifier.url | https://journals.sagepub.com/doi/10.1177/09637214241279504 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
