The Antecedents of Transformer Modelsopen access
- Authors
- Dennis, Simon; Shabahang, Kevin; Yim, Hyungwook
- Issue Date
- Feb-2025
- Publisher
- SAGE Publications
- Keywords
- large language models; cognitive psychology
- Citation
- Current Directions in Psychological Science, v.34, no.1, pp 3 - 11
- Pages
- 9
- Indexed
- SSCI
SCOPUS
- Journal Title
- Current Directions in Psychological Science
- Volume
- 34
- Number
- 1
- Start Page
- 3
- End Page
- 11
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212436
- DOI
- 10.1177/09637214241279504
- ISSN
- 0963-7214
1467-8721
- Abstract
- Transformer models of language represent a step change in our ability to account for cognitive phenomena. Although the specific architecture that has garnered recent interest is quite young, many of its components have antecedents in the cognitive science literature. In this article, we start by providing an introduction to large language models aimed at a general psychological audience. We then highlight some of the antecedents, including the importance of scale, instance-based memory models, paradigmatic association and systematicity, positional encodings of serial order, and the learning of control processes. This article offers an exploration of the relationship between transformer models and their precursors, showing how they can be understood as a next phase in our understanding of cognitive processes.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.