Feature Selection for High-Dimensional Data: A Case Study of NFT Valuationopen access
- Authors
- Lee, Geun-cheol; Lee, Heejung; Koo, Hoon-Young
- Issue Date
- Jan-2026
- Publisher
- ENGINEERING & TECHNOLOGY PUBLISHING
- Keywords
- Azuki; hedonic model; high-dimensional data; NFT valuation; Non-Fungible Token (NFT); Term Frequency-Inverse Document Frequency (TF-IDF); variable selection
- Citation
- JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, v.17, no.1, pp 141 - 152
- Pages
- 12
- Indexed
- SCOPUS
ESCI
- Journal Title
- JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY
- Volume
- 17
- Number
- 1
- Start Page
- 141
- End Page
- 152
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210918
- DOI
- 10.12720/jait.17.1.141-152
- ISSN
- 1798-2340
1798-2340
- Abstract
- In this study, we propose hedonic models for valuing Non-Fungible Tokens (NFTs) from the Azuki collection. We first analyze the NFT’s metadata and introduce a market volatility-robust dependent variable. Specific information of Azuki attributes is encoded via Term Frequency-Inverse Document Frequency (TF-IDF) to reflect both presence and collection-wide scarcity, yielding hundreds of features for each token. Two hedonic models are considered: a linear model and a squared model. To address high dimensionality, we tailor three variable-selection procedures—forward, backward, and stepwise—and compare them with regularization benchmarks and machine-learning methods. Using actual Azuki transaction data, we evaluate performance on a train-validation partition. The squared model overfits out of sample, while the linear model generalizes better and is adopted as the baseline. Applying variable selection to the linear baseline improves both parsimony and predictive performance. Machine-learning models exhibit very high training fit but notable performance degradation on the validation set, indicating overfitting in this setting. Overall, carefully specified hedonic models combined with principled variable selection offer competitive, interpretable, and more generalizable NFT valuation.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 산업융합학부 > 서울 산업융합학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.