Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Layered Feature Engineering for E-Commerce Purchase Prediction: A Hierarchical Evaluation on Taobao User Behavior Datasets

Full metadata record
DC Field Value Language
dc.contributor.authorSuo, Liqiu-
dc.contributor.authorXia, Lin-
dc.contributor.authorChung, Yoona-
dc.contributor.authorKim, Eunchan-
dc.date.accessioned2026-06-15T00:00:39Z-
dc.date.available2026-06-15T00:00:39Z-
dc.date.issued2026-02-
dc.identifier.issn1546-2218-
dc.identifier.issn1546-2226-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213264-
dc.description.abstractAccurate purchase prediction in e-commerce critically depends on the quality of behavioral features. This paper proposes a layered and interpretable feature engineering framework that organizes user signals into three layers: Basic, Conversion & Stability (efficiency and volatility across actions), and Advanced Interactions & Activity (cross-behavior synergies and intensity). Using real Taobao (Alibaba’s primary e-commerce platform) logs (57,976 records for 10,203 users; 25 November–03 December 2017), we conducted a hierarchical, layer-wise evaluation that holds data splits and hyperparameters fixed while varying only the feature set to quantify each layer’s marginal contribution. Across logistic regression (LR), decision tree, random forest, XGBoost, and CatBoost models with stratified 5-fold cross-validation, the performance improved monotonically from Basic to Conversion & Stability to Advanced features. With LR, F1 increased from 0.613 (Basic) to 0.962 (Advanced); boosted models achieved high discrimination (0.995 AUC Score) and an F1 score up to 0.983. Calibration and precision–recall analyses indicated strong ranking quality and acknowledged potential dataset and period biases given the short (9-day) window. By making feature contributions measurable and reproducible, the framework complements model-centric advances and offers a transparent blueprint for production-grade behavioral modeling. The code and processed artifacts are publicly available, and future work will extend the validation to longer, seasonal datasets and hybrid approaches that combine automated feature learning with domain-driven design.-
dc.format.extent25-
dc.language영어-
dc.language.isoENG-
dc.publisherTech Science Press-
dc.titleLayered Feature Engineering for E-Commerce Purchase Prediction: A Hierarchical Evaluation on Taobao User Behavior Datasets-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.32604/cmc.2025.076329-
dc.identifier.scopusid2-s2.0-105029602441-
dc.identifier.wosid001697182400001-
dc.identifier.bibliographicCitationComputers, Materials and Continua, v.87, no.1, pp 1 - 25-
dc.citation.titleComputers, Materials and Continua-
dc.citation.volume87-
dc.citation.number1-
dc.citation.startPage1-
dc.citation.endPage25-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaMaterials Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryMaterials Science, Multidisciplinary-
dc.subject.keywordPlusBehavioral research-
dc.subject.keywordPlusElectronic commerce-
dc.subject.keywordPlusForecasting-
dc.subject.keywordPlusLogistic regression-
dc.subject.keywordPlusPurchasing-
dc.subject.keywordPlusRandom forests-
dc.subject.keywordPlusSales-
dc.subject.keywordAuthore-commerce platform-
dc.subject.keywordAuthorfeature importance-
dc.subject.keywordAuthorHierarchical feature engineering-
dc.subject.keywordAuthorpurchase prediction-
dc.subject.keywordAuthorTaobao-
dc.subject.keywordAuthoruser behavior dataset-
dc.identifier.urlhttps://www.techscience.com/cmc/v87n1/66116-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 정보시스템학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Eunchan photo

Kim, Eunchan
COLLEGE OF ENGINEERING (DEPARTMENT OF INFORMATION SYSTEMS)
Read more

Altmetrics

Total Views & Downloads

BROWSE