Comparative analysis of model performance for predicting the customer of cafeteria using unstructured data
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, Seungsik | - |
dc.contributor.author | Gu, Nami | - |
dc.contributor.author | Moon, Jeongin | - |
dc.contributor.author | Kim, Keunwook | - |
dc.contributor.author | Hwang, Yeongeun | - |
dc.contributor.author | Lee, Kyeongjun | - |
dc.date.accessioned | 2024-03-13T01:30:27Z | - |
dc.date.available | 2024-03-13T01:30:27Z | - |
dc.date.issued | 2023-09 | - |
dc.identifier.issn | 2287-7843 | - |
dc.identifier.issn | 2383-4757 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28509 | - |
dc.description.abstract | This study aimed to predict the number of meals served in a group cafeteria using machine learning. Menu features were created using Word2Vec and clustering, and a stacking ensemble model was constructed with Random Forest, Gradient Boosting, and CatBoost as sub-models. Results showed that CatBoost had the best performance, and the ensemble model showed an 8% improvement in performance. The study also found that the date variable had the greatest influence on the number of diners in a cafeteria, followed by menu characteristics and other variables. The implications of the study include the potential for machine learning to improve predictive performance and reduce food waste, as well as the removal of subjective elements from menu classification. Limitations include the small number of data cases and a model structure that is weak for new menus or foreign words not included in the training data. Future studies should aim to address these limitations. | - |
dc.format.extent | 15 | - |
dc.language | English | - |
dc.language.iso | ENG | - |
dc.publisher | KOREAN STATISTICAL SOC | - |
dc.title | Comparative analysis of model performance for predicting the customer of cafeteria using unstructured data | - |
dc.type | Article | - |
dc.publisher.location | Republic of Korea | - |
dc.identifier.doi | 10.29220/CSAM.2023.30.5.485 | - |
dc.identifier.scopusid | 2-s2.0-85173435546 | - |
dc.identifier.wosid | 001162387300004 | - |
dc.identifier.bibliographicCitation | COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, v.30, no.5, pp 485 - 499 | - |
dc.citation.title | COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS | - |
dc.citation.volume | 30 | - |
dc.citation.number | 5 | - |
dc.citation.startPage | 485 | - |
dc.citation.endPage | 499 | - |
dc.type.docType | Article | - |
dc.identifier.kciid | ART003003402 | - |
dc.description.isOpenAccess | Y | - |
dc.description.journalRegisteredClass | scopus | - |
dc.description.journalRegisteredClass | esci | - |
dc.description.journalRegisteredClass | kci | - |
dc.relation.journalResearchArea | Mathematics | - |
dc.relation.journalWebOfScienceCategory | Statistics & Probability | - |
dc.subject.keywordAuthor | cafeteria | - |
dc.subject.keywordAuthor | ensemble model | - |
dc.subject.keywordAuthor | ESG | - |
dc.subject.keywordAuthor | food waste | - |
dc.subject.keywordAuthor | machine learning | - |
dc.subject.keywordAuthor | menu features | - |
dc.subject.keywordAuthor | performance improvement | - |
dc.subject.keywordAuthor | prediction | - |
dc.subject.keywordAuthor | word embedding | - |
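
The abstract describes turning menu text into features through word embeddings and clustering. The following is a minimal sketch of that idea, not the authors' implementation: the record does not say how word vectors were combined per menu or which clustering algorithm was used, so averaging and plain k-means are assumptions here. `TOY_VECTORS`, `menu_vector`, and `kmeans_labels` are hypothetical names, and the hand-written 2-D vectors stand in for embeddings that a trained Word2Vec model would supply.

```python
from math import dist

# ASSUMPTION: toy 2-D vectors standing in for trained Word2Vec embeddings.
TOY_VECTORS = {
    "rice": (0.9, 0.1),
    "kimchi": (0.8, 0.2),
    "stew": (0.7, 0.3),
    "pasta": (0.1, 0.9),
    "salad": (0.2, 0.8),
    "bread": (0.15, 0.85),
}


def menu_vector(menu_items):
    """Average the vectors of known words; return None if no word is known.

    ASSUMPTION: averaging is one common way to pool word vectors; the
    record does not state which pooling the study used.
    """
    known = [TOY_VECTORS[w] for w in menu_items if w in TOY_VECTORS]
    if not known:
        # Unseen menus or foreign words have no embedding, which is the
        # limitation the abstract mentions.
        return None
    return tuple(sum(axis) / len(known) for axis in zip(*known))


def nearest(point, centroids):
    """Index of the centroid closest to the point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda k: dist(point, centroids[k]))


def kmeans_labels(points, initial_centroids, iterations=10):
    """Plain Lloyd's k-means with fixed starting centroids (deterministic)."""
    centroids = list(initial_centroids)
    for _ in range(iterations):
        labels = [nearest(p, centroids) for p in points]
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return [nearest(p, centroids) for p in points]


menus = [
    ["rice", "kimchi"],
    ["rice", "stew"],
    ["pasta", "salad"],
    ["bread", "salad"],
]
vectors = [menu_vector(m) for m in menus]

# Start from the first and third menus so the run is reproducible.
labels = kmeans_labels(vectors, [vectors[0], vectors[2]])
print(labels)
```

Each cluster label could then be used as a categorical menu feature alongside date and other variables in a downstream regression model. With real embeddings, the number of clusters and the initialization would need to be chosen and validated on the data.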