Towards HPC I/O Performance Prediction through Large-scale Log Analysis

Full metadata record
DC Field: Value
dc.contributor.author: Kim, S.
dc.contributor.author: Sim, A.
dc.contributor.author: Wu, K.
dc.contributor.author: Byna, S.
dc.contributor.author: Son, Yong Seok
dc.contributor.author: Eom, H.
dc.date.accessioned: 2023-07-24T06:41:27Z
dc.date.available: 2023-07-24T06:41:27Z
dc.date.issued: 2020
dc.identifier.issn: 0000-0000
dc.identifier.uri: https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/67256
dc.description.abstract: Large-scale high performance computing (HPC) systems typically consist of many thousands of CPUs and storage units, and are used by hundreds to thousands of users at the same time. Applications from these large numbers of users have diverse characteristics, such as varying compute, communication, memory, and I/O intensiveness. A good understanding of the performance characteristics of each user application is important for job scheduling and resource provisioning. Among these performance characteristics, I/O performance is difficult to predict because the I/O system software is complex, the I/O system is shared among all users, and I/O operations rely heavily on networking systems. To improve the prediction of I/O performance on HPC systems, we propose to integrate information from a number of different system logs and develop a regression-based approach that dynamically selects the most relevant features from the most recent log entries and automatically selects the best regression algorithm for the prediction task. Evaluation results show that our proposed scheme can predict I/O performance with up to 84% prediction accuracy for I/O-intensive applications, using logs from the CORI supercomputer at NERSC. © 2020 ACM.
dc.format.extent: 12
dc.language: English
dc.language.iso: ENG
dc.publisher: Association for Computing Machinery, Inc
dc.title: Towards HPC I/O Performance Prediction through Large-scale Log Analysis
dc.type: Article
dc.identifier.doi: 10.1145/3369583.3392678
dc.identifier.bibliographicCitation: HPDC 2020 - Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing, pp. 77-88
dc.description.isOpenAccess: N
dc.identifier.scopusid: 2-s2.0-85088395233
dc.citation.endPage: 88
dc.citation.startPage: 77
dc.citation.title: HPDC 2020 - Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing
dc.type.docType: Conference Paper
dc.subject.keywordAuthor: distributed file system
dc.subject.keywordAuthor: high performance computing
dc.subject.keywordAuthor: I/O performance prediction
dc.subject.keywordAuthor: log analysis
dc.subject.keywordPlus: Forecasting
dc.subject.keywordPlus: Program processors
dc.subject.keywordPlus: Supercomputers
dc.subject.keywordPlus: Evaluation results
dc.subject.keywordPlus: High performance computing systems
dc.subject.keywordPlus: Networking systems
dc.subject.keywordPlus: Performance characteristics
dc.subject.keywordPlus: Performance prediction
dc.subject.keywordPlus: Prediction accuracy
dc.subject.keywordPlus: Regression algorithms
dc.subject.keywordPlus: Relevant features
dc.subject.keywordPlus: Large scale systems
dc.description.journalRegisteredClass: scopus
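
The approach summarized in the abstract (select the most relevant features from recent log entries, then pick the best regression algorithm automatically) might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the record does not name the feature-selection criterion or the candidate regressors, so the correlation ranking, the three toy regressors, the holdout split, and every function and field name here (`build_predictor`, `nodes`, `noise`, `write_gbps`) are hypothetical.

```python
from statistics import mean

def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either series is constant."""
    mx, my = mean(xs), mean(ys)
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (vx * vy) ** 0.5

def select_features(rows, target, k):
    """Rank features by |correlation| with the target (assumed criterion)."""
    ys = [r[target] for r in rows]
    names = [n for n in rows[0] if n != target]
    names.sort(key=lambda n: abs(pearson([r[n] for r in rows], ys)), reverse=True)
    return names[:k]

def fit_mean(X, y):
    """Baseline: always predict the training mean."""
    m = mean(y)
    return lambda x: m

def fit_linear_1d(X, y):
    """Least-squares line on the top-ranked feature only."""
    xs = [x[0] for x in X]
    mx, my = mean(xs), mean(y)
    vx = sum((v - mx) ** 2 for v in xs)
    slope = sum((v - mx) * (t - my) for v, t in zip(xs, y)) / vx if vx else 0.0
    return lambda x: my + slope * (x[0] - mx)

def fit_knn(X, y, k=3):
    """Average the targets of the k nearest training rows."""
    def predict(x):
        order = sorted(range(len(X)),
                       key=lambda i: sum((a - b) ** 2 for a, b in zip(X[i], x)))
        return mean(y[i] for i in order[:k])
    return predict

# Hypothetical candidate pool; the paper's actual algorithms are not listed here.
CANDIDATES = {"mean": fit_mean, "linear_1d": fit_linear_1d, "knn": fit_knn}

def mae(model, X, y):
    return mean(abs(model(x) - t) for x, t in zip(X, y))

def build_predictor(log_rows, target, window=50, k_features=2, holdout=0.25):
    """Use only the most recent `window` rows, select features, pick a model."""
    recent = log_rows[-window:]
    feats = select_features(recent, target, k_features)
    X = [[r[f] for f in feats] for r in recent]
    y = [r[target] for r in recent]
    # Validate on the newest rows so the choice reflects current conditions.
    split = max(1, min(len(recent) - 1, int(len(recent) * (1 - holdout))))
    best = min(CANDIDATES,
               key=lambda n: mae(CANDIDATES[n](X[:split], y[:split]),
                                 X[split:], y[split:]))
    model = CANDIDATES[best](X, y)  # refit the winner on the whole window
    return feats, best, lambda row: model([row[f] for f in feats])
```

Calling `build_predictor(rows, "write_gbps", window=20, k_features=1)` on a list of per-job dictionaries returns the selected feature names, the name of the winning regressor, and a prediction function. Rebuilding the predictor as new log entries arrive is what would make the selection "dynamic" in this sketch.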
Files in This Item
There are no files associated with this item.
Appears in Collections
College of Software > School of Computer Science and Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Son, Yong Seok
College of Software (School of Software)