Prefetching data cache based on SSD device in distributed file systems
- Authors
- 백두원; 홍지만; An, J.-H.; Kim, Y.
- Issue Date
- Nov-2016
- Publisher
- International Information Institute Ltd.
- Keywords
- Hadoop; HDFS; MapReduce; Split; SSDs caching
- Citation
- Information (Japan), v.19, no.11, pp.5299 - 5307
- Journal Title
- Information (Japan)
- Volume
- 19
- Number
- 11
- Start Page
- 5299
- End Page
- 5307
- URI
- http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/5713
- ISSN
- 1343-4500
- Abstract
- HDDs (hard disk drives), the storage devices mainly used in Hadoop systems, have lower transfer speeds and higher latency than SSDs (solid-state disks). Although SSDs are expected to replace HDDs eventually, they are still too expensive, so some distributed file systems use SSDs as a cache for HDDs. In this paper, we propose an efficient prefetching mechanism for distributed file systems that use SSDs as a cache. The proposed mechanism obtains the IDs of blocks that have already been used or will be used and prefetches those blocks into the SSD cache before the corresponding I/O requests arrive. This can increase the SSD cache hit ratio while Hadoop MapReduce tasks are executing. Our performance evaluation shows that the proposed mechanism significantly improves MapReduce performance compared with an existing distributed file system that uses the existing SSD cache. © 2016 International Information Institute.
- Files in This Item
- There are no files associated with this item.
- Appears in Collections
- College of Information Technology > School of Computer Science and Engineering > 1. Journal Articles
- College of Information Technology > Global School of Media > 1. Journal Articles
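
The prefetching idea summarized in the abstract can be illustrated with a toy simulation. This is only a minimal sketch, not the authors' implementation: the record gives no design details, so the LRU eviction policy, the `SsdCacheSim` class, the `run` helper, the `lookahead` parameter, and the sample trace are all assumptions made for illustration. It compares the hit ratio of a demand-filled cache against one that loads upcoming block IDs before they are requested.

```python
from collections import OrderedDict


class SsdCacheSim:
    """Toy LRU cache standing in for an SSD cache in front of HDD storage."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()  # block ID -> None, oldest first
        self.hits = 0
        self.misses = 0

    def _admit(self, block_id):
        # Insert or refresh a block, evicting least recently used blocks if full.
        self.blocks[block_id] = None
        self.blocks.move_to_end(block_id)
        while len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)

    def prefetch(self, block_id):
        # Load a block ahead of any request; counted as neither hit nor miss.
        self._admit(block_id)

    def read(self, block_id):
        # Serve a request: a hit if cached, otherwise a miss followed by an
        # on-demand fill (the behaviour of a cache without prefetching).
        if block_id in self.blocks:
            self.hits += 1
        else:
            self.misses += 1
        self._admit(block_id)

    def hit_ratio(self):
        total = self.hits + self.misses
        return self.hits / total if total else 0.0


def run(trace, capacity, lookahead):
    """Replay a block-ID trace; lookahead=0 disables prefetching."""
    cache = SsdCacheSim(capacity)
    for i, block_id in enumerate(trace):
        cache.read(block_id)
        # Assumption: the next `lookahead` block IDs are known in advance,
        # for example from the input splits of upcoming map tasks.
        for upcoming in trace[i + 1:i + 1 + lookahead]:
            cache.prefetch(upcoming)
    return cache.hit_ratio()


trace = list(range(8)) * 2  # hypothetical sequential scan, repeated once
baseline = run(trace, capacity=4, lookahead=0)
prefetched = run(trace, capacity=4, lookahead=2)
print(f"hit ratio without prefetch: {baseline:.4f}")
print(f"hit ratio with prefetch:    {prefetched:.4f}")
```

In this hypothetical trace the scan is larger than the cache, so demand filling alone never hits, while prefetching the next two blocks turns every request after the first into a hit. Real workloads and the paper's actual mechanism may behave quite differently.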