Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Effects of design factors of HDFS on I/O performance

Authors
김한규
Issue Date
13-Mar-2018
Publisher
Science Publications 244, 5th Avenue # S-207, New York NY 10001
Citation
Journal of Computer Science, v.14, no.3, pp.304 - 309
Journal Title
Journal of Computer Science
Volume
14
Number
3
Start Page
304
End Page
309
URI
https://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/3908
ISSN
1549-3636
Abstract
Four major design factors of HDFS, the block size, the number of data nodes, the number of client processes and replication factor are investigated to find out the effects on the I/O performance of HDFS by performing experiments in a real physical HDFS infrastructure consisting of 64 Hadoop data nodes of Intel i9 based blades. The block size is observed to be optimal when it equals to about 1Gb or 128MB that is the amount of the data the hard disk drive device can effectively input and output for 1 second in most of today's off-the-shelf computers. Sophisticated allocation strategy is required to determine the number of mappers and reducers as the number of data nodes increase because the overall performance is influenced in complicated manner by the number of raw data blocks of the job to be processed, the processing time of the blocks for each node and the overhead of shuffling. Experiments shows that Hadoop distributes the work properly that the number of clients does not have a significant impact as the number of clients increases. There is little delay in copying the replica because replication is done in pipelined manner although the network is overloaded.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > Computer Engineering Major > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Han gyoo photo

Kim, Han gyoo
Engineering (Department of Computer Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE