Detailed Information

Cited 6 time in webofscience Cited 17 time in scopus
Metadata Downloads

A Specialized Architecture for Object Serialization with Applications to Big Data Analytics

Full metadata record
DC Field Value Language
dc.contributor.authorJang, J.[Jang, J.]-
dc.contributor.authorJung, S.J.[Jung, S.J.]-
dc.contributor.authorJeong, S.[Jeong, S.]-
dc.contributor.authorHeo, J.[Heo, J.]-
dc.contributor.authorShin, H.[Shin, H.]-
dc.contributor.authorHam, T.J.[Ham, T.J.]-
dc.contributor.authorLee, J.W.[Lee, J.W.]-
dc.date.accessioned2021-07-28T13:39:45Z-
dc.date.available2021-07-28T13:39:45Z-
dc.date.created2021-05-10-
dc.date.issued2020-
dc.identifier.issn1063-6897-
dc.identifier.urihttps://scholarworks.bwise.kr/skku/handle/2021.sw.skku/6760-
dc.description.abstractObject serialization and deserialization (S/D) is an essential feature for efficient communication between distributed computing nodes with potentially non-uniform execution environments. S/D operations are widely used in big data analytics frameworks for remote procedure calls and massive data transfers like shuffles. However, frequent S/D operations incur significant performance and energy overheads as they must traverse and process a large object graph. Prior approaches improve S/D throughput by effectively hiding disk or network I/O latency with computation, increasing compression ratio, and/or application-specific customization. However, inherent dependencies in the existing (de)serialization formats and algorithms eventually become the major performance bottleneck. Thus, we propose Cereal, a specialized hardware accelerator for memory object serialization. By co-designing the serialization format with hardware architecture, Cereal effectively utilizes abundant parallelism in the S/D process to deliver high throughput. Cereal also employs an efficient object packing scheme to compress metadata such as object reference offsets and a space-efficient bitmap representation for the object layout. Our evaluation of Cereal using both a cycle-level simulator and synthesizable Chisel RTL demonstrates that Cereal delivers 43.4× higher average S/D throughput than 88 other S/D libraries on Java Serialization Benchmark Suite. For six Spark applications Cereal achieves 7.97× and 4.81× speedups on average for S/D operations over Java built-in serializer and Kryo, respectively, while saving S/D energy by 227.75× and 136.28×. © 2020 IEEE.-
dc.language영어-
dc.language.isoen-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleA Specialized Architecture for Object Serialization with Applications to Big Data Analytics-
dc.typeArticle-
dc.contributor.affiliatedAuthorJang, J.[Jang, J.]-
dc.identifier.doi10.1109/ISCA45697.2020.00036-
dc.identifier.scopusid2-s2.0-85091993304-
dc.identifier.wosid000617734800025-
dc.identifier.bibliographicCitationProceedings - International Symposium on Computer Architecture, v.2020-May, pp.322 - 334-
dc.relation.isPartOfProceedings - International Symposium on Computer Architecture-
dc.citation.titleProceedings - International Symposium on Computer Architecture-
dc.citation.volume2020-May-
dc.citation.startPage322-
dc.citation.endPage334-
dc.type.rimsART-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Hardware & Architecture-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordPlusBig data-
dc.subject.keywordPlusComputer hardware-
dc.subject.keywordPlusComputer software-
dc.subject.keywordPlusData Analytics-
dc.subject.keywordPlusData transfer-
dc.subject.keywordPlusJava programming language-
dc.subject.keywordPlusMetadata-
dc.subject.keywordPlusNetwork architecture-
dc.subject.keywordPlusBitmap representation-
dc.subject.keywordPlusEfficient communications-
dc.subject.keywordPlusExecution environments-
dc.subject.keywordPlusHardware architecture-
dc.subject.keywordPlusObject serialization-
dc.subject.keywordPlusPerformance bottlenecks-
dc.subject.keywordPlusRemote Procedure Call-
dc.subject.keywordPlusSpecialized hardware-
dc.subject.keywordPlusAdvanced Analytics-
dc.subject.keywordAuthorApache Spark-
dc.subject.keywordAuthorData analytics-
dc.subject.keywordAuthorDomain-specific architecture-
dc.subject.keywordAuthorHardware-software co-design-
dc.subject.keywordAuthorObject serialization-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE