Comprehensive Style Transfer for Facial Images Using Enhanced Feature Attribution in Generative Adversarial Netsopen access
- Authors
- Yoo, Yongseon; Kim, Seonggyu; Lee, Jong-Min
- Issue Date
- Jun-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Image-to-image translation; style transfer; gram matrix; generative adversarial networks (GANs); style application; style evaluation; Image-to-image translation; style transfer; gram matrix; generative adversarial networks (GANs); style application; style evaluation
- Citation
- IEEE Access, v.13, pp 99145 - 99159
- Pages
- 15
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 13
- Start Page
- 99145
- End Page
- 99159
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207984
- DOI
- 10.1109/ACCESS.2025.3574729
- ISSN
- 2169-3536
2169-3536
- Abstract
- Image-to-image translation is a fundamental task in computer vision that transforms images between domains while preserving essential content. Although adaptive instance normalization (AdaIN) is widely used for style transfer, its reliance on simple statistical measures (mean and variance) may limit its ability to capture complex style characteristics. We propose a novel framework that enhances style transfer by combining AdaIN with Gram matrices, leveraging the complementary strengths of both approaches. Our method introduces two key innovations for enhanced feature attribution: 1) dual Gram matrix-based loss functions (G1 and G2), which operate at different stages of the generation process to capture richer style information by establishing deeper correlations between feature maps, and 2) a balanced training objective that integrates perceptual loss with cycle-consistency loss to maintain content fidelity during style transfer. This comprehensive feature attribution mechanism enables our model to decompose and reassign stylistic elements across domains more precisely. Through ablation studies, we demonstrate that each component of our framework contributes to performance improvements, with the complete model achieving the best results on both the CelebA-HQ and FFHQ datasets. Our comprehensive evaluation, using distribution similarity metrics, classification-based assessments, and visual comparisons, demonstrates that our approach effectively captures and transfers complex style characteristics while preserving content integrity, outperforming state-of-the-art models. Specifically, our model achieves superior Fr & eacute;chet Inception Distance (FID) scores (19.88 vs. 24.22) and recognition accuracy (0.966 vs. 0.941) compared to StarGAN v2, confirming the performance gains introduced by our enhanced feature attribution strategy.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.