Exploiting mixing regularization for truly unsupervised font synthesis
- Authors
- Muhammad, Ammar Ul Hassan; Lee, Hyunsoo; Choi, Jaeyoung
- Issue Date
- May-2023
- Publisher
- ELSEVIER
- Keywords
- Keyword1; Keyword2; Keyword3
- Citation
- PATTERN RECOGNITION LETTERS, v.169, pp.35 - 42
- Journal Title
- PATTERN RECOGNITION LETTERS
- Volume
- 169
- Start Page
- 35
- End Page
- 42
- URI
- http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/43964
- DOI
- 10.1016/j.patrec.2023.03.019
- ISSN
- 0167-8655
- Abstract
- Creating a novel font set requires domain expertise and is a laborious and time-consuming process, par-ticularly for languages with a large number of characters and complicated structures. Existing deep learn-ing based methods consider font generation (FG) as an image-to-image translation problem, mostly in a supervised setting, either in the form of pair images (paired data) or font labels (character or style la-bels), which requires extensive effort and is expensive to collect. Additionally, these supervised counter parts lack generalization for extending to other text image-related tasks, such as word image genera-tion and font attribute control at inference time. We found that these drawbacks are mainly due to the supervised setting adopted by these existing methods for font generation. In this paper, we tackle the FG problem in a truly unsupervised fashion, where a complete font set can be generated by training the generator such that adjacent styles are not correlated and projecting the input glyph image into its corresponding font style latent space. To accomplish this, we propose the Font Mixing Generative Adver-sarial Network (FM-GAN), which employs mixing regularization to supervise the generator to localize the font styles, and a projection encoder to project an arbitrary glyph image into its corresponding semantic space that is compatible with the generator. In the experiments, we demonstrated that our unsupervised model synthesizes font images that are comparable to supervised state-of-the-art FG baselines. Further-more, FM-GAN can be directly applied to other text image related tasks, such as multi-lingual font style transfer, word image generation, and font attribute control.(c) 2023 Elsevier B.V. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Information Technology > School of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.