Exploiting mixing regularization for truly unsupervised font synthesis

Authors
Muhammad, Ammar Ul Hassan; Lee, Hyunsoo; Choi, Jaeyoung
Issue Date
May-2023
Publisher
ELSEVIER
Citation
PATTERN RECOGNITION LETTERS, v.169, pp.35 - 42
Journal Title
PATTERN RECOGNITION LETTERS
Volume
169
Start Page
35
End Page
42
URI
http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/43964
DOI
10.1016/j.patrec.2023.03.019
ISSN
0167-8655
Abstract
Creating a novel font set requires domain expertise and is a laborious and time-consuming process, particularly for languages with a large number of characters and complicated structures. Existing deep learning based methods consider font generation (FG) as an image-to-image translation problem, mostly in a supervised setting, either in the form of pair images (paired data) or font labels (character or style labels), which requires extensive effort and is expensive to collect. Additionally, these supervised counterparts lack generalization for extending to other text image-related tasks, such as word image generation and font attribute control at inference time. We found that these drawbacks are mainly due to the supervised setting adopted by these existing methods for font generation. In this paper, we tackle the FG problem in a truly unsupervised fashion, where a complete font set can be generated by training the generator such that adjacent styles are not correlated and projecting the input glyph image into its corresponding font style latent space. To accomplish this, we propose the Font Mixing Generative Adversarial Network (FM-GAN), which employs mixing regularization to supervise the generator to localize the font styles, and a projection encoder to project an arbitrary glyph image into its corresponding semantic space that is compatible with the generator. In the experiments, we demonstrated that our unsupervised model synthesizes font images that are comparable to supervised state-of-the-art FG baselines. Furthermore, FM-GAN can be directly applied to other text image related tasks, such as multi-lingual font style transfer, word image generation, and font attribute control. © 2023 Elsevier B.V. All rights reserved.
Appears in Collections
College of Information Technology > School of Computer Science and Engineering > 1. Journal Articles
