Exploiting mixing regularization for truly unsupervised font synthesis

Muhammad, Ammar Ul Hassan; Lee, Hyunsoo; Choi, Jaeyoung

doi:10.1016/j.patrec.2023.03.019

Authors: Muhammad, Ammar Ul Hassan; Lee, Hyunsoo; Choi, Jaeyoung

Issue Date: May-2023

Publisher: ELSEVIER

Citation: PATTERN RECOGNITION LETTERS, v.169, pp.35 - 42

Journal Title: PATTERN RECOGNITION LETTERS

Volume: 169

Start Page: 35

End Page: 42

URI: http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/43964

DOI: 10.1016/j.patrec.2023.03.019

ISSN: 0167-8655

Abstract: Creating a novel font set requires domain expertise and is a laborious and time-consuming process, particularly for languages with a large number of characters and complicated structures. Existing deep learning based methods consider font generation (FG) as an image-to-image translation problem, mostly in a supervised setting, either in the form of pair images (paired data) or font labels (character or style labels), which requires extensive effort and is expensive to collect. Additionally, these supervised counterparts lack generalization for extending to other text image-related tasks, such as word image generation and font attribute control at inference time. We found that these drawbacks are mainly due to the supervised setting adopted by these existing methods for font generation. In this paper, we tackle the FG problem in a truly unsupervised fashion, where a complete font set can be generated by training the generator such that adjacent styles are not correlated and projecting the input glyph image into its corresponding font style latent space. To accomplish this, we propose the Font Mixing Generative Adversarial Network (FM-GAN), which employs mixing regularization to supervise the generator to localize the font styles, and a projection encoder to project an arbitrary glyph image into its corresponding semantic space that is compatible with the generator. In the experiments, we demonstrated that our unsupervised model synthesizes font images that are comparable to supervised state-of-the-art FG baselines. Furthermore, FM-GAN can be directly applied to other text image related tasks, such as multi-lingual font style transfer, word image generation, and font attribute control. © 2023 Elsevier B.V. All rights reserved.

Appears in Collections: College of Information Technology > School of Computer Science and Engineering > 1. Journal Articles

Choi, Jaeyoung: College of Information Technology (School of Computer Science and Engineering)
