Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities

Authors
Seo, HyeinJeong, YunaChoi, Yong Suk
Issue Date
May-2025
Publisher
Association for Computing Machinery
Keywords
computer vision; image generation; text-to-image synthesis
Citation
Proceedings of the ACM Symposium on Applied Computing, pp 1180 - 1187
Pages
8
Indexed
SCOPUS
Journal Title
Proceedings of the ACM Symposium on Applied Computing
Start Page
1180
End Page
1187
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207629
DOI
10.1145/3672608.3707847
Abstract
The text-to-image diffusion model has advanced, enabling the generation of complex images from text as well as sketches, key poses, and segmentation maps. However, these models face challenges in accurately representing detailed scenes or real-world elements. This study addresses these challenges by proposing a method to enhance image generation ability based on both text and sketch. Our approach introduces an adapter incorporating a deformable convolution network (DCN) to process sketch inputs, allowing structural information to be retained in generated images. Additionally, we integrate large language models (LLMs) to enrich textual descriptions with nuanced color expressions. By combining structural input and enriched text, our model produces images that are not only realistic but visually appealing. This method significantly enhances the model's capacity to capture intricate details. Experimental results demonstrate that our model outperforms existing conditional text-to-image models in visual quality. Overall, this study contributes to image generation technology by advancing color representation via LLMs, fostering the creation of more visually consistent and detailed images. The proposed approach presents broad applicability, offering a notable contribution to text-to-image synthesis and advancing image generation techniques for greater realism.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE