Detailed Information

TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities

Full metadata record
DC Field | Value | Language
dc.contributor.author | Seo, Hyein | -
dc.contributor.author | Jeong, Yuna | -
dc.contributor.author | Choi, Yong Suk | -
dc.date.accessioned | 2025-06-18T07:30:30Z | -
dc.date.available | 2025-06-18T07:30:30Z | -
dc.date.issued | 2025-05 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207629 | -
dc.description.abstract | The text-to-image diffusion model has advanced, enabling the generation of complex images from text as well as sketches, key poses, and segmentation maps. However, these models face challenges in accurately representing detailed scenes or real-world elements. This study addresses these challenges by proposing a method to enhance image generation ability based on both text and sketch. Our approach introduces an adapter incorporating a deformable convolution network (DCN) to process sketch inputs, allowing structural information to be retained in generated images. Additionally, we integrate large language models (LLMs) to enrich textual descriptions with nuanced color expressions. By combining structural input and enriched text, our model produces images that are not only realistic but visually appealing. This method significantly enhances the model's capacity to capture intricate details. Experimental results demonstrate that our model outperforms existing conditional text-to-image models in visual quality. Overall, this study contributes to image generation technology by advancing color representation via LLMs, fostering the creation of more visually consistent and detailed images. The proposed approach presents broad applicability, offering a notable contribution to text-to-image synthesis and advancing image generation techniques for greater realism. | -
dc.format.extent | 8 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | Association for Computing Machinery | -
dc.title | TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities | -
dc.type | Article | -
dc.identifier.doi | 10.1145/3672608.3707847 | -
dc.identifier.scopusid | 2-s2.0-105006455030 | -
dc.identifier.wosid | 001497934400162 | -
dc.identifier.bibliographicCitation | Proceedings of the ACM Symposium on Applied Computing, pp 1180 - 1187 | -
dc.citation.title | Proceedings of the ACM Symposium on Applied Computing | -
dc.citation.startPage | 1180 | -
dc.citation.endPage | 1187 | -
dc.type.docType | Proceedings Paper | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | -
dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | -
dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | -
dc.subject.keywordPlus | Complex image | -
dc.subject.keywordPlus | Diffusion model | -
dc.subject.keywordPlus | Image diffusion | -
dc.subject.keywordPlus | Image generations | -
dc.subject.keywordPlus | Images synthesis | -
dc.subject.keywordPlus | Key pose | -
dc.subject.keywordPlus | Language model | -
dc.subject.keywordPlus | Real-world | -
dc.subject.keywordPlus | Segmentation map | -
dc.subject.keywordPlus | Text-to-image synthesis | -
dc.subject.keywordAuthor | computer vision | -
dc.subject.keywordAuthor | image generation | -
dc.subject.keywordAuthor | text-to-image synthesis | -
dc.identifier.url | https://dl.acm.org/doi/10.1145/3672608.3707847 | -
Appears in Collections
Seoul College of Engineering > Seoul School of Computer Science > 1. Journal Articles

Related Researcher

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)