TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities

Seo, Hyein; Jeong, Yuna; Choi, Yong Suk

doi:10.1145/3672608.3707847

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities

Full metadata record

DC Field	Value	Language
dc.contributor.author	Seo, Hyein	-
dc.contributor.author	Jeong, Yuna	-
dc.contributor.author	Choi, Yong Suk	-
dc.date.accessioned	2025-06-18T07:30:30Z	-
dc.date.available	2025-06-18T07:30:30Z	-
dc.date.issued	2025-05	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207629	-
dc.description.abstract	The text-to-image diffusion model has advanced, enabling the generation of complex images from text as well as sketches, key poses, and segmentation maps. However, these models face challenges in accurately representing detailed scenes or real-world elements. This study addresses these challenges by proposing a method to enhance image generation ability based on both text and sketch. Our approach introduces an adapter incorporating a deformable convolution network (DCN) to process sketch inputs, allowing structural information to be retained in generated images. Additionally, we integrate large language models (LLMs) to enrich textual descriptions with nuanced color expressions. By combining structural input and enriched text, our model produces images that are not only realistic but visually appealing. This method significantly enhances the model's capacity to capture intricate details. Experimental results demonstrate that our model outperforms existing conditional text-to-image models in visual quality. Overall, this study contributes to image generation technology by advancing color representation via LLMs, fostering the creation of more visually consistent and detailed images. The proposed approach presents broad applicability, offering a notable contribution to text-to-image synthesis and advancing image generation techniques for greater realism.	-
dc.format.extent	8	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Association for Computing Machinery	-
dc.title	TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities	-
dc.type	Article	-
dc.identifier.doi	10.1145/3672608.3707847	-
dc.identifier.scopusid	2-s2.0-105006455030	-
dc.identifier.wosid	001497934400162	-
dc.identifier.bibliographicCitation	Proceedings of the ACM Symposium on Applied Computing, pp 1180 - 1187	-
dc.citation.title	Proceedings of the ACM Symposium on Applied Computing	-
dc.citation.startPage	1180	-
dc.citation.endPage	1187	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Interdisciplinary Applications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Software Engineering	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.subject.keywordPlus	Complex image	-
dc.subject.keywordPlus	Diffusion model	-
dc.subject.keywordPlus	Image diffusion	-
dc.subject.keywordPlus	Image generations	-
dc.subject.keywordPlus	Images synthesis	-
dc.subject.keywordPlus	Key pose	-
dc.subject.keywordPlus	Language model	-
dc.subject.keywordPlus	Real-world	-
dc.subject.keywordPlus	Segmentation map	-
dc.subject.keywordPlus	Text-to-image synthesis	-
dc.subject.keywordAuthor	computer vision	-
dc.subject.keywordAuthor	image generation	-
dc.subject.keywordAuthor	text-to-image synthesis	-
dc.identifier.url	https://dl.acm.org/doi/10.1145/3672608.3707847	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE