Cited 0 time in
TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Seo, Hyein | - |
| dc.contributor.author | Jeong, Yuna | - |
| dc.contributor.author | Choi, Yong Suk | - |
| dc.date.accessioned | 2025-06-18T07:30:30Z | - |
| dc.date.available | 2025-06-18T07:30:30Z | - |
| dc.date.issued | 2025-05 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207629 | - |
| dc.description.abstract | The text-to-image diffusion model has advanced, enabling the generation of complex images from text as well as sketches, key poses, and segmentation maps. However, these models face challenges in accurately representing detailed scenes or real-world elements. This study addresses these challenges by proposing a method to enhance image generation ability based on both text and sketch. Our approach introduces an adapter incorporating a deformable convolution network (DCN) to process sketch inputs, allowing structural information to be retained in generated images. Additionally, we integrate large language models (LLMs) to enrich textual descriptions with nuanced color expressions. By combining structural input and enriched text, our model produces images that are not only realistic but visually appealing. This method significantly enhances the model's capacity to capture intricate details. Experimental results demonstrate that our model outperforms existing conditional text-to-image models in visual quality. Overall, this study contributes to image generation technology by advancing color representation via LLMs, fostering the creation of more visually consistent and detailed images. The proposed approach presents broad applicability, offering a notable contribution to text-to-image synthesis and advancing image generation techniques for greater realism. | - |
| dc.format.extent | 8 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Association for Computing Machinery | - |
| dc.title | TAECE : T2I-Adapter with Enhanced Color Expression for Improving Conditional Text-to-Image Generation Capabilities | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1145/3672608.3707847 | - |
| dc.identifier.scopusid | 2-s2.0-105006455030 | - |
| dc.identifier.wosid | 001497934400162 | - |
| dc.identifier.bibliographicCitation | Proceedings of the ACM Symposium on Applied Computing, pp 1180 - 1187 | - |
| dc.citation.title | Proceedings of the ACM Symposium on Applied Computing | - |
| dc.citation.startPage | 1180 | - |
| dc.citation.endPage | 1187 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.subject.keywordPlus | Complex image | - |
| dc.subject.keywordPlus | Diffusion model | - |
| dc.subject.keywordPlus | Image diffusion | - |
| dc.subject.keywordPlus | Image generations | - |
| dc.subject.keywordPlus | Images synthesis | - |
| dc.subject.keywordPlus | Key pose | - |
| dc.subject.keywordPlus | Language model | - |
| dc.subject.keywordPlus | Real-world | - |
| dc.subject.keywordPlus | Segmentation map | - |
| dc.subject.keywordPlus | Text-to-image synthesis | - |
| dc.subject.keywordAuthor | computer vision | - |
| dc.subject.keywordAuthor | image generation | - |
| dc.subject.keywordAuthor | text-to-image synthesis | - |
| dc.identifier.url | https://dl.acm.org/doi/10.1145/3672608.3707847 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
