Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Preserving instance-level characteristics for multi-instance generation

Full metadata record
DC Field Value Language
dc.contributor.authorRyu, Jaehak-
dc.contributor.authorMoon, Sungwon-
dc.contributor.authorCho, Donghyeon-
dc.date.accessioned2026-03-03T01:00:09Z-
dc.date.available2026-03-03T01:00:09Z-
dc.date.issued2026-02-
dc.identifier.issn0262-8856-
dc.identifier.issn1872-8138-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210979-
dc.description.abstractRecently, there have been efforts to explore instance-level control in diffusion models, where multiple instances are generated independently and then integrated into a single scene. However, several issues arise when instances are closely positioned or overlapping. First, independently generated instances frequently differ in style and lack coherence, leading to changes in their attributes as they influence each other when merged. Second, instances often merge with one another or become absorbed into others. To tackle these challenges, we propose a local latent refinement (LLR) that enforces each local latent to meet its conditions and remain distinct from others. We also propose a local latent injection (LLI) method that gradually integrates local latents during global latent generation for smoother fusion. Also, we find that the variance of latents changes significantly after instance fusion, which greatly impacts the quality of the generated images. To remedy this, we apply an instance normalization layer to regulate the variance of the fused latents, thereby producing high-quality images. Extensive experiments demonstrate that our approach achieves both high fidelity in instance layout and superior image quality, even in cases of high overlap among instances.-
dc.format.extent12-
dc.language영어-
dc.language.isoENG-
dc.publisherElsevier Ltd-
dc.titlePreserving instance-level characteristics for multi-instance generation-
dc.typeArticle-
dc.publisher.location네델란드-
dc.identifier.doi10.1016/j.imavis.2025.105851-
dc.identifier.scopusid2-s2.0-105030104458-
dc.identifier.wosid001632515200001-
dc.identifier.bibliographicCitationImage and Vision Computing, v.166, pp 1 - 12-
dc.citation.titleImage and Vision Computing-
dc.citation.volume166-
dc.citation.startPage1-
dc.citation.endPage12-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaOptics-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryOptics-
dc.subject.keywordPlusBehavioral research-
dc.subject.keywordPlusData mining-
dc.subject.keywordPlusImage fusion-
dc.subject.keywordPlusImage quality-
dc.subject.keywordAuthorAttention-
dc.subject.keywordAuthorDiffusion-
dc.subject.keywordAuthorInference optimization-
dc.subject.keywordAuthorLayout condition-
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S0262885625004391?via%3Dihub-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Cho, Donghyeon photo

Cho, Donghyeon
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE