Bounded Editing: Multi-Object Image Manipulation with Region-Specific Control
- Authors
- Kang, Min Gyu; Kim, Keon; Choi, Yong Suk
- Issue Date
- May-2025
- Publisher
- Association for Computing Machinery
- Keywords
- computer vision; diffusion models; text-guided-image-editing; text-to-image
- Citation
- Proceedings of the ACM Symposium on Applied Computing, pp 1122 - 1129
- Pages
- 8
- Indexed
- SCOPUS
- Journal Title
- Proceedings of the ACM Symposium on Applied Computing
- Start Page
- 1122
- End Page
- 1129
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207620
- DOI
- 10.1145/3672608.3707793
- Abstract
- Recent diffusion-based models have achieved significant success in vision domains such as image generation, and text-guided image manipulation. Text-guided image editing aims for users to modify specific objects and their attributes based on textual descriptions. However, current image editing approaches are susceptible to unintended modifications to non-target regions or other target regions in images when altering multiple objects. Some studies depend on detailed masks that are challenging to obtain for fine-grained image editing. To address these issues, we propose Bounded Editing, which allows for precise manipulation of specific areas. First, our approach separates the target and non-target regions from the image using bounding boxes. Second, we propose a guidance loss that enhances editing capabilities, enabling precise modifications to target objects while preventing undesired changes to background. By integrating our method with an existing image editing framework, we achieve significant improvements over state-of-the-art methods. Extensive experiments demonstrate the effectiveness of our proposed approach in changing objects, and modifying attributes such as colors and materials, especially on multi-object editing scenarios.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.