HiLite: Hierarchical Level-implemented Architecture Attaining Part-Whole Interpretability

Jeong, Yoo Hyun; Hwang, Sunghyun; Chae, Dong-Kyu

doi:10.1145/3627673.3679538

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

HiLite: Hierarchical Level-implemented Architecture Attaining Part-Whole Interpretability

Authors: Jeong, Yoo Hyun; Hwang, Sunghyun; Chae, Dong-Kyu

Issue Date: Oct-2024

Keywords: explainable AI; hierarchical architecture; neural networks with interpretability

Citation: International Conference on Information and Knowledge Management, Proceedings, pp 983 - 993

Pages: 11

Indexed: SCOPUS

Journal Title: International Conference on Information and Knowledge Management, Proceedings

Start Page: 983

End Page: 993

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/200543

DOI: 10.1145/3627673.3679538

ISSN: 2155-0751

Abstract: Beyond the traditional CNN structure, we have recently witnessed lots of breakthroughs in computer vision architectures such as Vision Transformer, MLP-Mixer, SNN-MLP, and so on. However, many efforts in developing novel architectures for vision tasks are heavily focused on achieving powerful performances, and how to attain interpretability in a trained neural network remains an open question. Inspired by the imaginary system GLOM, we present HiLite : Hierarchical Level-implemented Architecture attaining Part-Whole Interpretability, where islands of identical vectors can provide unprecedented interpretability. In our column-like structure, each level is a layer of a part-whole hierarchy composed of multiple neurons, and the function to define the neural field along an image input patch is initialized as the level vector inside the model. We propose two-column networks (Top-Down (TD) and Bottom-Up (BU)) that allow inter-level communication between adjacent levels on a specific patch and propose Gated Consensus Attention to perform intra-level communication on different patches within the level. At each time step, the level vector and outputs from different networks are combined into a weighted sum and passed to the next step, and outputs from the final time step are utilized as representation vectors. Here, supervised contrastive learning is used to find the relationship of meaningful patches in each class, where negative examples contribute to preventing representation collapse between neighboring patches. HiLite shows a possibility of performance through a quantitative evaluation on four image classification datasets as well as two metrics for assessing representation quality and showcases the intrinsic interpretability by simply generating a visual cue. We believe that our work is a solid step towards novel research on neural architectures attaining interpretability.

Files in This Item: There are no files associated with this item.

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Chae, Dong Kyu photo

Chae, Dong Kyu: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE