Scene Semantic Recognition Based on Modified Fuzzy C-Mean and Maximum Entropy Using Object-to-Object Relations

Jalal, Ahmad; Ahmed, Abrar; Rafique, Adnan Ahmed; Kim, Kibum

doi:10.1109/ACCESS.2021.3058986

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Scene Semantic Recognition Based on Modified Fuzzy C-Mean and Maximum Entropy Using Object-to-Object Relationsopen access

Authors: Jalal, Ahmad; Ahmed, Abrar; Rafique, Adnan Ahmed; Kim, Kibum

Issue Date: Feb-2021

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Feature extraction; Image segmentation; Object recognition; Image recognition; Visualization; Object segmentation; Entropy; Scene recognition; object segmentation; recognition; bag of features; artificial neural network; maximum entropy; object pattern

Citation: IEEE ACCESS, v.9, pp 27758 - 27772

Pages: 15

Indexed: SCIE
SCOPUS

Journal Title: IEEE ACCESS

Volume: 9

Start Page: 27758

End Page: 27772

URI: https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/639

DOI: 10.1109/ACCESS.2021.3058986

ISSN: 2169-3536
2169-3536

Abstract: With advances in machine vision systems (e.g., artificial eye, unmanned aerial vehicles, surveillance monitoring) scene semantic recognition (SSR) technology has attracted much attention due to its related applications such as autonomous driving, tourist navigation, intelligent traffic and remote aerial sensing. Although tremendous progress has been made in visual interpretation, several challenges remain (i.e., dynamic backgrounds, occlusion, lack of labeled data, changes in illumination, direction, and size). Therefore, we have proposed a novel SSR framework that intelligently segments the locations of objects, generates a novel Bag of Features, and recognizes scenes via Maximum Entropy. First, denoising and smoothing are applied on scene data. Second, modified Fuzzy C-Means integrates with super-pixels and Random Forest for the segmentation of objects. Third, these segmented objects are used to extract a novel Bag of Features that concatenate different blobs, multiple orientations, Fourier transform and geometrical points over the objects. An Artificial Neural Network recognizes the multiple objects using the different patterns of objects. Finally, labels are estimated via Maximum Entropy model. During experimental evaluation, our proposed system illustrated a remarkable mean accuracy rate of 90.07% over the MSRC dataset and 89.26% over the Caltech 101 for object recognition, and 93.53% over the Pascal-VOC12 dataset for scene recognition, respectively. The proposed system should be applicable to various emerging technologies, such as augmented reality, to represent the real-world environment for military training and engineering design, as well as for entertainment, artificial eyes for visually impaired people and traffic monitoring to avoid congestion or road accidents.

Files in This Item: Go to Link

Appears in Collections: COLLEGE OF COMPUTING > SCHOOL OF MEDIA, CULTURE, AND DESIGN TECHNOLOGY > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Kibum photo

Kim, Kibum: ERICA 소프트웨어융합대학 (SCHOOL OF MEDIA, CULTURE, AND DESIGN TECHNOLOGY)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE