Detailed Information

Cited 24 time in webofscience Cited 30 time in scopus
Metadata Downloads

Smart Glass System Using Deep Learning for the Blind and Visually Impaired

Authors
Mukhiddinov, M.Cho, Jinsoo
Issue Date
Nov-2021
Publisher
MDPI
Keywords
Artificial intelligence; Assistive technologies; Blind and visually impaired; Deep learning; Low-light images; Object detection; Refreshable tactile display; Smart glasses
Citation
Electronics, v.10, no.22
Journal Title
Electronics
Volume
10
Number
22
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/84476
DOI
10.3390/electronics10222756
ISSN
2079-9292
Abstract
Individuals suffering from visual impairments and blindness encounter difficulties in moving independently and overcoming various problems in their routine lives. As a solution, artificial intelligence and computer vision approaches facilitate blind and visually impaired (BVI) people in fulfilling their primary activities without much dependency on other people. Smart glasses are a potential assistive technology for BVI people to aid in individual travel and provide social comfort and safety. However, practically, the BVI are unable move alone, particularly in dark scenes and at night. In this study we propose a smart glass system for BVI people, employing computer vision techniques and deep learning models, audio feedback, and tactile graphics to facilitate independent movement in a night-time environment. The system is divided into four models: a low-light image enhancement model, an object recognition and audio feedback model, a salient object detection model, and a text-to-speech and tactile graphics generation model. Thus, this system was developed to assist in the following manner: (1) enhancing the contrast of images under low-light conditions employing a two-branch exposure-fusion network; (2) guiding users with audio feedback using a transformer encoder–decoder object detection model that can recognize 133 categories of sound, such as people, animals, cars, etc., and (3) accessing visual information using salient object extraction, text recognition, and refreshable tactile display. We evaluated the performance of the system and achieved competitive performance on the challenging Low-Light and ExDark datasets. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 컴퓨터공학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher ugli, Mukhiddinov Mukhriddin Nuriddin photo

ugli, Mukhiddinov Mukhriddin Nuriddin
College of IT Convergence (컴퓨터공학부(컴퓨터공학전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE