Detailed Information

Cited 0 times in Web of Science · Cited 0 times in Scopus

Comparative Analysis of Vision Transformer Models for Facial Emotion Recognition Using Augmented Balanced Datasets

Full metadata record
dc.contributor.author: Bobojanov, Sukhrob
dc.contributor.author: Kim, Byeong Man
dc.contributor.author: Arabboev, Mukhriddin
dc.contributor.author: Begmatov, Shohruh
dc.date.accessioned: 2024-01-11T05:30:29Z
dc.date.available: 2024-01-11T05:30:29Z
dc.date.issued: 2023-11
dc.identifier.issn: 2076-3417
dc.identifier.uri: https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/26465
dc.description.abstract: Facial emotion recognition (FER) is of great importance in the field of human-machine interfaces. Given the intricacies of human facial expressions and the inherent variation in images, characterized by diverse facial poses and lighting conditions, FER remains a challenging task for computer-based models. Recent advancements have seen vision transformer (ViT) models attain state-of-the-art results across various computer vision tasks, including image classification, object detection, and segmentation. Moreover, correcting data imbalance is one of the most important aspects of building strong machine learning models: to avoid biased predictions and guarantee reliable findings, it is essential to maintain a balanced distribution in the training dataset. In this work, we chose two widely used open-source datasets, RAF-DB and FER2013. To resolve the imbalance problem, we present a new, balanced dataset, applying data augmentation techniques and removing poor-quality images from the FER2013 dataset. We then conduct a comprehensive evaluation of thirteen different ViT models on these three datasets. Our investigation concludes that ViT models are a promising approach for FER tasks. Among them, the Mobile ViT and Tokens-to-Token ViT models appear to be the most effective, followed by the PiT and CrossFormer models.
dc.language: English
dc.language.iso: ENG
dc.publisher: MDPI
dc.title: Comparative Analysis of Vision Transformer Models for Facial Emotion Recognition Using Augmented Balanced Datasets
dc.type: Article
dc.publisher.location: Switzerland
dc.identifier.doi: 10.3390/app132212271
dc.identifier.wosid: 001118096900001
dc.identifier.bibliographicCitation: APPLIED SCIENCES-BASEL, v.13, no.22
dc.citation.title: APPLIED SCIENCES-BASEL
dc.citation.volume: 13
dc.citation.number: 22
dc.type.docType: Article
dc.description.isOpenAccess: Y
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Chemistry
dc.relation.journalResearchArea: Engineering
dc.relation.journalResearchArea: Materials Science
dc.relation.journalResearchArea: Physics
dc.relation.journalWebOfScienceCategory: Chemistry, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Engineering, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Materials Science, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Physics, Applied
dc.subject.keywordAuthor: facial emotion recognition
dc.subject.keywordAuthor: vision transformer
dc.subject.keywordAuthor: data augmentation
dc.subject.keywordAuthor: balanced data
dc.subject.keywordAuthor: FER2013
dc.subject.keywordAuthor: RAF-DB
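
The abstract describes balancing FER2013 by augmenting under-represented emotion classes and removing poor-quality images. The record does not include the authors' exact procedure; the following is a minimal illustrative sketch of the general idea, where the class labels, image counts, and helper name `augmentation_plan` are all hypothetical, not taken from the paper.

```python
# Sketch of class balancing via augmentation, in the spirit of the
# approach the abstract describes for FER2013. All numbers are made up.

def augmentation_plan(class_counts, target=None):
    """For each class, return how many augmented images (e.g. flipped,
    rotated, or brightness-shifted copies) would be needed to reach
    `target` (default: the size of the largest class)."""
    if target is None:
        target = max(class_counts.values())
    return {label: max(0, target - n) for label, n in class_counts.items()}

# Hypothetical per-class counts with one severe minority class,
# loosely mimicking FER2013's rare "disgust" category.
counts = {"happy": 7000, "neutral": 5000, "sad": 4800, "angry": 4000,
          "surprise": 3200, "fear": 4100, "disgust": 550}

plan = augmentation_plan(counts)
print(plan["disgust"])  # 6450 augmented copies needed
print(plan["happy"])    # 0: already the largest class
```

After applying such a plan, every class holds the same number of images, so a model trained on the result is not biased toward the majority expressions.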
Files in This Item
There are no files associated with this item.
Appears in Collections
Department of Computer Software Engineering > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

KIM, BYEONG MAN
College of Engineering (Department of Computer Software Engineering)
