Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

No-regret shannon entropy regularized neural contextual bandit online learning for robotic grasping

Full metadata record
DC Field Value Language
dc.contributor.authorLee, K.-
dc.contributor.authorChoy, J.-
dc.contributor.authorChoi, Y.-
dc.contributor.authorKee, H.-
dc.contributor.authorOh, S.-
dc.date.accessioned2022-11-28T01:57:51Z-
dc.date.available2022-11-28T01:57:51Z-
dc.date.issued2020-10-
dc.identifier.issn2153-0858-
dc.identifier.urihttps://scholarworks.bwise.kr/cau/handle/2019.sw.cau/59359-
dc.description.abstractIn this paper, we propose a novel contextual bandit algorithm that employs a neural network as a reward estimator and utilizes Shannon entropy regularization to encourage exploration, which is called Shannon entropy regularized neural contextual bandits (SERN). In many learning-based algorithms for robotic grasping, the lack of the real-world data hampers the generalization performance of a model and makes it difficult to apply a trained model to real-world problems. To handle this issue, the proposed method utilizes the benefit of an online learning. The proposed method trains a neural network to predict the success probability of a given grasp pose based on a depth image, which is called a grasp quality. We theoretically show that the SERN has a no regret property. We empirically demonstrate that the SERN outperforms ϵ-greedy in terms of sample efficiency.-
dc.format.extent6-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleNo-regret shannon entropy regularized neural contextual bandit online learning for robotic grasping-
dc.typeArticle-
dc.identifier.doi10.1109/IROS45743.2020.9341123-
dc.identifier.bibliographicCitationIEEE International Conference on Intelligent Robots and Systems, pp 9620 - 9625-
dc.description.isOpenAccessN-
dc.identifier.scopusid2-s2.0-85102409713-
dc.citation.endPage9625-
dc.citation.startPage9620-
dc.citation.titleIEEE International Conference on Intelligent Robots and Systems-
dc.type.docTypeConference Paper-
dc.publisher.location미국-
dc.subject.keywordPlusAgricultural robots-
dc.subject.keywordPlusE-learning-
dc.subject.keywordPlusEnd effectors-
dc.subject.keywordPlusIntelligent robots-
dc.subject.keywordPlusLearning systems-
dc.subject.keywordPlusNeural networks-
dc.subject.keywordPlusRobotics-
dc.subject.keywordPlusContextual bandits-
dc.subject.keywordPlusGeneralization performance-
dc.subject.keywordPlusGrasp qualities-
dc.subject.keywordPlusLearning-based algorithms-
dc.subject.keywordPlusOnline learning-
dc.subject.keywordPlusReal-world problem-
dc.subject.keywordPlusRobotic grasping-
dc.subject.keywordPlusShannon entropy-
dc.subject.keywordPlusLearning algorithms-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Software > Department of Artificial Intelligence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Kyungjae photo

Lee, Kyungjae
소프트웨어대학 (AI학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE