Cited 0 time in
Catching Robot: Predicting the Trajectory of a Rolling Ball using Transformer
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Namyeong | - |
| dc.contributor.author | Oh, Yuna | - |
| dc.contributor.author | Moon, Jun | - |
| dc.date.accessioned | 2026-04-06T06:00:14Z | - |
| dc.date.available | 2026-04-06T06:00:14Z | - |
| dc.date.issued | 2024-09 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.issn | 2169-3536 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212003 | - |
| dc.description.abstract | Various tasks in robotics such as 'pick and place' and 'catching flying/rolling objects' have been studied in the literature. Previously, to accomplish such tasks, it was necessary to detect the position of the object using a Sobel detector, a marker, or a stereo method and then predict the trajectory of the object through the model-based Kalman filter. However, these existing studies are not practical, since with this detection method, only one type of object can be detected or additional equipments are required. In addition, to compute the Kalman filter, a measurement of the object's position is essentially required, which may not be precise in various situations due to unmodeled noise. In this paper, we study the new framework of catching a rolling ball task in robotics using only machine learning techniques. Unlike previous approaches that rely on specified markers [1] or stereo camera systems [2], [3], our method uses a machine learning algorithm that can learn object positions and detect various sizes of balls using only one RGB camera without any markers. In our method, Convolutional Neural Network (CNN)-based models are applied to detect objects and the transformer model with an attention mechanism is applied for end-To-end trajectory prediction. We use the robotics simulator to efficiently train models and evaluate their performance directly in the real world. The experimental results of catching a rolling ball show that our framework is practical, and performs well in various sizes of balls. By using the proposed framework, the performance of the Gripper vicinity is 93.3%and the Catching success rate is 73.3%. In contrast, other baselines, such as CNN and long short-Term memory (LSTM), show poor Gripper vicinity and success rates, with all criteria falling below 30%. | - |
| dc.format.extent | 8 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Catching Robot: Predicting the Trajectory of a Rolling Ball using Transformer | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ACCESS.2024.3455553 | - |
| dc.identifier.scopusid | 2-s2.0-85203424636 | - |
| dc.identifier.wosid | 001316117500001 | - |
| dc.identifier.bibliographicCitation | IEEE Access, v.12, pp 128551 - 128558 | - |
| dc.citation.title | IEEE Access | - |
| dc.citation.volume | 12 | - |
| dc.citation.startPage | 128551 | - |
| dc.citation.endPage | 128558 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Telecommunications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.subject.keywordPlus | Collaborative robots | - |
| dc.subject.keywordPlus | Convolutional neural networks | - |
| dc.subject.keywordPlus | Grippers | - |
| dc.subject.keywordPlus | Long short-term memory | - |
| dc.subject.keywordPlus | Robot learning | - |
| dc.subject.keywordPlus | Solvent extraction | - |
| dc.subject.keywordPlus | Stereo image processing | - |
| dc.subject.keywordAuthor | Attention mechanisms | - |
| dc.subject.keywordAuthor | Collaborative robots | - |
| dc.subject.keywordAuthor | Image recognition | - |
| dc.subject.keywordAuthor | Prediction algorithms | - |
| dc.subject.keywordAuthor | Robot learning | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10669029 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
