Catching Robot: Predicting the Trajectory of a Rolling Ball using Transformeropen access
- Authors
- Lee, Namyeong; Oh, Yuna; Moon, Jun
- Issue Date
- Sep-2024
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Attention mechanisms; Collaborative robots; Image recognition; Prediction algorithms; Robot learning
- Citation
- IEEE Access, v.12, pp 128551 - 128558
- Pages
- 8
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 12
- Start Page
- 128551
- End Page
- 128558
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212003
- DOI
- 10.1109/ACCESS.2024.3455553
- ISSN
- 2169-3536
2169-3536
- Abstract
- Various tasks in robotics such as 'pick and place' and 'catching flying/rolling objects' have been studied in the literature. Previously, to accomplish such tasks, it was necessary to detect the position of the object using a Sobel detector, a marker, or a stereo method and then predict the trajectory of the object through the model-based Kalman filter. However, these existing studies are not practical, since with this detection method, only one type of object can be detected or additional equipments are required. In addition, to compute the Kalman filter, a measurement of the object's position is essentially required, which may not be precise in various situations due to unmodeled noise. In this paper, we study the new framework of catching a rolling ball task in robotics using only machine learning techniques. Unlike previous approaches that rely on specified markers [1] or stereo camera systems [2], [3], our method uses a machine learning algorithm that can learn object positions and detect various sizes of balls using only one RGB camera without any markers. In our method, Convolutional Neural Network (CNN)-based models are applied to detect objects and the transformer model with an attention mechanism is applied for end-To-end trajectory prediction. We use the robotics simulator to efficiently train models and evaluate their performance directly in the real world. The experimental results of catching a rolling ball show that our framework is practical, and performs well in various sizes of balls. By using the proposed framework, the performance of the Gripper vicinity is 93.3%and the Catching success rate is 73.3%. In contrast, other baselines, such as CNN and long short-Term memory (LSTM), show poor Gripper vicinity and success rates, with all criteria falling below 30%.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.