Приказ основних података о документу

dc.creatorYou, Tianya
dc.creatorWu, Hao
dc.creatorXu, Xiangrong
dc.creatorPetrović, Petar
dc.creatorRodić, Aleksandar
dc.date.accessioned2022-09-19T19:29:01Z
dc.date.available2022-09-19T19:29:01Z
dc.date.issued2022
dc.identifier.issn2079-9292
dc.identifier.urihttps://machinery.mas.bg.ac.rs/handle/123456789/3725
dc.description.abstractThe most basic and primary skills of a robot are pushing and grasping. In cluttered scenes, push to make room for arms and fingers to grasp objects. We propose a modified Actor-Critic (A-C) framework for deep reinforcement learning, Cross-entropy Softmax A-C (CSAC), and use the Prioritized Experience Replay (PER) based on the theoretical foundation and main methods of deep reinforcement learning, combining the advantages of algorithms based on value functions and policy gradients. The grasping model is trained using self-supervised learning to achieve end-to-end mapping from image to propulsion and grasping action. A vision module and an action module have been created out of the entire algorithm framework. The prioritized experience replay is improved to further improve the CSAC-PER algorithm for model sample diversity and robot exploration performance during robot grasping training. The experience replay buffer is dynamically sampled using the prior beta distribution and the dynamic sampling algorithm based on the beta distribution (CSAC-beta) is proposed based on the CSAC algorithm. Despite its low initial efficiency, the experimental simulation results show that the CSAC-beta algorithm eventually achieves good results and has a higher grasping success rate (90%).en
dc.publisherMDPI, Basel
dc.relationProvincialNatural Science Foundation [2108085ME166]
dc.relationNatural Science Research Project of Universities in Anhui Province [KJ2021A0408]
dc.relationOpen Project of China International Science and Technology Cooperation Base on Intelligent Equipment Manufacturing in Special Service Environment [ISTC2021KF08
dc.rightsopenAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.sourceElectronics
dc.subjectrobotic manipulationen
dc.subjectFCNen
dc.subjectdeep reinforcement learningen
dc.subjectbeta distributionen
dc.titleA Proposed Priority Pushing and Grasping Strategy Based on an Improved Actor-Critic Algorithmen
dc.typearticle
dc.rights.licenseBY
dc.citation.issue13
dc.citation.other11(13): -
dc.citation.rankM22~
dc.citation.volume11
dc.identifier.doi10.3390/electronics11132065
dc.identifier.fulltexthttp://machinery.mas.bg.ac.rs/bitstream/id/2273/3722.pdf
dc.identifier.scopus2-s2.0-85133135093
dc.identifier.wos000825683600001
dc.type.versionpublishedVersion


Документи

Thumbnail

Овај документ се појављује у следећим колекцијама

Приказ основних података о документу