Advances in Manufacturing ›› 2026, Vol. 14 ›› Issue (2): 359-376.doi: 10.1007/s40436-025-00570-z

• ARTICLES • Previous Articles    

Complete coverage path planning for multi-connected free-form surface grinding based on reinforcement learning

Zhen Zhu1,2, Bing-Zhou Xu1,2, Chang-Qing Shen1,2, Xiao-Jian Zhang1,2, Si-Jie Yan1,2, Han Ding1,2   

  1. 1. State Key Laboratory of Intelligent Manufacturing Equipment and Technology, School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, People's Republic of China;
    2. HUST-Wuxi Research Institute, Wuxi 214174, Jiangsu, People's Republic of China
  • Received:2024-05-04 Revised:2024-07-02 Published:2026-04-27
  • Contact: Xiao-Jian Zhang,E-mail:xjzhang@hust.edu.cn E-mail:xjzhang@hust.edu.cn
  • Supported by:
    This work was supported by the National Key Research and Development Program of China (Grant No. 2022YFB4700501), the National Natural Science Foundation of China (Grant Nos. 52375495, 52188102), and the Fundamental Research Funds for the Central Universities (Grant No. HUST: 23JYCXJJ034).

Abstract: Coverage path planning (CPP) is an essential process in robotic grinding, particularly with the increasing demand for large-scale multiconnected free-form surfaces, such as high-speed rail shells, car shells, and aeronautical parts. Owing to its multi-connectivity, achieving full coverage with a single continuous path is challenging. Additionally, large curvatures make the path spacing difficult to control, leaving some areas uncovered. Existing methods often fail to optimize continuity and coverage rates simultaneously, resulting in redundant tool-feeding and lifting processes that significantly reduce processing efficiency. Thus, a novel method for free-form surface CPP is proposed based on reinforcement learning (RL), which enables the learning of an optimal path with optimized continuity and coverage rates. Specifically, to regulate the path spacing, a uniform grid map is constructed based on the least-squares conformal mapping (LSCM) method, which parameterizes the grinding surface to a two-dimensional (2D) plane with controllable distortion. Furthermore, a CPP-specific evaluation criteria (CEC) is designed to evaluate the path through various key factors, including coverage rate, continuity, and smoothness. Finally, a grinding path is generated using the CEC-guided RL framework. The method was verified through several simulations, and a grinding experiment on a high-speed rail head surface was conducted as a typical application. The results showed high path continuity and coverage rates, demonstrating its potential for addressing CPP problems in different manufacturing scenarios.

The full text can be downloaded at https://doi.org/10.1007/s40436-025-00570-z

Key words: Coverage path planning (CPP), Grinding, Reinforcement learning (RL), Muti-connected free-form surface, Reward function construction