Advances in Manufacturing ›› 2024, Vol. 12 ›› Issue (3): 603-618.doi: 10.1007/s40436-024-00496-y

Previous Articles    

CMAGAN: classifier-aided minority augmentation generative adversarial networks for industrial imbalanced data and its application to fault prediction

Wen-Jie Wang1,2, Zhao Liu3, Ping Zhu1,2   

  1. 1. School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China;
    2. National Engineering Research Center of Automotive Power and Intelligent Control, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China;
    3. School of Design, Shanghai Jiao Tong University, Shanghai, 200240, People's Republic of China
  • Received:2023-09-27 Revised:2023-11-06 Published:2024-09-07
  • Contact: Zhao Liu,E-mail:hotlz@sjtu.edu.cn;Ping Zhu,E-mail:pzhu@sjtu.edu.cn E-mail:hotlz@sjtu.edu.cn;pzhu@sjtu.edu.cn
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (Grant No. 52375256) and the Natural Science Foundation of Shanghai Municipality (Grant Nos. 21ZR1431500 and 23ZR1431600).

Abstract: Class imbalance is a common characteristic of industrial data that adversely affects industrial data mining because it leads to the biased training of machine learning models. To address this issue, the augmentation of samples in minority classes based on generative adversarial networks (GANs) has been demonstrated as an effective approach. This study proposes a novel GAN-based minority class augmentation approach named classifier-aided minority augmentation generative adversarial network (CMAGAN). In the CMAGAN framework, an outlier elimination strategy is first applied to each class to minimize the negative impacts of outliers. Subsequently, a newly designed boundary-strengthening learning GAN (BSLGAN) is employed to generate additional samples for minority classes. By incorporating a supplementary classifier and innovative training mechanisms, the BSLGAN focuses on learning the distribution of samples near classification boundaries. Consequently, it can fully capture the characteristics of the target class and generate highly realistic samples with clear boundaries. Finally, the new samples are filtered based on the Mahalanobis distance to ensure that they are within the desired distribution. To evaluate the effectiveness of the proposed approach, CMAGAN was used to solve the class imbalance problem in eight real-world fault-prediction applications. The performance of CMAGAN was compared with that of seven other algorithms, including state-of-the-art GAN-based methods, and the results indicated that CMAGAN could provide higher-quality augmented results.

The full text can be downloaded at https://link.springer.com/article/10.1007/s40436-024-00496-y

Key words: Class imbalance, Minority class augmentation, Generative adversarial network (GAN), Boundary strengthening learning (BSL), Fault prediction