Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns

Visual defect inspection and classification are significant steps of most manufacturing processes in the semiconductor and electronics industries. Known and unknown defects on wafer maps tend to cluster, and these spatial patterns provide valuable process information for supporting manufacturing in...

Full description

Bibliographic Details
Main Authors: Ho Sun Shon, Erdenebileg Batbaatar, Wan-Sup Cho, Seong Gon Choi
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9385104/
id doaj-dbaa73a0076b40b08a126df353924239
record_format Article
spelling doaj-dbaa73a0076b40b08a126df3539242392021-04-08T23:01:30ZengIEEEIEEE Access2169-35362021-01-019523525236310.1109/ACCESS.2021.30683789385104Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect PatternsHo Sun Shon0https://orcid.org/0000-0002-6717-7869Erdenebileg Batbaatar1https://orcid.org/0000-0002-9724-8955Wan-Sup Cho2Seong Gon Choi3https://orcid.org/0000-0002-5326-8321Research Institute for Computer and Information Communication, Chungbuk National University, Cheongju, Republic of KoreaSchool of Electrical Computer Engineering, Chungbuk National University, Cheongju, Republic of KoreaDepartment of Management Information Systems, Chungbuk National University, Cheongju, Republic of KoreaSchool of Information and Communication Engineering, Chungbuk National University, Cheongju, Republic of KoreaVisual defect inspection and classification are significant steps of most manufacturing processes in the semiconductor and electronics industries. Known and unknown defects on wafer maps tend to cluster, and these spatial patterns provide valuable process information for supporting manufacturing in determining the root causes of abnormal processes. In previous studies, data augmentation-based deep learning (DL) techniques were most commonly used for the identification of wafer map defect patterns (WMDP). Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations were manually designed for the WMDP problem. In this study, we propose a DL-based method with automatic data augmentation for the WMDP task. Basically, it focuses on learning effective discriminative features, from wafer maps, through a deep network structure. The network consists of a convolution-based variational autoencoder (CVAE) sequentially. First, we pre-trained the CVAE on large training data in an unsupervised manner. Second, we fine-tuned the encoder of the CVAE, which was followed by a neural network (NN) classifier, in a supervised manner. Additionally, we describe a simple procedure for automatically searching for improved data augmentation policies. The policy mainly consists of five image processing functions: rotation, flipping, shifting, shearing range, and zooming. The effectiveness of the proposed method was demonstrated through experimental results obtained from a simulation dataset and a real-world wafer map dataset (WM-811K). This study provides guidance for the application of deep learning in semiconductor manufacturing processes to improve product quality and yield.https://ieeexplore.ieee.org/document/9385104/Classificationconvolutional variational autoencoderdeep learningimbalanced dataneural networkunsupervised pre-training
collection DOAJ
language English
format Article
sources DOAJ
author Ho Sun Shon
Erdenebileg Batbaatar
Wan-Sup Cho
Seong Gon Choi
spellingShingle Ho Sun Shon
Erdenebileg Batbaatar
Wan-Sup Cho
Seong Gon Choi
Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns
IEEE Access
Classification
convolutional variational autoencoder
deep learning
imbalanced data
neural network
unsupervised pre-training
author_facet Ho Sun Shon
Erdenebileg Batbaatar
Wan-Sup Cho
Seong Gon Choi
author_sort Ho Sun Shon
title Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns
title_short Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns
title_full Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns
title_fullStr Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns
title_full_unstemmed Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns
title_sort unsupervised pre-training of imbalanced data for identification of wafer map defect patterns
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Visual defect inspection and classification are significant steps of most manufacturing processes in the semiconductor and electronics industries. Known and unknown defects on wafer maps tend to cluster, and these spatial patterns provide valuable process information for supporting manufacturing in determining the root causes of abnormal processes. In previous studies, data augmentation-based deep learning (DL) techniques were most commonly used for the identification of wafer map defect patterns (WMDP). Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations were manually designed for the WMDP problem. In this study, we propose a DL-based method with automatic data augmentation for the WMDP task. Basically, it focuses on learning effective discriminative features, from wafer maps, through a deep network structure. The network consists of a convolution-based variational autoencoder (CVAE) sequentially. First, we pre-trained the CVAE on large training data in an unsupervised manner. Second, we fine-tuned the encoder of the CVAE, which was followed by a neural network (NN) classifier, in a supervised manner. Additionally, we describe a simple procedure for automatically searching for improved data augmentation policies. The policy mainly consists of five image processing functions: rotation, flipping, shifting, shearing range, and zooming. The effectiveness of the proposed method was demonstrated through experimental results obtained from a simulation dataset and a real-world wafer map dataset (WM-811K). This study provides guidance for the application of deep learning in semiconductor manufacturing processes to improve product quality and yield.
topic Classification
convolutional variational autoencoder
deep learning
imbalanced data
neural network
unsupervised pre-training
url https://ieeexplore.ieee.org/document/9385104/
work_keys_str_mv AT hosunshon unsupervisedpretrainingofimbalanceddataforidentificationofwafermapdefectpatterns
AT erdenebilegbatbaatar unsupervisedpretrainingofimbalanceddataforidentificationofwafermapdefectpatterns
AT wansupcho unsupervisedpretrainingofimbalanceddataforidentificationofwafermapdefectpatterns
AT seonggonchoi unsupervisedpretrainingofimbalanceddataforidentificationofwafermapdefectpatterns
_version_ 1721533512451883008