DSpace at EWHA: Convolutional Neural Network을 이용한 영상의 잡음 제거 분석

Browse

My Repository

DSpace at EWHA일반대학원 전자공학과 Theses_Master

View : 3262 Download: 0

Convolutional Neural Network을 이용한 영상의 잡음 제거 분석

Title: Convolutional Neural Network을 이용한 영상의 잡음 제거 분석

Other Titles: Analysis of De-noising Convolutional Neural Network

Authors: 유소희

Issue Date: 2016

Department/Major: 대학원 전자공학과

Publisher: 이화여자대학교 대학원

Degree: Master

Advisors: 이병욱

Abstract: 최근 컴퓨터 하드웨어 사양이 높아지고, GPU의 발전 등으로 인해 시간과 메모리의 제약이 줄어들면서 방대한 data를 이용하는 기계학습을 통한 영상 개선 방식에 대한 연구가 활발히 이루어지고 있다. 학습을 통해 잡음을 제거하는 기존 방식으로 Multi- Layer Perceptron (MLP) 를 이용한 방식, De-noising Auto-encoder (DA) 를 이용한 방식 등이 연구되어왔다. 본 논문에서는 Convolutional Neural Network (CNN) 을 이용하여 가산 백색 가우시안 잡음 (AWGN) 을 제거하는 De-noising CNN (DNCNN) 을 제안한다. DNCNN은 3개의 convolution layer와 2개의 Rectified Linear Unit (ReLU) 으로 이루어진 구조로 이 model에 patch영상을 학습시켜 영상의 잡음이 제거되도록 한다. Patch영상은 training 영상에 가산 백색 가우시안 잡음을 더한 다음 0과 255 사이의 값으로 clipping하고 다시 0과 1사이의 값으로 normalization을 수행하여 patch 크기만큼 잘라서 생성하였다. 최종 model은 1번째, 2번째 layer에서 각각 256, 128개의 filter를 이용한다. 각 layer의 convolution filter의 수를 조정하여 더 큰 feature map을 이용함으로써 더 낮은 test MSE를 얻을 수 있었다. 결과적으로 잡음제거에서 아주 우수한 성능을 보이는BM3D와 유사한 잡음 제거 성능을 얻을 수 있었다. Non-local mean 방법을 도입하고 최적화와 training data를 추가하여 더 좋은 잡음 제거 성능을 얻을 수 있을 것으로 기대된다. 또한 DNCNN에서 잡음이 제거되는 원리를 분석해 보았다. DNCNN의 잡음제거 성능에는 2번째 layer의 ReLU에서 일어나는 thresholding이 중요한 역할을 수행하는 것을 알 수 있었다. 또한 2번째 layer의 feature 중에서도 몇 개 주요 feature의 잡음 분산이 큰 값을 가지며, 이 feature의 thresholding에 의해서 크게 잡음이 제거되는 것을 알 수 있었다. 이 때 ReLU의 threshold는 2번째 layer feature의 MSE를 최소화하는 threshold 값과 관련이 있으며, 이 thresholding에 의해 영상의 잡음이 제거되는 것을 알 수 있었다. 본 연구를 통하여 DNCNN에서 잡음이 제거되는 원리를 파악하였고 보다 더 우수한 잡음 제거 structure 연구의 기초가 되었다.;As the processing speed and memory capacity have been exponentially increased due to advanced hardware and high-performing GPU, researches on machine learning based image enhancement scheme are being actively conducted and achieving high performance. In this paper, de-nosing CNN model which removes additive white Gaussian nose is proposed and its performance is analyzed. The employed DNCNN consists of three convolutional layers and two rectifier units, and small image patches are used to train the DNCNN model. Lower MSE was obtained by increasing the number of feature maps. As a result, the DNCNN reached similar performance in de-noising compared with BM3D which is the state-of-the-art de-noising algorithm. DNCNN may reach higher performance with optimized structure. Also, de-noising mechanism of CNN is analyzed. The rectified linear unit of the second layer plays major role in de-nosing and some of the feature maps among the second layer showed dominant noise reduction. Optimal threshold of the rectified linear unit is related to threshold value which minimizes the feature MSE of the second layer. We conclude that DNCNN is equivalent to thresholding features. The DNCNN can be improved further by incorporating non-local means or utilizing the sparsity of the features.