A new CNN training approach with application to hyperspectral image classification

Kutluk, Sezer; KAYABOL, KORAY; Akan, Aydın

doi:10.1016/j.dsp.2021.103016

Kutluk S., KAYABOL K., Akan A.

DIGITAL SIGNAL PROCESSING, vol.113, 2021 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 113
Publication Date: 2021
DOI: 10.1016/j.dsp.2021.103016
Journal Name: DIGITAL SIGNAL PROCESSING
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC
Istanbul University Affiliated: Yes

Abstract

A successful deep learning application has three main requirements: a suitable network architecture, a sufficiently large training dataset, and a good optimization algorithm. This paper focuses mainly on optimization. We propose a training algorithm for convolutional neural networks (CNNs) that uses both first- and second-order derivatives, applied to different layers. The classification layer is trained with an approximate second-order algorithm, while the rest of the network is trained conventionally, by backpropagation with first-order derivatives. We show that this approach achieves higher classification accuracy in far fewer training iterations than training the whole network with gradient-descent-based algorithms. Moreover, although second-order optimization is generally costlier, we show that the proposed approach trains faster not only in number of iterations but also in training duration. We also present the integration of CNNs with a probabilistic spatial model and apply it to land cover classification in hyperspectral images. The results show that the algorithm achieves superior results with a simple network, even with limited training data, compared to existing approaches. © 2021 Elsevier Inc. All rights reserved.
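The split between first- and second-order updates described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not say which second-order approximation is used, so the code below assumes a damped Newton step with a per-class (block-diagonal) Hessian for the softmax layer, and it uses a single tanh hidden layer as a stand-in for the CNN feature extractor. All names (`make_blobs`, `train`, `predict`, `newton_step`, `damping`) and the synthetic data are hypothetical.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def make_blobs(seed=0, n_per=60):
    """Hypothetical toy data: three well-separated 2-D Gaussian blobs."""
    rng = np.random.default_rng(seed)
    centers = np.array([[2.0, 0.0], [-2.0, 0.0], [0.0, 3.0]])
    X = np.vstack([c + 0.5 * rng.standard_normal((n_per, 2)) for c in centers])
    labels = np.repeat(np.arange(3), n_per)
    return X, np.eye(3)[labels], labels

def train(X, Y, hidden=8, iters=40, lr=0.1, newton_step=0.5, damping=0.1,
          second_order_head=True, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    k = Y.shape[1]
    W1 = 0.5 * rng.standard_normal((d, hidden))  # feature-layer weights
    b1 = np.zeros(hidden)
    W2 = np.zeros((hidden + 1, k))               # classifier; last row is the bias
    losses = []
    for _ in range(iters):
        # Forward pass
        H = np.tanh(X @ W1 + b1)
        Ha = np.hstack([H, np.ones((n, 1))])     # features with a bias column
        P = softmax(Ha @ W2)
        losses.append(-np.mean(np.sum(Y * np.log(P + 1e-12), axis=1)))
        # Backpropagation (first-order derivatives for every layer)
        G = (P - Y) / n
        gW2 = Ha.T @ G
        dZ = (G @ W2[:-1].T) * (1.0 - H ** 2)
        # Feature layer: plain gradient descent
        W1 -= lr * (X.T @ dZ)
        b1 -= lr * dZ.sum(axis=0)
        if second_order_head:
            # Classification layer: damped Newton step per class, using only
            # the diagonal blocks of the softmax Hessian (an assumption here).
            for c in range(k):
                w = P[:, c] * (1.0 - P[:, c]) / n
                Hc = Ha.T @ (Ha * w[:, None]) + damping * np.eye(hidden + 1)
                W2[:, c] -= newton_step * np.linalg.solve(Hc, gW2[:, c])
        else:
            W2 -= lr * gW2                       # first-order baseline
    return (W1, b1, W2), losses

def predict(params, X):
    W1, b1, W2 = params
    Ha = np.hstack([np.tanh(X @ W1 + b1), np.ones((X.shape[0], 1))])
    return np.argmax(Ha @ W2, axis=1)

if __name__ == "__main__":
    X, Y, labels = make_blobs()
    for flag in (True, False):
        params, losses = train(X, Y, second_order_head=flag)
        acc = np.mean(predict(params, X) == labels)
        print(f"second_order_head={flag}: final loss {losses[-1]:.4f}, accuracy {acc:.3f}")
```

Setting `second_order_head=False` trains the same network with gradient descent throughout, which gives a rough way to compare loss per iteration on the toy data. The step size below 1 and the damping term are there because ignoring the off-diagonal Hessian blocks can make a full Newton step overshoot.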