-
Views
-
Cite
Cite
Zongjie Ma, Abdul Sattar, Jun Zhou, Qingliang Chen, Kaile Su, Dropout with Tabu Strategy for Regularizing Deep Neural Networks, The Computer Journal, Volume 63, Issue 7, July 2020, Pages 1031–1038, https://doi.org/10.1093/comjnl/bxz062
- Share Icon Share
Abstract
Dropout has been proven to be an effective technique for regularizing and preventing the co-adaptation of neurons in deep neural networks (DNN). It randomly drops units with a probability of p during the training stage of DNN to avoid overfitting. The working mechanism of dropout can be interpreted as approximately and exponentially combining many different neural network architectures efficiently, leading to a powerful ensemble. In this work, we propose a novel diversification strategy for dropout, which aims at generating more different neural network architectures in less numbers of iterations. The dropped units in the last forward propagation will be marked. Then the selected units for dropping in the current forward propagation will be retained if they have been marked in the last forward propagation, i.e., we only mark the units from the last forward propagation. We call this new regularization scheme Tabu dropout, whose significance lies in that it does not have extra parameters compared with the standard dropout strategy and is computationally efficient as well. Experiments conducted on four public datasets show that Tabu dropout improves the performance of the standard dropout, yielding better generalization capability.