T–S Fuzzy Model Based Multi-Branch Deep Network Architecture

In the traditional CNN design, the hyperparameters, such as the size of the convolutional kernel and stride, are difficult to determine. In this paper, a new convolutional network architecture, named multi-branch fuzzy architecture network (MBFAN), was proposed for this problem. In MBFAN, some branc...

Full description

Bibliographic Details
Main Authors: Faguang Wang, Yue Wang, Hongmei Wang, Chaogang Tang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9163333/
Description
Summary:In the traditional CNN design, the hyperparameters, such as the size of the convolutional kernel and stride, are difficult to determine. In this paper, a new convolutional network architecture, named multi-branch fuzzy architecture network (MBFAN), was proposed for this problem. In MBFAN, some branches with a certain convolutional neural network architecture are connected in parallel. In each branch, a different-sized convolutional kernel is applied. By data training and normalization, a weight is given to each branch. By these weights, the important features in the final output are strengthened. By normalization, the branches were interconnected together, making the training process more efficient. Due to overfitting, with the increase of branches, the MBFAN accuracy increases, and then decreases. The number of branches is optimized when the MBFAN accuracy is highest. On the other hand, the location of the convolutional kernel center in an image has a great influence on the convolutional results. This is also discussed in MBFAN. For the experiments, the proposed MBFAN was adopted and tested in a simple convolutional network and a VGG16 network.
ISSN:2169-3536