当前位置：网站首页>Keras' deep learning practice -- gender classification based on RESNET model

Keras' deep learning practice -- gender classification based on RESNET model

2022-07-22 07:23:00 【Hope Xiaohui】

Keras Deep learning practice —— be based on ResNet The model implements gender classification

0. Preface

from VGG16 To VGG19, The most significant change is the increase in the number of network layers , generally , The deeper the neural network , The better the model performance . However, higher model performance can be achieved only by increasing the number of network layers , It's easy , We can add more layers to the model until it achieves the best performance .
But unfortunately , This is not the case , As the number of network layers increases , The problem of gradient disappearance also surfaced —— As the number of layers increases , The gradient in the network will become very small , So that it is difficult to adjust the weight , At the same time, the network performance will also decline .
Deep residual network (ResNet) The proposal of is to solve the above problems . stay ResNet in , If the model has nothing to learn , Then the convolution layer can do nothing , Just pass the output of the previous layer to the next layer . however , If the model needs to learn some other features , Then the convolution layer takes the output of the previous layer as the input , And learn other characteristics needed to complete the target task .

1. ResNet Architecture brief introduction

residual (Residual) In mathematical statistics, it refers to the actual observed value and estimated value ( Fit value ) Difference between . classical ResNet The architecture is as follows ：

ResNet framework

In the diagram above , It can be seen that , There are jump connections in the model , This connection connects the previous layer with the traditional convolution layer in the network to the next layer of the line . More formally , Input $x$ Through convolution , Get the output after feature transformation $F (x)$ , With the input $x$ Add element by element , Get the final output $H (x)$ ：

$H (x) = x + F (x)$

VGG The comparison between module and residual module is as follows ：

Residual module

2. Based on pre training ResNet50 The model implements gender classification

stay 《 The migration study 》 in , We learned about using transfer learning , Only a few samples are needed to train the model with good performance ; And use pre trained based on transfer learning VGG16 The model Gender classification Actual combat . In this section , We also use pre trained ResNet50 Carry out gender classification practice , among ResNet50 Medium 50 Indicates that the network has 50 Network layer .

2.1 Training gender classification model

First, import the required Library , And download pre trained ResNet50 Model ：

from keras.applications import ResNet50
from keras.applications.resnet50 import preprocess_input
from glob import glob
from skimage import io
import cv2
import numpy as np

model = ResNet50(include_top=False, weights='imagenet', input_shape=(256, 256, 3))

Create input and output data sets , It should be noted that ,ResNet50 The size of the input image of is at least 224 x 224, In order to make sure ResNet50 The pre training model can work normally . We reuse 《 Convolution neural network for gender classification 》 Data set and data loading code used in ：

x = []
y = []
for i in glob('man_woman/a_resized/*.jpg')[:800]:
    try:
        image = io.imread(i)
        x.append(image)
        y.append(0)
    except:
        continue

for i in glob('man_woman/b_resized/*.jpg')[:800]:
    try:
        image = io.imread(i)
        x.append(image)
        y.append(1)
    except:
        continue

x_resnet50 = []
for i in range(len(x)):
    img = x[i]
    img = preprocess_input(img.reshape((1, 256, 256, 3)))
    img_feature = model.predict(img)
    x_resnet50.append(img_feature)

Build input and output numpy Array , At the same time, the data set is divided into training and testing sets ：

x_resnet50 = np.array(x_resnet50)
x_resnet50 = x_resnet50.reshape(x_resnet50.shape[0], x_resnet50.shape[2], x_resnet50.shape[3], x_resnet50.shape[4])
y = np.array(y)

from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x_resnet50, y, test_size=0.2)

In pre training ResNet50 Build a fine-tuning model based on the output of the model ：

from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dropout, Dense
model_fine_tuning = Sequential()
model_fine_tuning.add(Conv2D(2048, 
                        kernel_size=(3, 3),
                        activation='relu',
                        input_shape=(x_train.shape[1], x_train.shape[2], x_train.shape[3])))
model_fine_tuning.add(MaxPooling2D(pool_size=(2, 2)))
model_fine_tuning.add(Flatten())
model_fine_tuning.add(Dense(1024, activation='relu'))
model_fine_tuning.add(Dropout(0.5))
model_fine_tuning.add(Dense(1, activation='sigmoid'))
model_fine_tuning.summary()

The brief architecture information of the model is as follows ：

Model: "sequential"
_________________________________________________________________
Layer (type)                 Output Shape              Param # 
=================================================================
conv2d (Conv2D)              (None, 6, 6, 2048)        37750784  
_________________________________________________________________
max_pooling2d (MaxPooling2D) (None, 3, 3, 2048)        0         
_________________________________________________________________
flatten (Flatten)            (None, 18432)             0         
_________________________________________________________________
dense (Dense)                (None, 1024)              18875392  
_________________________________________________________________
dropout (Dropout)            (None, 1024)              0         
_________________________________________________________________
dense_1 (Dense)              (None, 1)                 1025      
=================================================================
Total params: 56,627,201
Trainable params: 56,627,201
Non-trainable params: 0
_________________________________________________________________

Compile and fit the constructed fine-tuning model ：

model_fine_tuning.compile(loss='binary_crossentropy',optimizer='adam',metrics=['acc'])

history = model_fine_tuning.fit(x_train, y_train,
                                    batch_size=32,
                                    epochs=20,
                                    verbose=1,
                                    validation_data = (x_test, y_test))

During training , The changes of accuracy and loss values of the model on the training data set and the test data set are as follows ：

Performance monitoring during training

You can see , Use pre training ResNet50 The accuracy of the gender classification model can reach 95％ about .

2.2 Misclassification image example

Examples of misclassified images are as follows ：

x = np.array(x)
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.2)

x_test_resnet50 = []
for i in range(len(x_test)):
    img = x_test[i]
    img = preprocess_input(img.reshape((1, 256, 256, 3)))
    img_feature = model.predict(img)
    x_test_resnet50.append(img_feature)

x_test_resnet50 = np.array(x_test_resnet50)
x_test_resnet50 = x_test_resnet50.reshape(x_test_resnet50.shape[0], x_test_resnet50.shape[2], x_test_resnet50.shape[3], x_test_resnet50.shape[4])
y_pred = model_fine_tuning.predict(x_test_resnet50)
wrong = np.argsort(np.abs(y_pred.flatten()-y_test))
print(wrong)

y_test_char = np.where(y_test==0,'M','F')
y_pred_char = np.where(y_pred>0.5,'F','M')

plt.subplot(221)
plt.imshow(x_test[wrong[-1]])
plt.title('Actual: '+str(y_test_char[wrong[-1]])+', '+'Predicted: '+str((y_pred_char[wrong[-1]][0])))
plt.subplot(222)
plt.imshow(x_test[wrong[-2]])
plt.title('Actual: '+str(y_test_char[wrong[-2]])+', '+'Predicted: '+str((y_pred_char[wrong[-2]][0])))
plt.subplot(223)
plt.imshow(x_test[wrong[-3]])
plt.title('Actual: '+str(y_test_char[wrong[-3]])+', '+'Predicted: '+str((y_pred_char[wrong[-3]][0])))
plt.subplot(224)
plt.imshow(x_test[wrong[-4]])
plt.title('Actual: '+str(y_test_char[wrong[-4]])+', '+'Predicted: '+str((y_pred_char[wrong[-4]][0])))
plt.show()

Error classification example

contrast VGG16、VGG19 and Inception v3, There is no significant difference in the accuracy of multiple pre trained gender classification models , Because perhaps the image features extracted by these pre training models are more general features , There is no optimization for extracting gender features , We can train one from scratch RestNet50, Check the network performance .