微调 keras .h5 模型到 C++ 中使用的 .pb 模型会导致运行模型失败错误

Posted 2023-02-23

技术标签:

【中文标题】微调 keras .h5 模型到 C++ 中使用的 .pb 模型会导致运行模型失败错误【英文标题】：Fine tuned keras .h5 model to .pb model used in c++ gives running model failed error 【发布时间】：2018-07-24 03:58:52 【问题描述】：

我对 VGG16 模型进行了微调，并以最佳精度保存了模型。我的 keras 代码是 -

#Input size 
rows = 125
column = 125

channels = 3 #RGB

#Resizing

def read_images(src):
    #read every image and resize every image to same size
    img = Image.open(src)
    #print(img.size)
    #method = Image.NEAREST if img.size == (rows,column) else Image.ANTIALIAS
    im_rz = img.resize((rows,column), Image.ANTIALIAS)
    #im_rz = resize_image_to_square(img, new_size = (rows,column))
    #im_rz = ImageOps.fit(img, (rows,column), method = method)
    return im_rz

#Conversion of image to np

l = LabelEncoder()
y_all = l.fit_transform(y_all)
y_all = np_utils.to_categorical(y_all)

#print(l.classes_)
#x_train, x_valid, y_train, y_valid = test_train_split(x_all, y_all, test_size = 0.2, random_state = 42, stratify = y_all)

x_train, x_test, y_train, y_test = train_test_split(x_all, y_all, test_size = 0.2, random_state = 42, stratify = y_all)

batch_size = 32
nb_classes = 3
epochs = 300

x_train = np.array(x_train).astype('float32')
x_test = np.array(x_test).astype('float32')
y_train = np.array(y_train).astype('float32')
y_test = np.array(y_test).astype('float32')

x_train /= 255.
x_test /= 255.

#model

vgg = VGG16(weights='imagenet', include_top=False, input_shape=(125, 125, 3))

# Freeze the layers except the last 4 layers
for layer in vgg.layers[:-4]:
    layer.trainable = False

# Check the trainable status of the individual layers
for layer in vgg.layers:
    print(layer, layer.trainable)

model = Sequential()

# Add the vgg convolutional base model
model.add(vgg_conv)

# Add new layers
model.add(layers.Flatten())
model.add(layers.Dense(1024, activation='relu'))
model.add(layers.Dropout(0.5))
model.add(layers.Dense(3, activation='softmax'))

print(model.summary())


opt = keras.optimizers.RMSProp(lr=1e-4)

# #Training the model using optimizer

model.compile(loss='categorical_crossentropy',optimizer=opt,metrics=['accuracy']) 

#Save the best model obtained during training

#from keras.callbacks import ReduceLROnPlateau
#lr_reduce = ReduceLROnPlateau(monitor='val_acc', factor=0.1, epsilon=0.0001, patience=1, verbose=1)

filepath = "weights.h5"
checkpoint = ModelCheckpoint(filepath, monitor='val_acc', verbose=0, save_best_only=True, mode='max')
callbacks_list = [checkpoint]

#data generation and fitting

data_generation = ImageDataGenerator(
    rotation_range = 45, #Makes sense to keep the rotation between 0 and 10, we don't want to rotate it a lot
    width_shift_range = 0.0,
    height_shift_range = 0.0,
    horizontal_flip=True, #Random rotations, might help
    vertical_flip=True,
    zoom_range = 0.15
    )

#data_generation.fit(x_train)



#Model fitting! 

#tbCallBack = keras.callbacks.TensorBoard(log_dir="./Graph", histogram_freq = 0, write_graph = True, write_images = True)
#callbacks_list.append(tbCallBack)

model_param=model.fit_generator(data_generation.flow(x_train, y_train,batch_size=batch_size),
                                steps_per_epoch=x_train.shape[0] // batch_size,
                                epochs=epochs,
                validation_data=(x_test, y_test),
                validation_steps=x_test.shape[0], 
                callbacks=callbacks_list)

这是模型的总结

Layer (type)                 Output Shape              Param #
=================================================================
vgg16 (Model)                (None, 7, 7, 512)         14714688
_________________________________________________________________
flatten_1 (Flatten)          (None, 25088)             0
_________________________________________________________________
dense_1 (Dense)              (None, 1024)              25691136
_________________________________________________________________
dropout_1 (Dropout)          (None, 1024)              0
_________________________________________________________________
dense_2 (Dense)              (None, 3)                 3075
=================================================================
Total params: 40,408,899
Trainable params: 32,773,635
Non-trainable params: 7,635,264
_________________________________________________________________

我已使用此脚本将其从 .h5 转换为 .pb 文件，以便在 C++ 中使用。 https://github.com/bitbionic/keras-to-tensorflow/blob/master/k2tf_convert.py

我使用了与此代码类似的 C++ 代码 - https://github.com/bitbionic/keras-to-tensorflow/blob/master/main.cpp

在 C++ 中执行代码时，我收到以下错误 -

Running model failed: Invalid argument: Matrix size-incompatible: In[0]: [1,1152], In[1]: [4608,512]
         [[Node: dense_1/MatMul = MatMul[T=DT_FLOAT, transpose_a=false, transpose_b=false, _device="/job:localhost/replica:0/task:0/cpu:0"](flatten_1/Reshape, dense_1/kernel/read)]]

如何消除此错误以及 pb 文件是否未正确创建？

【问题讨论】：

当我将批量大小更改为 4 时，它工作正常并且没有给出任何错误。我不想将批量大小用作 4。有什么解决办法吗？ 【参考方案1】：

问题可能是由于将图像大小调整为 125*125，然后再将其输入 VGG16 网络。它需要 224*224 的图像大小，否则图像大小可能会在训练时减小到零，这可能是正在发生的事情。将图像大小保持为 224*224 会消除所有错误，并允许在更大的批量大小上训练模型。

【讨论】：

以上是关于微调 keras .h5 模型到 C++ 中使用的 .pb 模型会导致运行模型失败错误的主要内容，如果未能解决你的问题，请参考以下文章