Tensorflow 维度问题：ValueError: Shapes (3, 1) and (None, 3) is incompatible

Posted 2023-02-16

技术标签:

【中文标题】Tensorflow 维度问题：ValueError: Shapes (3, 1) and (None, 3) is incompatible【英文标题】：Tensorflow dimension issue: ValueError: Shapes (3, 1) and (None, 3) are incompatible 【发布时间】：2021-07-07 00:04:42 【问题描述】：

我对 NN 很陌生，在拟合模型时遇到了一些尺寸问题。这是我的情况：

model_sigmoid = tf.keras.Sequential([
  embedding_layer,
  GlobalAveragePooling1D(),
  Dense(3, activation="softmax")])

model_sigmoid.summary()

Model: "sequential_12"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
embedding (Embedding)        (None, None, 100)         1195200   
_________________________________________________________________
global_average_pooling1d_5 ( (None, 100)               0         
_________________________________________________________________
dense_11 (Dense)             (None, 3)                 303       
=================================================================
Total params: 1,195,503
Trainable params: 303
Non-trainable params: 1,195,200
___________________________________________

这是我想要训练的模型（这是一个设置起始基线的模型）。这是一个带有嵌入层的多类分类问题：GloVe 100d embedding

model_sigmoid.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])

history = model_sigmoid.fit(
        train, epochs=10, batch_size=128, 
        validation_data=validation, verbose=1
    )

train 和 validation 是我的训练和验证数据集的矢量化版本。

train_ds
<MapDataset shapes: ((None, 80), (3,)), types: (tf.int64, tf.float32)>
tweet, label = next(iter(train))

tweet
<tf.Tensor: shape=(1, 80), dtype=int64, numpy=
array([[   6,   32, 1321,    3,  157,  383,    4,   18,  137, 1222,    6,
          18,  181, 2770, 1024, 6781,   51,    6,  375,  240,  486,    0,
           0,    0,    0,    0,    0,    0,    0,    0,    0,    0,    0,
           0,    0,    0,    0,    0,    0,    0,    0,    0,    0,    0,
           0,    0,    0,    0,    0,    0,    0,    0,    0,    0,    0,
           0,    0,    0,    0,    0,    0,    0,    0,    0,    0,    0,
           0,    0,    0,    0,    0,    0,    0,    0,    0,    0,    0,
           0,    0,    0]])>

label
<tf.Tensor: shape=(3,), dtype=float32, numpy=array([1., 0., 0.], dtype=float32)>

如您所见，我的“X”是一个长度为 80 的序列，其整数对应于我的数据集中的初始单词。相反，我的“Y”是原始情绪值的编码版本（负面、中性、正面）。

当我调用 fit 操作时，我得到了

ValueError: Shapes (3, 1) and (None, 3) are incompatible

我很确定错误出在 Y 上，但我真的不知道如何修复张量的形状。

【问题讨论】：

我做了更多的挖掘和重塑我的标签的工作：tf.reshape(label, [1,3])。你能把它作为答案发布吗 【参考方案1】：

经过一些挖掘和更多的形状检查，我想出了如何解决上面的错误。

我在我的函数中添加了一个重塑调用：

def vectorize_text_and_reshape(text, label):
      text = tf.expand_dims(text, -1)
      return vectorizer(text), tf.reshape(label, [1,3]) 

train_ds = train_tf.map(vectorize_text_and_reshape)
val_ds = val_tf.map(vectorize_text_and_reshape)
test_ds = test_tf.map(vectorize_text_and_reshape)

我已经实现了上面的vectorize_text_and_reshape 函数来对我的文本数据进行矢量化。我只需要在标签级别添加重塑调用。这将我的标签从 (3,) 形状变成了 (1,3)。

【讨论】：

以上是关于Tensorflow 维度问题：ValueError: Shapes (3, 1) and (None, 3) is incompatible的主要内容，如果未能解决你的问题，请参考以下文章

Tensorflow卷积网络——维度（形状）是如何计算的？

在 Tensorflow 中删除张量的维度

Tensorflow实现cnn中的维度问题

[TensorFlow]Tensor维度理解

如何在 Tensorflow 中对无维度的张量进行切片