预期的输入batch_size以匹配目标batch_size（11）

Question

我知道这似乎是一个普遍的问题，但是我找不到解决方案。我正在运行一个多标签分类模型，并且张量大小有问题。

我的完整代码如下：

from transformers import DistilBertTokenizerFast, DistilBertForSequenceClassification
import torch

# Instantiating tokenizer and model
tokenizer = DistilBertTokenizerFast.from_pretrained('distilbert-base-cased')
model = DistilBertForSequenceClassification.from_pretrained('distilbert-base-cased')

# Instantiating quantized model
quantized_model = torch.quantization.quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)

# Forming data tensors
input_ids = torch.tensor(tokenizer.encode(x_train[0], add_special_tokens=True)).unsqueeze(0)
labels = torch.tensor(Y[0]).unsqueeze(0)

# Train model
outputs = quantized_model(input_ids, labels=labels)
loss, logits = outputs[:2]

哪个会产生错误：

ValueError: Expected input batch_size (1) to match target batch_size (11)

Input_ids看起来像：

tensor([[  101,   789,   160,  1766,  1616,  1110,   170,  1205,  7727,  1113,
           170,  2463,  1128,  1336,  1309,  1138,   112,   119, 11882, 11545,
           119,   108, 15710,   108,  3645,   108,  3994,   102]])

具有形状：

torch.Size([1, 28])

并且标签看起来像：

tensor([[0, 1, 0, 0, 0, 0, 1, 0, 0, 0, 1]])

具有形状：

torch.Size([1, 11])

input_ids的大小将随着要编码的字符串的大小而变化。

[我还注意到，当输入5的Y值以产生5个标签时，会产生错误：

ValueError: Expected input batch_size (1) to match target batch_size (55).

带有标签形状：

torch.Size([1, 5, 11])

（（请注意，我没有输入5个input_id，这大概就是为什么输入大小保持不变的原因）

我已经尝试了几种不同的方法来使它们起作用，但是我现在很茫然。我真的很感谢一些指导。谢谢！