PyTorch中的交叉熵

Question

我对PyTorch中的交叉熵损失感到有些困惑。

考虑这个例子：

import torch
import torch.nn as nn
from torch.autograd import Variable

output = Variable(torch.FloatTensor([0,0,0,1])).view(1, -1)
target = Variable(torch.LongTensor([3]))

criterion = nn.CrossEntropyLoss()
loss = criterion(output, target)
print(loss)

我希望损失为0.但我得到：

Variable containing:
 0.7437
[torch.FloatTensor of size 1]

据我所知，交叉熵可以像这样计算：

但不应该是1 * log（1）= 0的结果？

我尝试了不同的输入，如单热编码，但这根本不起作用，所以看起来损失函数的输入形状是可以的。

如果有人可以帮助我并告诉我我的错误在哪里，我将非常感激。

提前致谢！

Answer 1

另一答案

Answer 2

另一答案