替换 CuPy 中的 numpy.apply_along_axis

Posted 2023-02-16

技术标签:

【中文标题】替换 CuPy 中的 numpy.apply_along_axis【英文标题】：Replacement for numpy.apply_along_axis in CuPy 【发布时间】：2020-05-08 04:39:57 【问题描述】：

我有一个基于 NumPy 的神经网络，我正在尝试移植到 CuPy。我有一个功能如下：

import numpy as np

def tensor_diag(x): return np.apply_along_axis(np.diag, -1, x)

# Usage: (x is a matrix, i.e. a 2-tensor)
def sigmoid_prime(x): return tensor_diag(sigmoid(x) * (1 - sigmoid(x)))

这可以使用 NumPy，但 CuPy 没有该功能的类似物（自 2020 年 5 月 8 日起不受支持）。如何在 CuPy 中模拟这种行为？

【问题讨论】：

apply_along_axis 没有什么特别之处。在这个调用中，它只是在x 的所有轴上迭代，除了最后一个，将最后一个传递给np.diag。 x 的 shape 是什么？它的文档解释了它的作用。 【参考方案1】：

In [284]: arr = np.arange(24).reshape(2,3,4)

np.diag 接受一个一维数组，并返回一个具有对角线值的二维数组。 apply_along_axis 只迭代除最后一个以外的所有维度，并将最后一个数组一次传递给diag：

In [285]: np.apply_along_axis(np.diag,-1,arr)                                                          
Out[285]: 
array([[[[ 0,  0,  0,  0],
         [ 0,  1,  0,  0],
         [ 0,  0,  2,  0],
         [ 0,  0,  0,  3]],

        [[ 4,  0,  0,  0],
         [ 0,  5,  0,  0],
         [ 0,  0,  6,  0],
         [ 0,  0,  0,  7]],

        [[ 8,  0,  0,  0],
         [ 0,  9,  0,  0],
         [ 0,  0, 10,  0],
         [ 0,  0,  0, 11]]],


       [[[12,  0,  0,  0],
         [ 0, 13,  0,  0],
         [ 0,  0, 14,  0],
         [ 0,  0,  0, 15]],

        [[16,  0,  0,  0],
         [ 0, 17,  0,  0],
         [ 0,  0, 18,  0],
         [ 0,  0,  0, 19]],

        [[20,  0,  0,  0],
         [ 0, 21,  0,  0],
         [ 0,  0, 22,  0],
         [ 0,  0,  0, 23]]]])
In [286]: _.shape                                                                                      
Out[286]: (2, 3, 4, 4)

我可以做同样的映射：

In [287]: res = np.zeros((2,3,4,4),int)                                                                
In [288]: res[:,:,np.arange(4),np.arange(4)] = arr

检查apply 结果：

In [289]: np.allclose(_285, res)                                                                       
Out[289]: True

或者对于apply 的更直接副本，使用np.ndindex 生成所有i,j 元组对以迭代arr 的前两个维度：

In [298]: res = np.zeros((2,3,4,4),int)                                                                
In [299]: for ij in np.ndindex(2,3): 
     ...:     res[ij]=np.diag(arr[ij]) 
     ...:                                                                                              
In [300]: np.allclose(_285, res)                                                                       
Out[300]: True

【讨论】：

以上是关于替换 CuPy 中的 numpy.apply_along_axis的主要内容，如果未能解决你的问题，请参考以下文章