循环神经网络系列Tensorflow中dynamic_rnn

Posted 2021-12-30 月来客栈

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了循环神经网络系列Tensorflow中dynamic_rnn相关的知识，希望对你有一定的参考价值。

1.回顾

上一篇博文(循环神经网络系列（一)Tensorflow中BasicRNNCell)中我们介绍了在Tensoflow中，每个RNN单元的实现，以及对应各个参数的含义。自那之后，我们就能通过Tensorflow实现一个单元的计算了。

import tensorflow as tf
import numpy as np

x = np.array([[1, 0, 1, 2], [2, 1, 1, 1]])
X = tf.placeholder(dtype=tf.float32, shape=[2, 4], name=input)
cell = tf.nn.rnn_cell.BasicRNNCell(num_units=5)  # output_size:10,也可以换成GRUCell，LSTMAACell,BasicRNNCell
h0 = cell.zero_state(batch_size=2, dtype=tf.float32)  # batch_size:2
output, h1 = cell.call(X, h0)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    a, b = (sess.run([output, h1], feed_dict=X: x))
    print(output:)
    print(a)
    print(h1:)
    print(b)

>>

output:
[[ 0.4495004   0.9573416   0.6013933   0.75571895 -0.8172958 ]
 [ 0.6624889   0.7011481   0.68771356  0.77796507 -0.7617092 ]]
h1:
[[ 0.4495004   0.9573416   0.6013933   0.75571895 -0.8172958 ]
 [ 0.6624889   0.7011481   0.68771356  0.77796507 -0.7617092 ]]

通过以上的代码，我们完成了如下操作：

循环神经网络系列（二）Tensorflow中dynamic_rnn_循环神经网络

但是通常情况下，我们都是要进行这样的操作：

循环神经网络系列（二）Tensorflow中dynamic_rnn_python_02

输入 h 0 , x 1 h_0,x_1 h0,x1得到 o u t p u t 1 , h 1 output_1,h_1 output1,h1；然后输入 h 1 , x 2 h_1,x_2 h1,x2得到 o u t p u t 2 , h 2 output_2,h_2 output2,h2；接着再输入 h 2 , x 3 h_2,x_3 h2,x3得到 o u t p u t 3 , h 3 output_3,h_3 output3,h3以此类推。那么如何通过Tensorflow一步实现呢？

2. dynamic_rnn

为了实现一步计算多次，我们就要用到Tensorflow中的dynamic_rnn(),代码如下（实现了上图列出的三步

）：

import tensorflow as tf
import numpy as np
from tensorflow.python.ops import variable_scope as vs

output_size = 5
batch_size = 4
time_step = 3
dim = 3
cell = tf.nn.rnn_cell.BasicRNNCell(num_units=output_size)
inputs = tf.placeholder(dtype=tf.float32, shape=[time_step, batch_size, dim])
h0 = cell.zero_state(batch_size=batch_size, dtype=tf.float32)
X = np.array([[[1, 2, 1], [2, 0, 0], [2, 1, 0], [1, 1, 0]],  # x1
              [[1, 2, 1], [2, 0, 0], [2, 1, 0], [1, 1, 0]],  # x2
              [[1, 2, 1], [2, 0, 0], [2, 1, 0], [1, 1, 0]]])  # x3
outputs, final_state = tf.nn.dynamic_rnn(cell, inputs, initial_state=h0, time_major=True)

sess = tf.Session()
sess.run(tf.global_variables_initializer())
a, b = sess.run([outputs, final_state], feed_dict=inputs:X)
print(a)
print(b)

其中第7行time_step=3就表示计算三步，所以输入X就对应有三个部分。再最终的输出结果中，outputs里包含了( o u t p u t s 1 , o u t p u t s 2 , o u t p u t s 3 ) outputs_1,outputs_2,outputs_3) outputs1,outputs2,outputs3)，而final_stat就只是 h 3 h_3 h3，并且 o u t p u t s 3 , h 3 outputs_3,h_3 outputs3,h3是相等的。

结果：

outputs:
[[[ 0.9427065  -0.92617476 -0.79179853  0.6308035   0.07298201]
  [ 0.7051633  -0.62077284 -0.79618317  0.5004738  -0.20110159]
  [ 0.85066974 -0.77197933 -0.76875883  0.80251306 -0.04951192]
  [ 0.67497337 -0.57974416 -0.4408107   0.68083197  0.05233984]]# output1

 [[ 0.9828192  -0.9433205  -0.9233751   0.72930676 -0.34445292]
  [ 0.92153275 -0.58029604 -0.8949743   0.5431045  -0.46945637]
  [ 0.9690989  -0.7922626  -0.8973758   0.81312704 -0.46288016]
  [ 0.88565385 -0.6617377  -0.68075943  0.70066273 -0.34827012]]# output2

 [[ 0.99172366 -0.93298715 -0.9272905   0.7158564  -0.46278387]
  [ 0.9566409  -0.5595625  -0.9101479   0.58005375 -0.5905321 ]
  [ 0.9838727  -0.7693646  -0.91019756  0.82892674 -0.58026373]
  [ 0.9438508  -0.61732507 -0.7356022   0.73460865 -0.483655  ]]]# output3
final_state:
[[ 0.99172366 -0.93298715 -0.9272905   0.7158564  -0.46278387]
 [ 0.9566409  -0.5595625  -0.9101479   0.58005375 -0.5905321 ]
 [ 0.9838727  -0.7693646  -0.91019756  0.82892674 -0.58026373]
 [ 0.9438508  -0.61732507 -0.7356022   0.73460865 -0.483655  ]]# final_satae

3.总结

当使用dynamic_rnn时，对于输入数据的格式有两种：

第一种：输入格式为[batch_size,time_steps,input_size],此时得到的输出output的形状为[batch_size,time_steps,output_size]，final_state的形状为[batch_size,state_size]

第二种：也就是我们上面用到的，此时的输入格式为[time_steps,batch_size,input_size],得到的输出output的形状为[time_steps,batch_size,output_size]，final_state的形状仍然为[batch_size,state_size]，但此时要指定time_major = True

对比这两种输入方式，第二种最大的优点就是输出的结果形式方便我们观察。

更多内容欢迎扫码关注公众号月来客栈！

循环神经网络系列（二）Tensorflow中dynamic_rnn_python_03

以上是关于循环神经网络系列Tensorflow中dynamic_rnn的主要内容，如果未能解决你的问题，请参考以下文章

Tensorflow中循环神经网络及其Wrappers

TensorFlow：简单的循环神经网络

Tensorflow系列3:多层神经网络--解决非线性问题

tensorflow 基础学习十：RNN

机器学习与Tensorflow——循环神经网络长短时记忆网络

TensorFlow框架之RNN循环神经网络详解