用于时间序列异常检测的 Keras LSTM-VAE(变分自动编码器)

Posted

技术标签:

【中文标题】用于时间序列异常检测的 Keras LSTM-VAE(变分自动编码器)【英文标题】:Keras LSTM-VAE (Variational Autoencoder) for time-series anamoly detection 【发布时间】:2021-01-07 05:52:25 【问题描述】:

我正在尝试使用 Keras 为 LSTM-VAE 建模以进行时间序列重建。

我曾参考 https://github.com/twairball/keras_lstm_vae/blob/master/lstm_vae/vae.py 和 https://machinelearningmastery.com/lstm-autoencoders/ 创建 LSTM-VAE 架构。

我在训练网络时遇到了问题,在 Eager Execution 模式下训练时出现以下错误:

  InvalidArgumentError: Incompatible shapes: [8,1] vs. [32,1] [Op:Mul]

输入形状为(7752,30,1),此处为 30 个时间步长和 1 个特征。

模型编码器:

# encoder
latent_dim = 1
inter_dim = 32

#sample,timesteps, features
input_x = keras.layers.Input(shape= (X_train.shape[1], X_train.shape[2])) 

#intermediate dimension 
h = keras.layers.LSTM(inter_dim)(input_x)

#z_layer
z_mean = keras.layers.Dense(latent_dim)(h)
z_log_sigma = keras.layers.Dense(latent_dim)(h)
z = Lambda(sampling)([z_mean, z_log_sigma])

模型解码器:

# Reconstruction decoder
decoder1 = RepeatVector(X_train.shape[1])(z)
decoder1 = keras.layers.LSTM(100, activation='relu', return_sequences=True)(decoder1)
decoder1 = keras.layers.TimeDistributed(Dense(1))(decoder1)

采样功能:

batch_size = 32
def sampling(args):
    z_mean, z_log_sigma = args
    epsilon = K.random_normal(shape=(batch_size, latent_dim),mean=0., stddev=1.)
    return z_mean + z_log_sigma * epsilon

VAE 损失函数:

def vae_loss2(input_x, decoder1):
    """ Calculate loss = reconstruction loss + KL loss for each data in minibatch """
    # E[log P(X|z)]
    recon = K.sum(K.binary_crossentropy(input_x, decoder1), axis=1)
    # D_KL(Q(z|X) || P(z|X)); calculate in closed form as both dist. are Gaussian
    kl = 0.5 * K.sum(K.exp(z_log_sigma) + K.square(z_mean) - 1. - z_log_sigma, axis=1)

    return recon + kl

LSTM-VAE model architecture

有什么建议可以使模型工作吗?

【问题讨论】:

时间序列的 VAE LSTM:towardsdatascience.com/… 【参考方案1】:

您需要在采样函数中推断 batch_dim 并且需要注意您的损失...您的损失函数使用前一层的输出,因此您需要注意这一点。我使用model.add_loss(...)

来实现这个
# encoder
latent_dim = 1
inter_dim = 32
timesteps, features = 100, 1

def sampling(args):
    z_mean, z_log_sigma = args
    batch_size = tf.shape(z_mean)[0] # <================
    epsilon = K.random_normal(shape=(batch_size, latent_dim), mean=0., stddev=1.)
    return z_mean + z_log_sigma * epsilon

# timesteps, features
input_x = Input(shape= (timesteps, features)) 

#intermediate dimension 
h = LSTM(inter_dim, activation='relu')(input_x)

#z_layer
z_mean = Dense(latent_dim)(h)
z_log_sigma = Dense(latent_dim)(h)
z = Lambda(sampling)([z_mean, z_log_sigma])

# Reconstruction decoder
decoder1 = RepeatVector(timesteps)(z)
decoder1 = LSTM(inter_dim, activation='relu', return_sequences=True)(decoder1)
decoder1 = TimeDistributed(Dense(features))(decoder1)

def vae_loss2(input_x, decoder1, z_log_sigma, z_mean):
    """ Calculate loss = reconstruction loss + KL loss for each data in minibatch """
    # E[log P(X|z)]
    recon = K.sum(K.binary_crossentropy(input_x, decoder1))
    # D_KL(Q(z|X) || P(z|X)); calculate in closed form as both dist. are Gaussian
    kl = 0.5 * K.sum(K.exp(z_log_sigma) + K.square(z_mean) - 1. - z_log_sigma)

    return recon + kl

m = Model(input_x, decoder1)
m.add_loss(vae_loss2(input_x, decoder1, z_log_sigma, z_mean)) #<===========
m.compile(loss=None, optimizer='adam')

here the running notebook

【讨论】:

感谢@Marco Cerliani!它在 TF 2.3 版中工作!

以上是关于用于时间序列异常检测的 Keras LSTM-VAE(变分自动编码器)的主要内容,如果未能解决你的问题,请参考以下文章

CNN autoencoder 进行异常检测——TODO,使用keras进行测试

图像的深度学习异常检测

使用 Keras 进行秃头检测

keras中的预训练对象检测模型

用于异常检测的具有缺失值的时间序列的 STL 分解

用于 Elasticsearch 的替代异常检测引擎