Convolutional Neural Networks for Sentence Classification

Posted big_matster

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了Convolutional Neural Networks for Sentence Classification相关的知识,希望对你有一定的参考价值。

摘要

We report on a series of experiments with
convolutional neural networks (CNN)
trained on top of pre-trained word vectors for sentence-level classification tasks.
We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific
vectors through fine-tuning offers further
gains in performance. We additionally
propose a simple modification to the architecture to allow for the use of both
task-specific and static vectors. The CNN
models discussed herein improve upon the
state of the art on 4 out of 7 tasks, which
include sentiment analysis and question
classification

  • 任务:句子级分类任务sentence-level classification tasks
  • a simple CNN with little hyperparameter tuniing and static vectors.

模型架构


x i ∈ R k x_i \\in R^k xiRk the k-dimensional word vector
A sentence of length: n n n
x 1 : n = x 1 ⊗ x 2 ⊗ ⋯   , ⊗ x n x_1:n = x_1\\otimes x_2\\otimes \\cdots,\\otimes x_n x1:n=x1x2,xn

⊗ \\otimes is the concatenation operator.

x i : i + j x_i:i+j xi:i+j 代表单词的拼接
w ∈ R h k w \\in R^hk wRhk: 卷积滤波器。

卷积操作

a max-over-time pooling operation

c ^ = max ⁡ c \\hatc = \\maxc c^=maxc

倒数第二层加入dropout ,防止过拟合。

解决了句子长度可变问题。

  • the penultimate layer 倒数第二层

a fully connected softmax layer

数据集

MR

SST-1:

SST-2:

Subj

TREC

CR

• MPQA

更新算法

  • 随机梯度下降法:Adadelta 算法
  • 预训练词向量:the publicly available word2vec vectors

模型变体

  • CNN-rand
  • CNN-static
  • CNN-non-static
  • CNN-multichannel

以上是关于Convolutional Neural Networks for Sentence Classification的主要内容,如果未能解决你的问题,请参考以下文章

卷积神经网络(Convolutional Neural Network, CNN)

Convolutional Neural Networks for Sentence Classification

卷积神经网络用于视觉识别Convolutional Neural Networks for Visual Recognition

Convolutional neural network (CNN) - Pytorch版

课程四(Convolutional Neural Networks),第四 周(Special applications: Face recognition & Neural style tr

Learning Convolutional Neural Networks Backpropagation