Going Wider: Recurrent Neural Network With Parallel Cells

arXiv (Cornell University), 2017

Danhao Zhu, Si Shen, Xinyu Dai, Jiajun Chen

Abstract

Recurrent neural networks (RNNs) have been widely applied to sequence modeling. In an RNN, the hidden states at the current step are fully connected to those at the previous step, so influence from less related features at the previous step may decrease the model's learning ability. We propose a simple technique called parallel cells (PCs) to enhance the learning ability of RNNs. In each layer, we run multiple small RNN cells rather than one single large cell. In this paper, we evaluate PCs on two tasks. On language modeling on the Penn Treebank (PTB), our model outperforms state-of-the-art models, decreasing perplexity from 78.6 to 75.3. On Chinese-English translation, our model improves the BLEU score by 0.39 points over the baseline model.
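
The idea of running several small cells in place of one large cell can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes plain tanh RNN cells, assumes every cell sees the full input but only its own slice of the previous hidden state, and assumes the cell outputs are concatenated to form the layer's hidden state. The function names (`init_parallel_cells`, `parallel_cells_step`, `recurrent_param_count`) and all sizes are hypothetical.

```python
import numpy as np


def init_parallel_cells(input_size, hidden_size, num_cells, seed=0):
    """Create weights for `num_cells` small tanh RNN cells (illustrative only)."""
    assert hidden_size % num_cells == 0, "hidden_size must split evenly across cells"
    rng = np.random.default_rng(seed)
    cell_size = hidden_size // num_cells
    params = []
    for _ in range(num_cells):
        params.append({
            "W_x": rng.normal(0.0, 0.1, (input_size, cell_size)),  # input -> cell
            "W_h": rng.normal(0.0, 0.1, (cell_size, cell_size)),   # cell -> same cell
            "b": np.zeros(cell_size),
        })
    return params


def parallel_cells_step(params, x_t, h_prev):
    """One time step: each cell updates from x_t and its own slice of h_prev."""
    cell_size = params[0]["b"].shape[0]
    outputs = []
    for k, p in enumerate(params):
        h_k = h_prev[k * cell_size:(k + 1) * cell_size]  # no cross-cell recurrence
        outputs.append(np.tanh(x_t @ p["W_x"] + h_k @ p["W_h"] + p["b"]))
    # Assumption: the layer's hidden state is the concatenation of cell outputs.
    return np.concatenate(outputs)


def recurrent_param_count(params):
    """Number of hidden-to-hidden weights across all cells."""
    return sum(p["W_h"].size for p in params)
```

Under these assumptions, a 16-unit layer split into 4 cells has 4 × (4 × 4) = 64 hidden-to-hidden weights, compared with 16 × 16 = 256 for a single fully connected cell of the same width, and a hidden unit in one cell cannot directly influence units in another cell at the next step.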

Related Papers

Combination of Recurrent Neural Networks and Factored Language Models for Code-Switching Language Modeling (2013)
→ Improved topic-dependent language modeling using information retrieval techniques (1999), 55 cited
→ Verifying the long-range dependency of RNN language models (2016), 2 cited
→ Going Wider: Recurrent Neural Network With Parallel Cells (2017), 5 cited
→ Building Personalized Language Models Through Language Model Interpolation (2023)