“Pointer Sentinel Mixture Models” by Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher
Paper: http://arxiv.org/abs/1609.07843
PDF: http://arxiv.org/pdf/1609.07843
Description: http://metamind.io/research/the-wikitext-long-term-dependency-language-modeling-dataset/
Datasets:
- Download WikiText-2 (4.3 MB)
- Download WikiText-103 (181 MB)