Skip to content

Subscribe
GA on AI
About
ML Jobs
AI Companies
AI Meetup (SF)
DataLabeling API

AI Strategy, Machine Learning and Deep Learning

Posted on September 12, 2016

d211: Language Model on One Billion Word Benchmark

https://github.com/tensorflow/models/tree/master/lm_1b

Introduction

“In this release, we open source a model trained on the One Billion Word Benchmark (http://arxiv.org/abs/1312.3005), a large language corpus in English which was released in 2013. This dataset contains about one billion words, and has a vocabulary size of about 800K words. It contains mostly news data. Since sentences in the training set are shuffled, models can ignore the context and focus on sentence level language modeling.

In the original release and subsequent work, people have used the same test set to train models on this dataset as a standard benchmark for language modeling. Recently, we wrote an article (http://arxiv.org/abs/1602.02410) describing a model hybrid between character CNN, a large and deep LSTM, and a specific Softmax architecture which allowed us to train the best model on this dataset thus far, almost halving the best perplexity previously obtained by others.”

@article{jozefowicz2016exploring, title={Exploring the Limits of Language Modeling}, author={Jozefowicz, Rafal and Vinyals, Oriol and Schuster, Mike and Shazeer, Noam and Wu, Yonghui}, journal={arXiv preprint arXiv:1602.02410}, year={2016} }

Share this:

Twitter
Facebook
Google

Related

Categories SourcesTags NLP, TensorFlowLeave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Comment

Name *

Email *

Website

Notify me of follow-up comments by email.

Notify me of new posts by email.

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Post navigation

Previous Previous post: d210: What are the best resources to learn about deep learning?

Next Next post: d211: Baidu open sources its deep learning platform PaddlePaddle (PArallel Distributed Deep LEarning)

Menu

Subscribe
GA on AI
About
ML Jobs
AI Companies
AI Meetup (SF)
DataLabeling API

Search for:

Tags

AI Jobs Andrej Karpathy Andrew Ng Baidu Berkeley Books DARPA Dataset Deep Learning DeepMind Demis Hassabis Facebook FAIR Games Geoff Hinton Google Google Brain Greg Brockman Hardware Healthcare Hugo Larochelle Ian Goodfellow IBM Watson Ilya Sutskever Intel Keras Mark Zuckerberg Marvin Minsky Microsoft MIT NIPS NLP NVIDIA OpenAI PyTorch SDC Self-Driving Cars Stanford Stephen Wolfram TensorFlow Tesla Tutorial Uber Yann LeCun Yoshua Bengio

Recent Posts

d1026: Deep Learning for Classical Japanese Literature
d1025: PyTorch v1.0 stable release
d1024: Depth First Learning Fellowship: $4000 grants to build ML curricula
d1023: Machine Learning for Combinatorial Optimization
d1022: Can you tell if these faces are real or GAN-generated?

Recent Comments

d531: AI and Machine Learning Jobs July-August 2017 | AI:Mechanic on d380: AI and Machine Learning Jobs California, February 2017
d531: AI and Machine Learning Jobs July-August 2017 | AI:Mechanic on d501: AI and Machine Learning Jobs June-July 2017
d531: AI and Machine Learning Jobs July-August 2017 | AI:Mechanic on d346: Machine Learning Jobs, January 2017 [San Francisco, Bay Area]
d501: AI and Machine Learning Jobs June-July 2017 | AI:Mechanic on d440: AI / Machine Learning Jobs April 2017
d501: AI and Machine Learning Jobs June-July 2017 | AI:Mechanic on d412: Machine Learning and AI Jobs, March 2017

Archives

Categories

AI companies
Machine Learning Jobs
News
Sources
Study

Meta

Log in
Entries RSS
Comments RSS
WordPress.org

Proudly powered by WordPress | Theme: Afterlight by WordPress.com.