Regularizing and Optimizing LSTM Language Models
An Analysis of Neural Language Modeling at Multiple Scales

This code was originally forked from the PyTorch word-level language modeling example. The ...