Paper summary
The authors propose a simplified variant of the LSTM. They remove some of the non-linearities and weighted components to arrive at the recurrent additive network (RAN). The model is evaluated on three language modeling datasets: PTB, the Billion Word Benchmark, and character-level Text8.
Kenton Lee, Omer Levy, and Luke Zettlemoyer
Summary by Marek Rei 2 months ago
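
To make the simplification concrete, here is a minimal NumPy sketch of a single RAN step. It assumes the usual formulation of the model: the candidate content is a plain linear projection of the input, the input and forget gates are sigmoid functions of the previous hidden state and the current input, and the cell state is a gated sum of the candidate and the previous cell state. The parameter names (`W_cx`, `W_ih`, and so on), the helper functions, and the dimensions are illustrative assumptions and do not come from the summary above.

```python
import numpy as np


def sigmoid(z):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-z))


def init_params(input_dim, hidden_dim, seed=0):
    """Create small random parameters (hypothetical initialization scheme)."""
    rng = np.random.default_rng(seed)

    def mat(rows, cols):
        return rng.normal(scale=0.1, size=(rows, cols))

    return {
        "W_cx": mat(hidden_dim, input_dim),   # linear content projection
        "W_ih": mat(hidden_dim, hidden_dim),  # input gate, recurrent part
        "W_ix": mat(hidden_dim, input_dim),   # input gate, input part
        "b_i": np.zeros(hidden_dim),
        "W_fh": mat(hidden_dim, hidden_dim),  # forget gate, recurrent part
        "W_fx": mat(hidden_dim, input_dim),   # forget gate, input part
        "b_f": np.zeros(hidden_dim),
    }


def ran_step(x_t, h_prev, c_prev, params, use_tanh=True):
    """One RAN step; returns the new hidden state and cell state."""
    # Candidate content: no non-linearity and no recurrent term,
    # unlike the LSTM candidate.
    c_tilde = params["W_cx"] @ x_t
    # Gates depend on the previous hidden state and the current input.
    i_t = sigmoid(params["W_ih"] @ h_prev + params["W_ix"] @ x_t + params["b_i"])
    f_t = sigmoid(params["W_fh"] @ h_prev + params["W_fx"] @ x_t + params["b_f"])
    # Purely additive state update: a gated sum of new and old content.
    c_t = i_t * c_tilde + f_t * c_prev
    # Output is tanh of the cell state, or the cell state itself.
    h_t = np.tanh(c_t) if use_tanh else c_t
    return h_t, c_t


def run_sequence(xs, params, hidden_dim, use_tanh=True):
    """Run the cell over a sequence of input vectors, starting from zeros."""
    h = np.zeros(hidden_dim)
    c = np.zeros(hidden_dim)
    hidden_states = []
    for x_t in xs:
        h, c = ran_step(x_t, h, c, params, use_tanh=use_tanh)
        hidden_states.append(h)
    return np.stack(hidden_states)
```

Because the update is additive and the candidate is linear, each cell state in this sketch unrolls into a gate-weighted sum of the projected inputs seen so far, which is the property that distinguishes it from a standard LSTM cell.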
