Sequence-to-Sequence Learning as Beam-Search Optimization

arXiv e-Print archive - 2016 via Local arXiv

Keywords: cs.CL, cs.LG, cs.NE, stat.ML

Understanding deep learning requires rethinking generalization

arXiv e-Print archive - 2016 via Local arXiv

Keywords: cs.LG

Regularizing Trajectory Optimization with Denoising Autoencoders

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Dual Learning for Machine Translation

arXiv e-Print archive - 2016 via Local arXiv

Keywords: cs.CL

Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.CL

Comparing Rewinding and Fine-tuning in Neural Network Pruning

International Conference on Learning Representations - 2020 via Local Bibsonomy

Keywords: dblp

Learning to Predict Without Looking Ahead: World Models Without Forward Prediction

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Alternative structures for character-level RNNs

arXiv e-Print archive - 2015 via Local Bibsonomy

Keywords: dblp

Deep Residual Learning for Image Recognition

arXiv e-Print archive - 2015 via Local Bibsonomy

Keywords: dblp

Molecular Graph Convolutions: Moving Beyond Fingerprints

arXiv e-Print archive - 2016 via Local Bibsonomy

Keywords: dblp
