When to use parametric models in reinforcement learning?

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Are Disentangled Representations Helpful for Abstract Visual Reasoning?

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Generating Diverse High-Fidelity Images with VQ-VAE-2

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.LG, stat.ML

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

- 2019 via Local Bibsonomy

Keywords: approximate, readings, generalization, optimization, information, compression, theory

Adversarial Self-Defense for Cycle-Consistent GANs

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

FreeLB: Enhanced Adversarial Training for Language Understanding

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Ease-of-Teaching and Language Structure from Emergent Communication

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Plan Arithmetic: Compositional Plan Vectors for Multi-Task Control

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

RTFM: Generalising to Novel Environment Dynamics via Reading

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Learning to Predict Without Looking Ahead: World Models Without Forward Prediction

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Are Sixteen Heads Really Better than One?

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Wasserstein Dependency Measure for Representation Learning

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Explanations can be manipulated and geometry is to blame

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Saccader: Improving Accuracy of Hard Attention Models for Vision

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Learning Robust Rewards with Adversarial Inverse Reinforcement Learning

arXiv e-Print archive - 2017 via Local Bibsonomy

Keywords: dblp

Explaining Image Classifiers by Counterfactual Generation

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.CV

Measuring the Effects of Data Parallelism on Neural Network Training

- 2018 via Local Bibsonomy

Keywords: parallel, distributed

Temporal Difference Variational Auto-Encoder

arXiv e-Print archive - 2018 via Local Bibsonomy

Keywords: dblp

Hierarchical Autoregressive Image Models with Auxiliary Decoders

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Meta-Learning Update Rules for Unsupervised Representation Learning

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.NE, stat.ML

Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.CL

Few-Shot Adversarial Learning of Realistic Neural Talking Head Models

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.CV, cs.GR, cs.LG

HellaSwag: Can a Machine Really Finish Your Sentence?

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.CL

Learning to Navigate in Cities Without a Map

arXiv e-Print archive - 2018 via Local Bibsonomy

Keywords: dblp

Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask

- 2019 via Local Bibsonomy

Keywords: pruning, nas

Decoupled Weight Decay Regularization

arXiv e-Print archive - 2017 via Local arXiv

Keywords: cs.LG, cs.NE, math.OC

Meta-learners' learning dynamics are unlike learners'

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.LG, cs.AI, stat.ML

MixMatch: A Holistic Approach to Semi-Supervised Learning

- 2019 via Local Bibsonomy

Keywords: semi-supervised-learning

Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

S$^\mathbf{4}$L: Self-Supervised Semi-Supervised Learning

arXiv e-Print archive - 2019 via Local arXiv

Keywords: cs.CV, cs.LG

Adversarial Examples Are Not Bugs, They Are Features

- 2019 via Local Bibsonomy

Keywords: adversarial

The Marginal Value of Adaptive Gradient Methods in Machine Learning

arXiv e-Print archive - 2017 via Local Bibsonomy

Keywords: dblp

Diversity is All You Need: Learning Skills without a Reward Function

arXiv e-Print archive - 2018 via Local Bibsonomy

Keywords: dblp

Exploration by Random Network Distillation

- 2018 via Local Bibsonomy

Keywords: reinforcement-learning

On the Pitfalls of Measuring Emergent Communication

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

The Lottery Ticket Hypothesis at Scale

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Generating Long Sequences with Sparse Transformers

arXiv e-Print archive - 2019 via Local Bibsonomy

Keywords: dblp

Relational Forward Models for Multi-Agent Learning

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.AI, cs.MA, stat.ML

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, stat.ML

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

arXiv e-Print archive - 2017 via Local arXiv

Keywords: cs.CV

An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.CV, cs.LG, stat.ML

Visualizing the Loss Landscape of Neural Nets

arXiv e-Print archive - 2017 via Local arXiv

Keywords: cs.LG, cs.CV, stat.ML

On the Intriguing Connections of Regularization, Input Gradients and Transferability of Evasion and Poisoning Attacks

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.CR, stat.ML, 68T10, 68T45

Adversarial Reprogramming of Neural Networks

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.CR, cs.CV, stat.ML

Learning Plannable Representations with Causal InfoGAN

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.AI, cs.CV, cs.NE, cs.RO, stat.ML

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.AI, cs.MA, stat.ML

Emergence of Grounded Compositional Language in Multi-Agent Populations

arXiv e-Print archive - 2017 via Local arXiv

Keywords: cs.AI, cs.CL

Model-Based Active Exploration

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.AI, cs.IT, cs.NE, math.IT, stat.ML

Episodic Curiosity through Reachability

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.AI, cs.CV, cs.RO, stat.ML

Large-Scale Study of Curiosity-Driven Learning

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, cs.AI, cs.CV, cs.RO, stat.ML

Multi-task Deep Reinforcement Learning with PopArt

arXiv e-Print archive - 2018 via Local arXiv

Keywords: cs.LG, stat.ML
