Lista de obras - David Silver - Dominio Público Uruguay

A Monte-Carlo AIXI Approximation

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning ⬇️

artículo científico publicado en 2017

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

artículo científico publicado en 2018

Bayes-Adaptive Simulation-based Search with Value Function Approximation ⬇️

artículo científico publicado en 2014

Bootstrapping from Game Tree Search ⬇️

artículo científico publicado en 2009

Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation ⬇️

artículo científico publicado en 2009

Discovering faster matrix multiplication algorithms with reinforcement learning

artículo científico publicado en 2022

Discovery of Useful Questions as Auxiliary Tasks ⬇️

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search ⬇️

artículo científico publicado en 2012

Grandmaster level in StarCraft II using multi-agent reinforcement learning

artículo científico publicado en 2019

Highly accurate protein structure prediction with AlphaFold

artículo científico publicado en 2021

Human-level control through deep reinforcement learning

artículo científico publicado en 2015

Imagination-Augmented Agents for Deep Reinforcement Learning ⬇️

artículo científico publicado en 2017

Learning Continuous Control Policies by Stochastic Value Gradients ⬇️

artículo científico publicado en 2015

Learning values across many orders of magnitude ⬇️

artículo científico publicado en 2016

Mastering Atari, Go, chess and shogi by planning with a learned model

artículo científico publicado en 2020

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm ⬇️

artículo científico publicado en 2017

Mastering the game of Go with deep neural networks and tree search

artículo científico publicado en 2016

Mastering the game of Go without human knowledge ⬇️

artículo científico publicado en 2017

Mastering the game of Stratego with model-free multiagent reinforcement learning

artículo científico publicado en 2022

Meta-Gradient Reinforcement Learning ⬇️

scholarly article by Zhongwen Xu et al published 2018 in Advances in Neural Information Processing Systems 31

Monte-Carlo Planning in Large POMDPs ⬇️

Natural Value Approximators: Learning when to Trust Past Estimates ⬇️

artículo científico publicado en 2017

Successor Features for Transfer in Reinforcement Learning ⬇️

artículo científico publicado en 2017

The Option Keyboard: Combining Skills in Reinforcement Learning ⬇️

scholarly article by Andre Barreto et al published 2019 in Advances in Neural Information Processing Systems 32

Lista de obras de David Silver

A Monte-Carlo AIXI Approximation

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning ⬇️

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

Bayes-Adaptive Simulation-based Search with Value Function Approximation ⬇️

Bootstrapping from Game Tree Search ⬇️

Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation ⬇️

Discovering faster matrix multiplication algorithms with reinforcement learning

Discovery of Useful Questions as Auxiliary Tasks ⬇️

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search ⬇️

Grandmaster level in StarCraft II using multi-agent reinforcement learning

Highly accurate protein structure prediction with AlphaFold

Human-level control through deep reinforcement learning

Imagination-Augmented Agents for Deep Reinforcement Learning ⬇️

Learning Continuous Control Policies by Stochastic Value Gradients ⬇️

Learning values across many orders of magnitude ⬇️

Mastering Atari, Go, chess and shogi by planning with a learned model

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm ⬇️

Mastering the game of Go with deep neural networks and tree search

Mastering the game of Go without human knowledge ⬇️

Mastering the game of Stratego with model-free multiagent reinforcement learning

Meta-Gradient Reinforcement Learning ⬇️

Monte-Carlo Planning in Large POMDPs ⬇️

Natural Value Approximators: Learning when to Trust Past Estimates ⬇️

Successor Features for Transfer in Reinforcement Learning ⬇️

The Option Keyboard: Combining Skills in Reinforcement Learning ⬇️