Differences

This shows you the differences between two versions of the page.

recurrent_reinforcement_learning [2018/03/08 10:50]
admin
recurrent_reinforcement_learning [2018/12/05 10:47] (current)
admin
Line 32: Line 32:
 algorithm.
  
 +https://arxiv.org/abs/1808.10552 Directed Exploration in PAC Model-Free Reinforcement Learning
 +
 +https://arxiv.org/abs/1708.05866v2 A Brief Survey of Deep Reinforcement Learning
 +
 +https://papers.nips.cc/paper/8200-non-delusional-q-learning-and-value-iteration.pdf Non-delusional Q-learning and value iteration