Archive for February, 2018

Title: Efficient Exploration through Bayesian Deep Q-Networks Authors: Kamyar Azizzadenesheli and Emma Brunskill and Animashree Anandkumar Categories: cs.AI cs.LG stat.ML We propose Bayesian Deep Q-Network (BDQN), a practical Thompson sampling based Reinforcement Learning (RL) Algorithm. Thompson sampling allows for targeted exploration in high dimensions through posterior sampling but is usually computationally expensive. We address this […]


Title: More Robust Doubly Robust Off-policy Evaluation Authors: Mehrdad Farajtabar, Yinlam Chow, and Mohammad Ghavamzadeh Categories: cs.AI We study the problem of off-policy evaluation (OPE) in reinforcement learning (RL), where the goal is to estimate the performance of a policy from the data generated by another policy(ies). In particular, we focus on the doubly robust (DR) […]


Title: A Unified Approach for Multi-step Temporal-Difference Learning with   Eligibility Traces in Reinforcement Learning Authors: Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan Categories: cs.AI cs.LG stat.ML Recently, a new multi-step temporal learning algorithm, called $Q(\sigma)$, unifies $n$-step Tree-Backup (when $\sigma=0$) and $n$-step Sarsa (when $\sigma=1$) by introducing a sampling parameter $\sigma$. However, similar […]


Title: Deep Reinforcement Learning for Image Hashing Authors: Jian Zhang, Yuxin Peng and Zhaoda Ye Categories: cs.CV Comments: 18 pages, submitted to ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM). arXiv admin note: text overlap with arXiv:1612.02541 Deep hashing methods have received much attention recently, which achieve promising results by taking advantage of the […]


Title: A Critical Investigation of Deep Reinforcement Learning for Navigation Authors: Vikas Dhiman, Shurjo Banerjee, Brent Griffin, Jeffrey M Siskind, Jason J Corso Categories: cs.RO cs.AI The navigation problem is classically approached in two steps: an exploration step, where map-information about the environment is gathered; and an exploitation step, where this information is used to […]


Title: Shared Autonomy via Deep Reinforcement Learning Authors: Siddharth Reddy, Sergey Levine, Anca Dragan Categories: cs.LG cs.HC cs.RO In shared autonomy, user input is combined with semi-autonomous control to achieve a common goal. The goal is often unknown ex-ante, so prior work enables agents to infer the goal from user input and assist with the task. Such methods […]


Title: Coordinated Exploration in Concurrent Reinforcement Learning Authors: Maria Dimakopoulou, Benjamin Van Roy Categories: cs.AI We consider a team of reinforcement learning agents that concurrently learn to operate in a common environment. We identify three properties – adaptivity, commitment, and diversity – which are necessary for efficient coordinated exploration and demonstrate that straightforward extensions to […]


Well, I will cheat a little for today since there is no paper related to Reinforcement Learning on Arxiv. As a consequence, I browse the one from Feb. 2nd… Title: Elements of Effective Deep Reinforcement Learning towards Tactical   Driving Decision Making Authors: Jingchu Liu, Pengfei Hou, Lisen Mu, Yinan Yu, Chang Huang Categories: cs.AI cs.LG Comments: […]


Hello everyone, Another day, another version of this blog and I hope this time, this will be the right one. mphuget.com knew different versions corresponding to my own interests but that time I would like to discuss about my own research which is (Deep) Reinforcement Learning. Why should I provide a blog on this subject? […]