Top Research Papers on Reinforcement Learning
Discover the key research papers that have shaped the field of reinforcement learning. These papers provide essential insights into the algorithms, methodologies, and applications driving this area of AI. Whether you are a researcher, student, or enthusiast, delve into these top papers to deepen your understanding of reinforcement learning.
Looking for research-backed answers?Try AI Search
Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones
184 Citations 2021Brijen Thananjeyan, Ashwin Balakrishna, Suraj Nair + 7 more
IEEE Robotics and Automation Letters
This work proposes Recovery RL, an algorithm which navigates this tradeoff by leveraging offline data to learn about constraint violating zones before policy learning and separating the goals of improving task performance and constraint satisfaction across two policies: a task policy that only optimizes the task reward and a recovery policy that guides the agent to safety when constraint violation is likely.
CFR-RL: Traffic Engineering With Reinforcement Learning in SDN
162 Citations 2020Junjie Zhang, Minghao Ye, Zehua Guo + 2 more
IEEE Journal on Selected Areas in Communications
CFR-RL (Critical Flow Rerouting-Reinforcement Learning), a Reinforcement Learning-based scheme that learns a policy to select critical flows for each given traffic matrix automatically and reroutes these selected critical flows to balance link utilization of the network by formulating and solving a simple Linear Programming problem.
RL-CycleGAN: Reinforcement Learning Aware Simulation-to-Real
149 Citations 2020Kanishka Rao, C.J. Harris, Alex Irpan + 3 more
journal unavailable
The RL-CycleGAN, a new approach for simulation-to-real-world transfer for reinforcement learning, is obtained by incorporating the RL-scene consistency loss into unsupervised domain translation, which ensures that the translation operation is invariant with respect to the Q-values associated with the image.
RL-Routing: An SDN Routing Algorithm Based on Deep Reinforcement Learning
145 Citations 2020Yiren Chen, Amir Rezapour, Wen-Guey Tzeng + 1 more
IEEE Transactions on Network Science and Engineering
This work develops a reinforcement learning routing algorithm (RL-Routing) to solve a traffic engineering problem of SDN in terms of throughput and delay and considers comprehensive network information for state representation and use one-to-many network configuration for routing choices.
GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph Neural Networks and Reinforcement Learning
266 Citations 2020Hanrui Wang, Kuan Wang, Jiacheng Yang + 4 more
journal unavailable
This paper presents GCN-RL Circuit Designer, leveraging reinforcement learning (RL) to transfer the knowledge between different technology nodes and topologies, and demonstrates that RL with transfer learning can achieve much higher FoMs than methods without knowledge transfer.
RL-GA: A Reinforcement Learning-based Genetic Algorithm for Electromagnetic Detection Satellite Scheduling Problem
134 Citations 2023Yanjie Song, Luona Wei, Qing Yang + 3 more
Swarm and Evolutionary Computation
Through the experimental verification of multiple instances, it can be seen that the RL-GA can solve the EDSSP problem effectively and performs better in several aspects than the state-of-the-art algorithms.
Reinforced model predictive control (RL-MPC) for building energy management
202 Citations 2022Javier Arroyo, Carlo Manna, Fred Spiessens + 1 more
Applied Energy
sponsorship: This work emerged from the IBPSA Project 1, an international project conducted under the umbrella of the International Building Performance Simulation Association (IBPSA) . Project 1 will develop and demonstrate a BIM/GIS and Modelica Framework for building and community energy system design and operation. The work of Javier Arroyo is financed by VITO, Belgium through a PhD Fellowship (grant number 1710754) . Finally, the authors wish to thank to Brida V. Mbuwir, Jan Drgona, and Iago Cupeiro Figueroa for kindly reviewing the paper. (Project 1 will develop and demonstrate a BIM/GIS...
A Survey of Deep RL and IL for Autonomous Driving Policy Learning
182 Citations 2021Zeyu Zhu, Huijing Zhao
IEEE Transactions on Intelligent Transportation Systems
This is the first survey to focus on AD policy learning using DRL/DIL, which is addressed simultaneously from the system, task-driven and problem-driven perspectives.
Hierarchical Reinforcement Learning
363 Citations 2021Shubham Pateria, Budhitama Subagdja, Ah‐Hwee Tan + 1 more
ACM Computing Surveys
A survey of the diverse HRL approaches concerning the challenges of learning hierarchical policies, subtask discovery, transfer learning, and multi-agent learning using HRL is presented according to a novel taxonomy of the approaches.
Deep Reinforcement Learning
216 Citations 2020Hao Dong, Zihan Ding, Shanghang Zhang
journal unavailable
This is the first comprehensive and self-contained introduction to deep reinforcement learning, covering all aspects from fundamentals and research to applications. It includes examples and codes to help readers practice and implement the techniques.
Transfer Learning in Deep Reinforcement Learning: A Survey
150 Citations 2020Zhuangdi Zhu, Kaixiang Lin, Jiayu Zhou + 1 more
arXiv (Cornell University)
Reinforcement learning is a learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in reinforcement learning upon the fast development of deep neural networks. Along with the promising prospects of reinforcement learning in numerous domains such as robotics and game-playing, transfer learning has arisen to tackle various challenges faced by reinforcement learning, by transferring knowledge from external expertise to facilitate the efficiency and effectiveness of the learning process. In this survey, we systematically investigate the r...
Offline Reinforcement Learning with Implicit Q-Learning
129 Citations 2021Ilya Kostrikov, Ashvin Nair, Sergey Levine
arXiv (Cornell University)
Offline reinforcement learning requires reconciling two conflicting aims: learning a policy that improves over the behavior policy that collected the dataset, while at the same time minimizing the deviation from the behavior policy so as to avoid errors due to distributional shift. This trade-off is critical, because most current offline reinforcement learning methods need to query the value of unseen actions during training to improve the policy, and therefore need to either constrain these actions to be in-distribution, or else regularize their values. We propose an offline RL method that ne...
Conservative Q-Learning for Offline Reinforcement Learning
531 Citations 2020Aviral Kumar, Aurick Zhou, George Tucker + 1 more
arXiv (Cornell University)
Conservative Q-learning (CQL) is proposed, which aims to address limitations of offline RL methods by learning a conservative Q-function such that the expected value of a policy under this Q- function lower-bounds its true value.
Deep learning, reinforcement learning, and world models
419 Citations 2022Yutaka Matsuo, Yann LeCun, Maneesh Sahani + 5 more
Neural Networks
This review of talks and discussions in the "Deep Learning and Reinforcement Learning" session of the symposium, International Symposium on Artificial Intelligence and Brain Science, discusses whether the authors can achieve comprehensive understanding of human intelligence based on the recent advances of deep learning and reinforcement learning algorithms.
Transfer Learning in Deep Reinforcement Learning: A Survey
630 Citations 2023Zhuangdi Zhu, Kaixiang Lin, Anil K. Jain + 1 more
IEEE Transactions on Pattern Analysis and Machine Intelligence
This survey systematically investigates the recent progress of transfer learning approaches in the context of deep reinforcement learning, and provides a framework for categorizing the state-of-the-art transfer Learning approaches under which to analyze their goals, methodologies, compatible reinforcement learning backbones, and practical applications.
Reinforcement Learning in Healthcare: A Survey
443 Citations 2021Chao Yu, Jiming Liu, Shamim Nemati + 1 more
ACM Computing Surveys
This survey provides an extensive overview of RL applications in a variety of healthcare domains, ranging from dynamic treatment regimes in chronic diseases and critical care, automated medical diagnosis, and many other control or scheduling problems that have infiltrated every aspect of the healthcare system.
Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning
209 Citations 2021Hua Jiang, Liangcai Zeng, Gongfa Li + 1 more
Sensors
A state-of-the-art survey on an intelligent robot with the capability of autonomous deciding and learning reveals that the latest research in deep learning and reinforcement learning has paved the way for highly complex tasks to be performed by robots.
Beyond dichotomies in reinforcement learning
132 Citations 2020Anne Collins, Jeffrey Cockburn
Nature reviews. Neuroscience
It is argued that the field is well positioned to move beyond simplistic dichotomies, and a means of refocusing research questions towards the rich and complex components that comprise learning and decision-making is proposed.
Deep Reinforcement Learning: A Survey
658 Citations 2022Xu Wang, Sen Wang, Xingxing Liang + 5 more
IEEE Transactions on Neural Networks and Learning Systems
The fundamental theories, key algorithms, and primary research domains of DRL, in addition to value-based and policy-based DRL algorithms, are summarized and the advances in maximum entropy- based DRL are summarized.
Reinforcement Learning with Augmented Data
245 Citations 2020Michael Laskin, Kimin Lee, Adam Stooke + 3 more
arXiv (Cornell University)
Learning from visual observations is a fundamental yet challenging problem in Reinforcement Learning (RL). Although algorithmic advances combined with convolutional neural networks have proved to be a recipe for success, current methods are still lacking on two fronts: (a) data-efficiency of learning and (b) generalization to new environments. To this end, we present Reinforcement Learning with Augmented Data (RAD), a simple plug-and-play module that can enhance most RL algorithms. We perform the first extensive study of general data augmentations for RL on both pixel-based and state-based inp...