|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 36 occurrences of 24 keywords
|
|
|
Results
Found 1303 publication records. Showing 1303 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
158 | Thomas Hanselmann, Lyle Noakes, Anthony Zaknich |
Continuous-Time Adaptive Critics. |
IEEE Trans. Neural Networks |
2007 |
DBLP DOI BibTeX RDF |
|
146 | Shamama Anwar, K. Sridhar Patnaik |
Actor Critic Learning: A Near Set Approach. |
RSCTC |
2008 |
DBLP DOI BibTeX RDF |
ethogram, ethology, actor critic, rough sets, Adaptive learning, approximation space, near sets |
127 | Jan Peters 0001, Sethu Vijayakumar, Stefan Schaal |
Natural Actor-Critic. |
ECML |
2005 |
DBLP DOI BibTeX RDF |
|
113 | Rafiuddin Syam, Keigo Watanabe, Kiyotaka Izumi |
An Adaptive Actor-critic Algorithm with Multi-step Simulated Experiences for Controlling Nonholonomic Mobile Robots. |
Soft Comput. |
2007 |
DBLP DOI BibTeX RDF |
Actor-critic algorithms, Multi-step prediction, Nonlinear predictive model, Simulated experience, Kinematic model, Nonholonomic mobile robot |
113 | Rafiuddin Syam, Keigo Watanabe, Kiyotaka Izumi |
Adaptive actor-critic learning for the control of mobile robots by applying predictive models. |
Soft Comput. |
2005 |
DBLP DOI BibTeX RDF |
Actor-critic algorithms, Tracking control problem, Predictive model, Temporal difference learning, Nonholonomic mobile robot |
107 | Jooyoung Park, Jongho Kim, Daesung Kang |
An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm. |
CIS (1) |
2005 |
DBLP DOI BibTeX RDF |
|
105 | Andrés Pérez-Uribe |
Using a Time-Delay Actor-Critic Neural Architecture with Dopamine-Like Reinforcement Signal for Learning in Autonomous Robots. |
Emergent Neural Computational Architectures Based on Neuroscience |
2001 |
DBLP DOI BibTeX RDF |
Learning robots, actor-critic architecture, TD-learning, dopamine neurons, human teaching signals, reinforcement learning, time-delay neural networks |
94 | James F. Peters |
Granular Computing in Actor-Critic Learning. |
FOCI |
2007 |
DBLP DOI BibTeX RDF |
|
86 | Francisco S. Melo, Manuel Lopes 0001 |
Fitted Natural Actor-Critic: A New Algorithm for Continuous State-Action MDPs. |
ECML/PKDD (2) |
2008 |
DBLP DOI BibTeX RDF |
|
86 | Efraín Franco Flores, Julio Waissman Vilanova, Jair García Lamont |
Learning the Filling Policy of a Biodegradation Process by Fuzzy Actor-Critic Learning Methodology. |
MICAI |
2008 |
DBLP DOI BibTeX RDF |
|
83 | James F. Peters, Christopher J. Henry, Sheela Ramanna |
Reinforcement Learning in Swarms that Learn. |
IAT |
2005 |
DBLP DOI BibTeX RDF |
|
81 | Mohammed Shahid Abdulla, Shalabh Bhatnagar |
Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes. |
Discret. Event Dyn. Syst. |
2007 |
DBLP DOI BibTeX RDF |
Actor-critic algorithms, Two timescale stochastic approximation, Simultaneous perturbation stochastic approximation, Normalized Hadamard matrices, TD-learning, Reinforcement learning, Markov decision processes, Policy iteration |
69 | Yoichiro Matsuno, Tatsuya Yamazaki, Shin Ishii |
A multi-agent reinforcement learning method for a partially-observable competitive game. |
Agents |
2001 |
DBLP DOI BibTeX RDF |
actor-critic model, competitive game, reinforcement learning, multi-agent |
65 | Shingo Mabu, Yan Chen 0008, Kotaro Hirasawa, Jinglu Hu |
Stock trading rules using genetic network programming with actor-critic. |
IEEE Congress on Evolutionary Computation |
2007 |
DBLP DOI BibTeX RDF |
|
62 | Junichiro Yoshimoto, Shin Ishii, Masa-aki Sato |
On-Line EM Reinforcement Learning. |
IJCNN (3) |
2000 |
DBLP DOI BibTeX RDF |
|
60 | Chrisantha Fernando |
Neuronal replicators solve the stability-plasticity dilemma. |
GECCO |
2010 |
DBLP DOI BibTeX RDF |
actor-critic, neuronal replicator hypothesis, robotics, reinforcement learning |
60 | Dusko Katic, Aleksandar Rodic 0001, Miomir Vukobratovic |
Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning. |
J. Intell. Robotic Syst. |
2008 |
DBLP DOI BibTeX RDF |
Biped locomotion, Integrated dynamic control, Actor-critic method, Reinforcement learning, Humanoid robots |
60 | Takashi Kuremoto, Masanao Obayashi, Kunikazu Kobayashi, Hirotaka Adachi, Kentaro Yoneda |
A Neuro-fuzzy Learning System for Adaptive Swarm Behaviors Dealing with Continuous State Space. |
ICIC (2) |
2008 |
DBLP DOI BibTeX RDF |
neuro-fuzzy net, swarm behavior, actor-critic algorithm, goal-exploration problem, multi-agent system, reinforcement learning |
60 | James F. Peters |
Toward Approximate Adaptive Learning. |
RSEISP |
2007 |
DBLP DOI BibTeX RDF |
Actor-critic, behaviour pattern, stopping time, perception, adaptive learning, approximation space |
47 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin |
Actor-Critic or Critic-Actor? A Tale of Two Time Scales. |
IEEE Control. Syst. Lett. |
2023 |
DBLP DOI BibTeX RDF |
|
47 | Prashansa Panda, Shalabh Bhatnagar |
Finite Time Analysis of Constrained Actor Critic and Constrained Natural Actor Critic Algorithms. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
47 | Shalabh Bhatnagar, Vivek S. Borkar, Soumyajit Guin |
Actor-Critic or Critic-Actor? A Tale of Two Time Scales. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
47 | Aras Dargazany |
Model-based actor-critic: GAN + DRL (actor-critic) => AGI. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
47 | Norman L. Tasfi, Miriam A. M. Capretz |
Noisy Importance Sampling Actor-Critic: An Off-Policy Actor-Critic With Experience Replay. |
IJCNN |
2020 |
DBLP DOI BibTeX RDF |
|
47 | Ala'eddin Masadeh, Zhengdao Wang, Ahmed E. Kamal 0001 |
Selector-Actor-Critic and Tuner-Actor-Critic Algorithms for Reinforcement Learning. |
WCSP |
2019 |
DBLP DOI BibTeX RDF |
|
47 | Jing Wang 0044, Ioannis Ch. Paschalidis |
An Actor-Critic Algorithm With Second-Order Actor and Critic. |
IEEE Trans. Autom. Control. |
2017 |
DBLP DOI BibTeX RDF |
|
44 | Byungchan Kim, Byungduk Kang, Shinsuk Park, Sungchul Kang |
Learning robot stiffness for contact tasks using the natural actor-critic. |
ICRA |
2008 |
DBLP DOI BibTeX RDF |
|
44 | Sertan Girgin, Philippe Preux |
Basis Expansion in Natural Actor Critic Methods. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
44 | Mohammad Ghavamzadeh, Yaakov Engel |
Bayesian actor-critic algorithms. |
ICML |
2007 |
DBLP DOI BibTeX RDF |
|
44 | Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, Tomohiro Shibata, Koh Hosoda, Shin Ishii |
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic. |
IROS |
2006 |
DBLP DOI BibTeX RDF |
|
44 | Mehdi Khamassi, Louis-Emmanuel Martinet, Agnès Guillot |
Combining Self-organizing Maps with Mixtures of Experts: Application to an Actor-Critic Model of Reinforcement Learning in the Basal Ganglia. |
SAB |
2006 |
DBLP DOI BibTeX RDF |
|
41 | Hassab Elgawi Osman |
Architecture of behavior-based and robotics self-optimizing memory controller. |
ICRA |
2009 |
DBLP DOI BibTeX RDF |
|
41 | Takashi Kuremoto, Masanao Obayashi, Kunikazu Kobayashi, Hirotaka Adachi, Kentaro Yoneda |
A reinforcement learning system for swarm behaviors. |
IJCNN |
2008 |
DBLP DOI BibTeX RDF |
|
41 | Daan Wierstra, Jürgen Schmidhuber |
Policy Gradient Critics. |
ECML |
2007 |
DBLP DOI BibTeX RDF |
|
41 | Yutaka Nakamura, Takeshi Mori, Shin Ishii |
Natural Policy Gradient Reinforcement Learning for a CPG Control of a Biped Robot. |
PPSN |
2004 |
DBLP DOI BibTeX RDF |
|
36 | Spilios Evmorfos, Athina P. Petropulu, H. Vincent Poor |
Actor-Critic Methods for IRS Design in Correlated Channel Environments: A Closer Look Into the Neural Tangent Kernel of the Critic. |
IEEE Trans. Signal Process. |
2023 |
DBLP DOI BibTeX RDF |
|
36 | Swaminathan Gurumurthy, Zachary Manchester, J. Zico Kolter |
Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning. |
L4DC |
2023 |
DBLP BibTeX RDF |
|
36 | Riazat Ryan, Ming Shao |
Critic-over-Actor-Critic Modeling: Finding Optimal Strategy in ICU Environments. |
IEEE Big Data |
2022 |
DBLP DOI BibTeX RDF |
|
36 | Gengzhi Zhang, Liang Feng 0001, Yaqing Hou |
Multi-task Actor-Critic with Knowledge Transfer via a Shared Critic. |
ACML |
2021 |
DBLP BibTeX RDF |
|
36 | Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales |
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
36 | Jiajun Fan, He Ba, Xian Guo, Jianye Hao |
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
36 | Roumeissa Kitouni, Abderrahim Kitouni, Feng Jiang 0001 |
Generalized Critic Policy Optimization: A Model For Combining Advantage Estimates In Actor Critic Methods. |
ICIP |
2020 |
DBLP DOI BibTeX RDF |
|
36 | Wei Zhou, Yiying Li, Yongxin Yang, Huaimin Wang, Timothy M. Hospedales |
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods. |
NeurIPS |
2020 |
DBLP BibTeX RDF |
|
36 | Jonathan Lebensold, William L. Hamilton, Borja Balle, Doina Precup |
Actor Critic with Differentially Private Critic. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
35 | Lin Li, Yuze Li, Wei Wei 0018, Yujia Zhang, Jiye Liang |
Multi-actor mechanism for actor-critic reinforcement learning. |
Inf. Sci. |
2023 |
DBLP DOI BibTeX RDF |
|
35 | Bo Li 0004, Shuangxia Bai, Shiyang Liang, Rui Ma, Evgeny Sergeevich Neretin, Jingyi Huang |
Manoeuvre decision-making of unmanned aerial vehicles in air combat based on an expert actor-based soft actor critic algorithm. |
CAAI Trans. Intell. Technol. |
2023 |
DBLP DOI BibTeX RDF |
|
35 | Yuhu Cheng, Longyang Huang, C. L. Philip Chen, Xuesong Wang 0001 |
Robust Actor-Critic With Relative Entropy Regulating Actor. |
IEEE Trans. Neural Networks Learn. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
35 | Thibault Lahire |
Actor Loss of Soft Actor Critic Explained. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
35 | Siddharth Mysore, Bassel El Mabsout, Renato Mancuso 0001, Kate Saenko |
Honey. I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL. |
CoG |
2021 |
DBLP DOI BibTeX RDF |
|
35 | Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
35 | Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. |
ICML |
2018 |
DBLP BibTeX RDF |
|
33 | Abdeslam Boularias, Brahim Chaib-draa |
Predictive representations for policy gradient in POMDPs. |
ICML |
2009 |
DBLP DOI BibTeX RDF |
|
33 | Qinmin Yang, Jonathan Blake Vance, Sarangapani Jagannathan |
Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks. |
IEEE Trans. Syst. Man Cybern. Part B |
2008 |
DBLP DOI BibTeX RDF |
|
33 | Wipawee Usaha, Javier A. Barria |
Reinforcement Learning for Resource Allocation in LEO Satellite Networks. |
IEEE Trans. Syst. Man Cybern. Part B |
2007 |
DBLP DOI BibTeX RDF |
|
29 | Atsushi Shimizu, Yuko Osana |
Reinforcement Learning Using Kohonen Feature Map Associative Memory with Refractoriness Based on Area Representation. |
ICONIP (2) |
2008 |
DBLP DOI BibTeX RDF |
|
29 | Dimitri Ognibene, Angelo Rega, Gianluca Baldassarre |
A Model of Reaching that Integrates Reinforcement Learning and Population Encoding of Postures. |
SAB |
2006 |
DBLP DOI BibTeX RDF |
|
24 | Hongjun Zhu, Yong Xie, Suijun Zheng |
A double Actor-Critic learning system embedding improved Monte Carlo tree search. |
Neural Comput. Appl. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Hao Li, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu 0004, Zhen-Qiu Feng, Zeng-Guang Hou |
CASOG: Conservative Actor-Critic With SmOoth Gradient for Skill Learning in Robot-Assisted Intervention. |
IEEE Trans. Ind. Electron. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Jingliang Duan, Yangang Ren, Fawang Zhang, Jie Li 0042, Shengbo Eben Li, Yang Guan, Keqiang Li |
Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-Lane Scenarios [Research Frontier] [Research Frontier]. |
IEEE Comput. Intell. Mag. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Bingyi Liu, Weizhen Han, Enshu Wang, Shengwu Xiong, Libing Wu, Qian Wang 0002, Jianping Wang 0001, Chunming Qiao |
Multi-Agent Attention Double Actor-Critic Framework for Intelligent Traffic Light Control in Urban Scenarios With Hybrid Traffic. |
IEEE Trans. Mob. Comput. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Ali Beikmohammadi, Sindri Magnússon |
Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge. |
Inf. Sci. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Amgad Abdallah Mahmoud, Nada Adel Alyan, Ahmed Elkerdawy, Shihori Tanabe, Frédéric Andrès, Andreas Pester, Hesham H. Ali |
Geom-SAC: Geometric Multi-Discrete Soft Actor Critic With Applications in De Novo Drug Design. |
IEEE Access |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Sudheer Mangalampalli, Ganesh Reddy Karri, Sachi Nandan Mohanty, Shahid Ali, Muhammad Ijaz Khan, Sherzod Sh. Abdullaev, Salman A. AlQahtani |
Multi-Objective Prioritized Task Scheduler Using Improved Asynchronous Advantage Actor Critic (a3c) Algorithm in Multi Cloud Environment. |
IEEE Access |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Dan Wu, Liming Wang, Meiyan Liang, Yunpeng Kang, Qi Jiao, Yajun Cheng, Jian Li 0010 |
UAV-Assisted Real-Time Video Transmission for Vehicles: A Soft Actor-Critic DRL Approach. |
IEEE Internet Things J. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Xiangyu Gao, Yaping Sun, Hao Chen 0013, Xiaodong Xu 0001, Shuguang Cui |
Joint Computing, Pushing, and Caching Optimization for Mobile-Edge Computing Networks via Soft Actor-Critic Learning. |
IEEE Internet Things J. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Kaiqing Bu, Yan Liu, Fuli Wang |
Adaptation Entropy Regularization Actor-Critic for Process Operation Performance Assessment. |
IEEE Trans. Ind. Informatics |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Fang Fu, Xianpeng Wei, Zhicai Zhang, Laurence T. Yang, Lin Cai 0001, Jia Luo 0003, Zhe Zhang 0010, Chenmeng Wang |
Age of Information Minimization for UAV-Assisted Internet of Things Networks: A Safe Actor-Critic With Policy Distillation Approach. |
IEEE Trans. Netw. Sci. Eng. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Mohammad Cheraghinia, Seyed Hamed Rastegar, Vahid Shah-Mansouri, Hamed Kebriaei, Kun Zhu 0001, Dusit Niyato |
Toward a Virtual Edge Service Provider: Actor-Critic Learning to Incentivize the Computation Nodes. |
IEEE Trans. Netw. Sci. Eng. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Ran Wang, Ye Tian, Kenji Kashima |
Density estimation based soft actor-critic: deep reinforcement learning for static output feedback control with measurement noise. |
Adv. Robotics |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Sicen Li, Yiming Pang, Panju Bai, Jiawei Li, Zhaojin Liu, Shihao Hu, Liquan Wang, Gang Wang |
Learning Locomotion for Quadruped Robots via Distributional Ensemble Actor-Critic. |
IEEE Robotics Autom. Lett. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Mélodie Daniel, Aly Magassouba, Miguel Aranda, Laurent Lequièvre, Juan Antonio Corrales Ramon, Roberto Iglesias Rodriguez, Youcef Mezouar |
Multi Actor-Critic DDPG for Robot Action Space Decomposition: A Framework to Control Large 3D Deformation of Soft Linear Objects. |
IEEE Robotics Autom. Lett. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Lulu Zhang, Huaguang Zhang, Xiaohui Yue, Tianbiao Wang |
Actor-Critic Optimal Control for Semi-Markovian Jump Systems With Time Delay. |
IEEE Trans. Circuits Syst. II Express Briefs |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Long Zhang 0003, Xingliang Jia, Ni Tian, Choong Seon Hong, Zhu Han 0001 |
When Visible Light Communication Meets RIS: A Soft Actor-Critic Approach. |
IEEE Wirel. Commun. Lett. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Le Thanh Tan, Martin Reisslein, Sachin Shetty |
Multi-Timescale Actor-Critic Learning for Computing Resource Management With Semi-Markov Renewal Process Mobility. |
IEEE Trans. Intell. Transp. Syst. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Lu Dong 0002, Zichen He, Chunwei Song, Xin Yuan, Haichao Zhang |
Multi-robot social-aware cooperative planning in pedestrian environments using attention-based actor-critic. |
Artif. Intell. Rev. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Jiacun Wang, GuiPeng Xi, Xiwang Guo, Shujin Qin, Henry Han |
Multi-Objective Advantage Actor-Critic Algorithm for Hybrid Disassembly Line Balancing with Multi-Skilled Workers. |
Inf. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Xiao Wang 0034, Dazi Li |
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy. |
Neurocomputing |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha |
Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Alberto Sinigaglia, Niccolò Turcato, Alberto Dalla Libera, Ruggero Carli, Gian Antonio Susto |
Exploiting Estimation Bias in Deep Double Q-Learning for Actor-Critic Methods. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Jinxuan Chen, Mustafa Özger, Cicek Cavdar |
Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Samuel Chun-Hei Lam, Justin A. Sirignano, Ziheng Wang |
Weak Convergence Analysis of Online Neural Actor-Critic Algorithms. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Yuan Lin, Xiao Liu, Zishun Zheng, Liyao Wang |
Discretionary Lane-Change Decision and Control via Parameterized Soft Actor-Critic for Hybrid Action Space. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Shengchao Yan, Lukas König, Wolfram Burgard |
Single-Agent Actor Critic for Decentralized Cooperative Driving. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Tianying Ji, Yongyuan Liang, Yan Zeng 0002, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun 0001, Huazhe Xu |
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Jiarui Wang, Mahyar Fazlyab |
Actor-Critic Physics-informed Neural Lyapunov Control. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Michal Nauman, Michal Bortkiewicz, Mateusz Ostaszewski, Piotr Milos, Tomasz Trzcinski, Marek Cygan |
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu 0003, Melih Kandemir |
Probabilistic Actor-Critic: Learning to Explore with PAC-Bayes Uncertainty. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Luca Grillotti, Maxence Faldor, Borja G. León, Antoine Cully |
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Tobias Enders, James Harrison, Maximilian Schiffer |
Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Nikhil Kumar Singh 0004, Indranil Saha |
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Honghao Wei, Xiyue Peng, Xin Liu, Arnob Ghosh |
Adversarially Trained Actor Critic for offline CMDPs. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Michal Nauman, Mateusz Ostaszewski, Marek Cygan |
A Case for Validation Buffer in Pessimistic Actor-Critic. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang 0001, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller |
Offline Actor-Critic Reinforcement Learning Scales to Large Models. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Michael Kölle 0001, Mohamad Hgog, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Stein 0001, Claudia Linnhoff-Popien |
Quantum Advantage Actor-Critic for Reinforcement Learning. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Clément Gaspard, Grégoire Passault, Mélodie Daniel, Olivier Ly |
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Hamed Rahimi Nohooji, Abolfazl Zaraki, Holger Voos |
Actor-critic learning based PID control for robotic manipulators. |
Appl. Soft Comput. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Marta Monaci, Valerio Agasucci, Giorgio Grani |
An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents. |
Eur. J. Oper. Res. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Guilherme Piêgas Koslovski, Kleiton Pereira, Paulo Roberto Albuquerque |
DAG-based workflows scheduling using Actor-Critic Deep Reinforcement Learning. |
Future Gener. Comput. Syst. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Xiaohong Nian, MengMeng Li, Haibo Wang, Yalei Gong, Hongyun Xiong |
Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm. |
Appl. Intell. |
2024 |
DBLP DOI BibTeX RDF |
|
24 | Wei Li, Si Li, Huaguang Shi, Wenhao Yan, Yi Zhou 0004 |
UAV-enabled fair offloading for MEC networks: a DRL approach based on actor-critic parallel architecture. |
Appl. Intell. |
2024 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 1303 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ >>] |
|