Results
Found 311 publication records. Showing 311 according to the selection in the facets
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
105 | Peter W. O'Hearn, Noam Rinetzky, Martin T. Vechev, Eran Yahav, Greta Yorsh |
Verifying linearizability with hindsight. |
PODC |
2010 |
DBLP DOI BibTeX RDF |
hindsight, linearizability, wait-freedom, optimistic concurrency |
49 | Sham M. Kakade, Adam Tauman Kalai, Katrina Ligett |
Playing games with approximation algorithms. |
STOC |
2007 |
DBLP DOI BibTeX RDF |
online linear optimization, approximation algorithms, regret minimization |
49 | Shie Mannor, John N. Tsitsiklis |
Online Learning with Constraints. |
COLT |
2006 |
DBLP DOI BibTeX RDF |
|
49 | Avrim Blum, Adam Kalai, Jon M. Kleinberg |
Admission Control to Minimize Rejections. |
WADS |
2001 |
DBLP DOI BibTeX RDF |
|
49 | Rüdiger Pohl, Oliver Hardt, Markus Eisenhauer |
SARA - Ein kognitives Prozessmodell zur Erklaerung von Ankereffekt und Rueckschaufehler. |
Kognitionswissenschaft |
2000 |
DBLP DOI BibTeX RDF |
|
39 | Matthew Welsh |
Overconfident in Hindsight: Memory, Hindsight Bias and Overconfidence. |
CogSci |
2020 |
DBLP BibTeX RDF |
|
39 | Renzo Roel P. Tan, Kazushi Ikeda, John Paul C. Vergara |
Hindsight-Combined and Hindsight-Prioritized Experience Replay. |
ICONIP (2) |
2020 |
DBLP DOI BibTeX RDF |
|
33 | Rani Yaroshinsky, Ran El-Yaniv, Steven S. Seiden |
How to Better Use Expert Advice. |
Mach. Learn. |
2004 |
DBLP DOI BibTeX RDF |
online prediction, learning from expert advice, online learning |
33 | Avrim Blum, Shuchi Chawla 0001, Adam Kalai |
Static Optimality and Dynamic Search-Optimality in Lists and Trees. |
Algorithmica |
2003 |
DBLP DOI BibTeX RDF |
Adaptive data structures, Experts Analysis, Competitive Analysis, Binary search trees |
33 | Avrim Blum, Shuchi Chawla 0001, Adam Kalai |
Static optimality and dynamic search-optimality in lists and trees. |
SODA |
2002 |
DBLP BibTeX RDF |
|
19 | Natasha Butt, Blazej Manczak, Auke Wiggers, Corrado Rainone, David W. Zhang, Michaël Defferrard, Taco Cohen |
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
19 | Michael Lanier, Ying Xu, Nathan Jacobs, Chongjie Zhang, Yevgeniy Vorobeychik |
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
19 | Tonghe Zhang, Yu Chen, Longbo Huang |
Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
19 | Erdi Sayar, Vladislav Vintaykin, Giovanni Iacca, Alois Knoll |
Hindsight Experience Replay with Evolutionary Decision Trees for Curriculum Goal Generation. |
EvoApplications@EvoStar |
2024 |
DBLP DOI BibTeX RDF |
|
19 | Taeyoung Kim, Dongsoo Har |
Cluster-Based Sampling in Hindsight Experience Replay for Robotic Tasks (Student Abstract). |
AAAI |
2024 |
DBLP DOI BibTeX RDF |
|
19 | Roland Meyer 0001, Thomas Wies, Sebastian Wolff 0001 |
Embedding Hindsight Reasoning in Separation Logic. |
Proc. ACM Program. Lang. |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Chenjia Bai, Lingxiao Wang 0003, Yixin Wang, Zhaoran Wang 0001, Rui Zhao, Chenyao Bai, Peng Liu 0008 |
Addressing Hindsight Bias in Multigoal Reinforcement Learning. |
IEEE Trans. Cybern. |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Wendong Xiao, Liang Yuan, Teng Ran, Li He, Jianbo Zhang, Jianping Cui |
Multimodal fusion for autonomous navigation via deep reinforcement learning with sparse rewards and hindsight experience replay. |
Displays |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Keting Lu, Yan Cao, Xiaoping Chen, Shiqi Zhang 0001 |
Efficient Dialog Policy Learning With Hindsight, User Modeling, and Adaptation. |
IEEE Trans. Cogn. Dev. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Xudong Yu, Chenjia Bai, Changhong Wang, Dengxiu Yu, C. L. Philip Chen, Zhen Wang 0004 |
Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling. |
IEEE Trans. Syst. Man Cybern. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Yongle Luo, Yuxin Wang, Kun Dong, Qiang Zhang, Erkang Cheng, Zhiyong Sun, Bo Song |
Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards. |
Neurocomputing |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Matias Alvo, Daniel Russo 0001, Yash Kanoria |
Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Stephen J. Redmond, Noel E. O'Connor |
Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Jonathan N. Lee, Alekh Agarwal, Christoph Dann, Tong Zhang 0001 |
Learning in POMDPs is Sample-Efficient with Hindsight Observability. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Akash Velu, Skanda Vaidyanath, Dilip Arumugam |
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Kevin Denamganaï, Daniel Hernández 0008, Ozan Vardal, Sondess Missaoui, James Alfred Walker |
ETHER: Aligning Emergent Communication for Hindsight Experience Replay. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Jiayuan Gu, Sean Kirmani, Paul Wohlhart, Yao Lu 0006, Montserrat Gonzalez Arenas, Kanishka Rao, Wenhao Yu 0003, Chuyuan Fu, Keerthana Gopalakrishnan, Zhuo Xu, Priya Sundaresan, Peng Xu, Hao Su, Karol Hausman, Chelsea Finn, Quan Vuong, Ted Xiao |
RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Simon Guist, Jan Schneider, Alexander Dittrich, Vincent Berenz, Bernhard Schölkopf, Dieter Büchler |
Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Yuming Huang, Bin Ren |
RoMo-HER: Robust Model-based Hindsight Experience Replay. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Ming Shi, Yingbin Liang, Ness B. Shroff |
Theoretical Hardness and Tractability of POMDPs in RL with Partial Hindsight State Information. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Kibeom Kim, Kisung Shin, Min Whoo Lee, Moonhoen Lee, Minsu Lee, Byoung-Tak Zhang |
Visual Hindsight Self-Imitation Learning for Interactive Navigation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Hao Liu 0055, Pieter Abbeel |
Emergent Agentic Transformer from Chain of Hindsight Experience. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Sicen Li, Yiming Pang, Panju Bai, Zhaojin Liu, Jiawei Li, Shihao Hu, Liquan Wang, Gang Wang |
Learning Agility and Adaptive Legged Locomotion via Curricular Hindsight Reinforcement Learning. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Rolando Garcia, Anusha Dandamudi, Gabriel Matute, Lehan Wan, Joseph Gonzalez 0001, Joseph M. Hellerstein, Koushik Sen |
Multiversion Hindsight Logging for Continuous Training. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Erdi Sayar, Zhenshan Bing, Carlo D'Eramo, Ozgur S. Oguz, Alois Knoll |
Contact Energy Based Hindsight Experience Prioritization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez |
The Wisdom of Hindsight Makes Language Models Better Instruction Followers. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai 0017 |
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Eli Gafni, Giuliano Losa |
Time is not a Healer, but it Sure Makes Hindsight 20:20. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Hao Liu 0055, Carmelo Sferrazza, Pieter Abbeel |
Chain of Hindsight Aligns Language Models with Feedback. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Xiaotong Zhao, Jingli Du, Zhihan Wang |
HCS-R-HER: Hierarchical reinforcement learning based on cross subtasks rainbow hindsight experience replay. |
J. Comput. Sci. |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Myoung-Hoon Lee, Jun Moon |
Deep reinforcement learning-based model-free path planning and collision avoidance for UAVs: A soft actor-critic with hindsight experience replay approach. |
ICT Express |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Zhenshan Bing, Erick Álvarez, Long Cheng 0007, Fabrice O. Morin, Rui Li 0077, Xiaojie Su, Kai Huang 0001, Alois Knoll |
Robotic Manipulation in Dynamic Scenarios via Bounding-Box-Based Hindsight Goal Generation. |
IEEE Trans. Neural Networks Learn. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Eli Gafni, Giuliano Losa |
Invited Paper: Time Is Not a Healer, but It Sure Makes Hindsight 20:20. |
SSS |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Rafal Muszynski, Hiep Luong |
Helios: Hyperspectral Hindsight Ostracker. |
WHISPERS |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Simon Guist, Jan Schneider, Vincent Berenz, Alexander Dittrich, Bernhard Schölkopf, Dieter Büchler |
Hindsight States: Blending Sim & Real Task Elements for Efficient Reinforcement Learning. |
Robotics: Science and Systems |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Lei Zhang, Zhiqiang Xie, Vaastav Anand, Ymir Vigfusson, Jonathan Mace |
The Benefit of Hindsight: Tracing Edge-Cases in Distributed Systems. |
NSDI |
2023 |
DBLP BibTeX RDF |
|
19 | Khaleel Alarabi, Firas Tayseer Mohammad Ayasrah, Mohannad Alkhalaileh, Ibtehal Mahmoud Aburezeq, Hanan Shaher Almarashdi, Mohammad Aleassa |
Students' Hindsight Judgments on Physics Education during the COVID-19 Pandemic. |
SNAMS |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Hao Liu 0055, Pieter Abbeel |
Emergent Agentic Transformer from Chain of Hindsight Experience. |
ICML |
2023 |
DBLP BibTeX RDF |
|
19 | Sean R. Sinclair, Felipe Vieira Frujeri, Ching-An Cheng, Luke Marshall, Hugo De Oliveira Barbalho, Jingling Li, Jennifer Neville, Ishai Menache, Adith Swaminathan |
Hindsight Learning for MDPs with Exogenous Inputs. |
ICML |
2023 |
DBLP BibTeX RDF |
|
19 | Jonathan Lee 0002, Alekh Agarwal, Christoph Dann, Tong Zhang 0001 |
Learning in POMDPs is Sample-Efficient with Hindsight Observability. |
ICML |
2023 |
DBLP BibTeX RDF |
|
19 | Daniel Jarrett, Corentin Tallec, Florent Altché, Thomas Mesnard, Rémi Munos, Michal Valko |
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments. |
ICML |
2023 |
DBLP BibTeX RDF |
|
19 | Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez |
The Wisdom of Hindsight Makes Language Models Better Instruction Followers. |
ICML |
2023 |
DBLP BibTeX RDF |
|
19 | Qi Deng, Guangqing Liu, Ruyang Li, Qifu Hu, Yaqian Zhao, Rengang Li |
End-to-End Urban Autonomous Navigation with Decision Hindsight. |
ICONIP (15) |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Yoonsu Jang, Jongchan Baek, Soohee Han |
Hindsight Intermediate Targets for Mapless Navigation With Deep Reinforcement Learning. |
IEEE Trans. Ind. Electron. |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Luiz Felipe Vecchietti, Minah Seo, Dongsoo Har |
Sampling Rate Decay in Hindsight Experience Replay for Robot Control. |
IEEE Trans. Cybern. |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Taeyoung Kim, Dongsoo Har |
Cluster-based Sampling in Hindsight Experience Replay for Robot Control. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Adarsh Sehgal, Muskan Sehgal, Hung Manh La |
AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Yurong You, Katie Z. Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger |
Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Yongle Luo, Yuxin Wang, Kun Dong, Qiang Zhang, Erkang Cheng, Zhiyong Sun, Bo Song |
Relay Hindsight Experience Replay: Continual Reinforcement Learning for Robot Manipulation Tasks with Sparse Rewards. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | |
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Lei Zhang, Vaastav Anand, Zhiqiang Xie, Ymir Vigfusson, Jonathan Mace |
The Benefit of Hindsight: Tracing Edge-Cases in Distributed Systems. |
CoRR |
2022 |
DBLP BibTeX RDF |
|
19 | Lunjun Zhang, Bradly C. Stadie |
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Mudit Verma, Katherine Metcalf |
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Daniel Jarrett, Corentin Tallec, Florent Altché, Thomas Mesnard, Rémi Munos, Michal Valko |
Curiosity in hindsight. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Sean R. Sinclair, Felipe Frujeri, Ching-An Cheng, Adith Swaminathan |
Hindsight Learning for MDPs with Exogenous Inputs. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Roland Meyer 0001, Thomas Wies, Sebastian Wolff 0001 |
Embedding Hindsight Reasoning in Separation Logic. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Liam Schramm, Yunfu Deng, Edgar Granados, Abdeslam Boularias |
USHER: Unbiased Sampling for Hindsight Experience Replay. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Frank Röder, Manfred Eppe, Stefan Wermter |
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Chris Lengerich, Benjamin J. Lengerich |
Executive Function: A Contrastive Value Policy for Resampling and Relabeling Perceptions via Hindsight Summarization? |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Chengjing Li, Li Wang 0014, Zirong Huang |
Hindsight-aware deep reinforcement learning algorithm for multi-agent systems. |
Int. J. Mach. Learn. Cybern. |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Binyamin Manela, Armin Biess |
Curriculum learning with Hindsight Experience Replay for sequential object manipulation tasks. |
Neural Networks |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Jian Peng, Yi Yuan |
Moving Object Grasping Method of Mechanical Arm Based on Deep Deterministic Policy Gradient and Hindsight Experience Replay. |
J. Adv. Comput. Intell. Intell. Informatics |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Zhenshan Bing, Matthias Brucker, Fabrice O. Morin, Rui Li 0077, Xiaojie Su, Kai Huang 0001, Alois C. Knoll |
Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation. |
IEEE Trans. Neural Networks Learn. Syst. |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Sangmin Lee 0008, Seonghun Kim, Hanna Lee, Youdan Kim |
Quadrotor Attitude Control Based on Deep Deterministic Policy Gradient with Hindsight Experience Replay. |
ASCC |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Liam Schramm, Yunfu Deng, Edgar Granados, Abdeslam Boularias |
USHER: Unbiased Sampling for Hindsight Experience Replay. |
CoRL |
2022 |
DBLP BibTeX RDF |
|
19 | Frank Röder, Manfred Eppe, Stefan Wermter |
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics. |
ICDL |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Salaar Liaqat, Daniyal Liaqat, Tatiana Son, Andrea Gershon, Moshe Gabel, Robert Wu, Eyal de Lara |
Hindsight is 20/20: Retrospective Lessons for Conducting Longitudinal Wearable Sensing Studies. |
PerCom Workshops |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Kenny Young |
Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units. |
AAAI |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Bhargavi Krishnamurthy, Sajjan G. Shiva |
Scalable Hindsight Experience Replay based Q-learning Framework with Explainability for Big Data Applications in Fog Computing. |
CCWC |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Todor Davchev, Oleg Olegovich Sushkov, Jean-Baptiste Regli, Stefan Schaal, Yusuf Aytar, Markus Wulfmeier, Jon Scholz |
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
19 | Ashwin Paranjape, Omar Khattab, Christopher Potts, Matei Zaharia, Christopher D. Manning |
Hindsight: Posterior-guided training of retrievers for improved open-ended generation. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
19 | Yurong You, Katie Z. Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger |
Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
19 | Lorenzo Moro, Amarildo Likmeta, Enrico Prati, Marcello Restelli |
Goal-Directed Planning via Hindsight Experience Replay. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
19 | Michael Wan, Jian Peng 0001, Tanmay Gangwani |
Hindsight Foresight Relabeling for Meta-Reinforcement Learning. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
19 | Hiroki Furuta, Yutaka Matsuo, Shixiang Shane Gu |
Generalized Decision Transformer for Offline Hindsight Information Matching. |
ICLR |
2022 |
DBLP BibTeX RDF |
|
19 | Eser Aygün, Ankit Anand, Laurent Orseau, Xavier Glorot, Stephen Marcus McAleer, Vlad Firoiu, Lei M. Zhang, Doina Precup, Shibl Mourad |
Proving Theorems using Incremental Learning and Hindsight Experience Replay. |
ICML |
2022 |
DBLP BibTeX RDF |
|
19 | Carmen Cheh, Nicholas Tay, Binbin Chen 0001 |
From Hindsight to Foresight: Enhancing Design Artifacts for Business Logic Flaw Discovery. |
ACSAC |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Mengxuan Shao, Feng Jiang, Shaohui Liu, Kun Han, Debin Zhao |
Hindsight Balanced Reward Shaping. |
ICONIP (5) |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Mengxuan Shao, Feng Jiang 0001, Shaohui Liu, Kun Han, Debin Zhao |
Learning from Hindsight Demonstrations. |
ICONIP (5) |
2022 |
DBLP DOI BibTeX RDF |
|
19 | Swati Gupta 0001, Vijay Kamble |
Individual Fairness in Hindsight. |
J. Mach. Learn. Res. |
2021 |
DBLP BibTeX RDF |
|
19 | Tung Minh Luu, Chang D. Yoo |
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment. |
IEEE Access |
2021 |
DBLP DOI BibTeX RDF |
|
19 | Paulo E. Rauber, Avinash Ummadisingu, Filipe Mutz, Jürgen Schmidhuber |
Reinforcement Learning in Sparse-Reward Environments With Hindsight Policy Gradients. |
Neural Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
19 | Binyamin Manela, Armin Biess |
Bias-reduced hindsight experience replay with virtual goal prioritization. |
Neurocomputing |
2021 |
DBLP DOI BibTeX RDF |
|
19 | Muhammad AbdurRafae |
HindSight: A Graph-Based Vision Model Architecture For Representing Part-Whole Hierarchies. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
19 | Marios Fournarakis, Markus Nagel |
In-Hindsight Quantization Range Estimation for Quantized Training. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
19 | Jiaming Guo, Rui Zhang 0040, Xishan Zhang, Shaohui Peng, Qi Yi, Zidong Du, Xing Hu 0001, Qi Guo 0001, Yunji Chen |
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
19 | Charles Packer, Pieter Abbeel, Joseph E. Gonzalez |
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
19 | Jing Li 0009, Yuangang Pan, Ivor W. Tsang |
Taming Overconfident Prediction on Unlabeled Data from Hindsight. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
19 | Sindre Benjamin Remman, Anastasios M. Lekkas |
Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
19 | Hiroki Furuta, Yutaka Matsuo, Shixiang Shane Gu |
Generalized Decision Transformer for Offline Hindsight Information Matching. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
Displaying results #1 - #100 of 311 (100 per page).