FacetedDBLP

Searching for phrase Dyna-Q (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
2000-2018 (18) 2019-2023 (15) 2024 (2)
Publication types (Num. hits)
article(22) inproceedings(13)

Results
Found 35 publication records. Showing 35 according to the selection in the facets.
Hits | Authors | Title | Venue | Year | Author keywords
163 | Gerhard Weiß 0001 | A Multiagent Variant of Dyna-Q. | ICMAS | 2000 |
160 | Gerhard Weiß 0001 | An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning. | ATAL | 2000 |
96 | Roman Zajdel | Epoch-Incremental Queue-Dyna Algorithm. | ICAISC | 2008 | Dyna-Q, prioritized sweeping, reinforcement learning
46 | Tarek Faycal, Claudio Zito | Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees. | CoRR | 2022 |
23 | Xuecheng Niu, Akinori Ito, Takashi Nose | Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. | IEEE Access | 2024 |
23 | Xuecheng Niu, Akinori Ito, Takashi Nose | Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. | CoRR | 2024 |
23 | Silvia Del Giorno, Federico D'Antoni, Vincenzo Piemonte, Mario Merone | A New Glycemic closed-loop control based on Dyna-Q for Type-1-Diabetes. | Biomed. Signal Process. Control. | 2023 |
23 | Cheng Fan 0003, Zhaohui Wang, Kaichen Yang | Energy-efficient underwater acoustic communication based on Dyna-Q with an adaptive action space. | Phys. Commun. | 2023 |
23 | Yubin Fu, Xiaochuan Ma, Chao Feng, Xingxuan Pei, Pengzhuo Li | Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar. | EURASIP J. Adv. Signal Process. | 2023 |
23 | Muleilan Pei, Hao An, Bo Liu 0066, Changhong Wang | An Improved Dyna-Q Algorithm for Mobile Robot Path Planning in Unknown Dynamic Environment. | IEEE Trans. Syst. Man Cybern. Syst. | 2022 |
23 | Marcos Maroto-Gómez, Rodrigo González, Álvaro Castro González, María Malfaz, Miguel Ángel Salichs | Speeding-Up Action Learning in a Social Robot With Dyna-Q+: A Bioinspired Probabilistic Model Approach. | IEEE Access | 2021 |
23 | Rui Zhang 0046, Zhenyu Wang 0001, Mengdan Zheng, Yangyang Zhao, Zhenhua Huang 0002 | Emotion-sensitive deep dyna-Q learning for task-completion dialogue policy learning. | Neurocomputing | 2021 |
23 | Yuan Chai, Xiao-Jun Zeng | A multi-objective Dyna-Q based routing in wireless mesh network. | Appl. Soft Comput. | 2021 |
23 | Guanlin Wu, Wenqi Fang, Ji Wang 0002, Jiang Cao, Weidong Bao, Yang Ping, Xiaomin Zhu 0001, Zheng Wang | Gaussian Process based Deep Dyna-Q approach for Dialogue Policy Learning. | ACL/IJCNLP (Findings) | 2021 |
23 | Fan Wang, Jie Gao 0002, Mushu Li, Lian Zhao | Autonomous PEV Charging Scheduling Using Dyna-Q Reinforcement Learning. | IEEE Trans. Veh. Technol. | 2020 |
23 | Yohei Hayamizu, Saeid Amiri, Kishan Chandan, Shiqi Zhang 0001, Keiki Takadama | Guided Dyna-Q for Mobile Robot Exploration and Navigation. | CoRR | 2020 |
23 | Yangyang Zhao, Zhenyu Wang 0001, Kai Yin, Rui Zhang 0046, Zhenhua Huang 0002, Pei Wang | Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments. | AAAI | 2020 |
23 | Lixin Zou, Long Xia, Pan Du 0001, Zhuo Zhang, Ting Bai, Weidong Liu 0001, Jian-Yun Nie, Dawei Yin | Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation. | WSDM | 2020 |
23 | Zhiwei Li 0003, Yu Lu, Yun Shi, Zengguang Wang, Wenxin Qiao, Yicen Liu | A Dyna-Q-Based Solution for UAV Networks Against Smart Jamming Attacks. | Symmetry | 2019 |
23 | Yuexin Wu, Xiujun Li, Jingjing Liu 0001, Jianfeng Gao 0001, Yiming Yang | Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. | AAAI | 2019 |
23 | Haobin Shi, Shike Yang, Kao-Shing Hwang, Jialin Chen, Mengkai Hu, Heng-sheng Zhang | A Sample Aggregation Approach to Experiences Replay of Dyna-Q Learning. | IEEE Access | 2018 |
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen, Iris Hwang | Model Learning for Multistep Backward Prediction in Dyna-Q Learning. | IEEE Trans. Syst. Man Cybern. Syst. | 2018 |
23 | Shang-Yu Su, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Yun-Nung Chen | Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning. | CoRR | 2018 |
23 | Yuexin Wu, Xiujun Li, Jingjing Liu 0001, Jianfeng Gao 0001, Yiming Yang | Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. | CoRR | 2018 |
23 | Emanuele Vitolo, Alberto San Miguel, Javier Civera 0001, Cristian Mahulea | Performance Evaluation of the Dyna-Q algorithm for Robot Navigation. | CASE | 2018 |
23 | Shang-Yu Su, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Yun-Nung Chen | Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning. | EMNLP | 2018 |
23 | Baolin Peng, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Kam-Fai Wong | Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. | ACL (1) | 2018 |
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen | Pheromone-Based Planning Strategies in Dyna-Q Learning. | IEEE Trans. Ind. Informatics | 2017 |
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen | Model Learning and Knowledge Sharing for a Multiagent System With Dyna-Q Learning. | IEEE Trans. Cybern. | 2015 |
23 | Yi-Jia Tseng, Kao-Shing Hwang, Wei-Cheng Jiang, Tsung-Chuan Huang, Song-Shyong Chen | An Improved Dyna-Q Algorithm Based in Reverse Model Learning. | ICSSE | 2015 |
23 | Shao-Ming Hung, Sidney Nascimento Givigi, Aboelmagd Noureldin | A Dyna-Q (Lambda) Approach to Flocking with Fixed-Wing UAVs in a Stochastic Environment. | SMC | 2015 |
23 | Hoang Huu Viet, Sang Hyeok An, TaeChoong Chung | Dyna-Q-based vector direction for path planning problem of autonomous mobile robots in unknown environments. | Adv. Robotics | 2013 |
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen | Adaptive Model Learning Based on Dyna-Q Learning. | Cybern. Syst. | 2013 |
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen, Wei-Han Wang | Model-Based Indirect Learning Method Based on Dyna-Q Architecture. | SMC | 2013 |
20 | Arthur Plínio de S. Braga, Aluízio F. R. Araújo | A topological reinforcement learning agent for navigation. | Neural Comput. Appl. | 2003 | Latent learning, Neural networks, Navigation, Reinforcement learning, Topological maps
Displaying results #1 - #35 of 35 (100 per page)
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Open data: released under the ODC-BY 1.0 license.