|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 5 occurrences of 4 keywords
|
|
|
Results
Found 35 publication records. Showing 35 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
163 | Gerhard Weiß 0001 |
A Multiagent Variant of Dyna-Q. |
ICMAS |
2000 |
DBLP DOI BibTeX RDF |
|
160 | Gerhard Weiß 0001 |
An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning. |
ATAL |
2000 |
DBLP DOI BibTeX RDF |
|
96 | Roman Zajdel |
Epoch-Incremental Queue-Dyna Algorithm. |
ICAISC |
2008 |
DBLP DOI BibTeX RDF |
Dyna-Q, prioritized sweeping, reinforcement learning |
46 | Tarek Faycal, Claudio Zito |
Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees. |
CoRR |
2022 |
DBLP BibTeX RDF |
|
23 | Xuecheng Niu, Akinori Ito, Takashi Nose |
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. |
IEEE Access |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Xuecheng Niu, Akinori Ito, Takashi Nose |
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
23 | Silvia Del Giorno, Federico D'Antoni, Vincenzo Piemonte, Mario Merone |
A New Glycemic closed-loop control based on Dyna-Q for Type-1-Diabetes. |
Biomed. Signal Process. Control. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Cheng Fan 0003, Zhaohui Wang, Kaichen Yang |
Energy-efficient underwater acoustic communication based on Dyna-Q with an adaptive action space. |
Phys. Commun. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Yubin Fu, Xiaochuan Ma, Chao Feng, Xingxuan Pei, Pengzhuo Li |
Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar. |
EURASIP J. Adv. Signal Process. |
2023 |
DBLP DOI BibTeX RDF |
|
23 | Muleilan Pei, Hao An, Bo Liu 0066, Changhong Wang |
An Improved Dyna-Q Algorithm for Mobile Robot Path Planning in Unknown Dynamic Environment. |
IEEE Trans. Syst. Man Cybern. Syst. |
2022 |
DBLP DOI BibTeX RDF |
|
23 | Marcos Maroto-Gómez, Rodrigo González, Álvaro Castro González, María Malfaz, Miguel Ángel Salichs |
Speeding-Up Action Learning in a Social Robot With Dyna-Q+: A Bioinspired Probabilistic Model Approach. |
IEEE Access |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Rui Zhang 0046, Zhenyu Wang 0001, Mengdan Zheng, Yangyang Zhao, Zhenhua Huang 0002 |
Emotion-sensitive deep dyna-Q learning for task-completion dialogue policy learning. |
Neurocomputing |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Yuan Chai, Xiao-Jun Zeng |
A multi-objective Dyna-Q based routing in wireless mesh network. |
Appl. Soft Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Guanlin Wu, Wenqi Fang, Ji Wang 0002, Jiang Cao, Weidong Bao, Yang Ping, Xiaomin Zhu 0001, Zheng Wang |
Gaussian Process based Deep Dyna-Q approach for Dialogue Policy Learning. |
ACL/IJCNLP (Findings) |
2021 |
DBLP DOI BibTeX RDF |
|
23 | Fan Wang, Jie Gao 0002, Mushu Li, Lian Zhao |
Autonomous PEV Charging Scheduling Using Dyna-Q Reinforcement Learning. |
IEEE Trans. Veh. Technol. |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Yohei Hayamizu, Saeid Amiri, Kishan Chandan, Shiqi Zhang 0001, Keiki Takadama |
Guided Dyna-Q for Mobile Robot Exploration and Navigation. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
23 | Yangyang Zhao, Zhenyu Wang 0001, Kai Yin, Rui Zhang 0046, Zhenhua Huang 0002, Pei Wang |
Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments. |
AAAI |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Lixin Zou, Long Xia, Pan Du 0001, Zhuo Zhang, Ting Bai, Weidong Liu 0001, Jian-Yun Nie, Dawei Yin |
Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation. |
WSDM |
2020 |
DBLP DOI BibTeX RDF |
|
23 | Zhiwei Li 0003, Yu Lu, Yun Shi, Zengguang Wang, Wenxin Qiao, Yicen Liu |
A Dyna-Q-Based Solution for UAV Networks Against Smart Jamming Attacks. |
Symmetry |
2019 |
DBLP DOI BibTeX RDF |
|
23 | Yuexin Wu, Xiujun Li, Jingjing Liu 0001, Jianfeng Gao 0001, Yiming Yang |
Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. |
AAAI |
2019 |
DBLP DOI BibTeX RDF |
|
23 | Haobin Shi, Shike Yang, Kao-Shing Hwang, Jialin Chen, Mengkai Hu, Heng-sheng Zhang |
A Sample Aggregation Approach to Experiences Replay of Dyna-Q Learning. |
IEEE Access |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen, Iris Hwang |
Model Learning for Multistep Backward Prediction in Dyna-Q Learning. |
IEEE Trans. Syst. Man Cybern. Syst. |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Shang-Yu Su, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Yun-Nung Chen |
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
23 | Yuexin Wu, Xiujun Li, Jingjing Liu 0001, Jianfeng Gao 0001, Yiming Yang |
Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
23 | Emanuele Vitolo, Alberto San Miguel, Javier Civera 0001, Cristian Mahulea |
Performance Evaluation of the Dyna-Q algorithm for Robot Navigation. |
CASE |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Shang-Yu Su, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Yun-Nung Chen |
Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning. |
EMNLP |
2018 |
DBLP DOI BibTeX RDF |
|
23 | Baolin Peng, Xiujun Li, Jianfeng Gao 0001, Jingjing Liu 0001, Kam-Fai Wong |
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. |
ACL (1) |
2018 |
DBLP BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen |
Pheromone-Based Planning Strategies in Dyna-Q Learning. |
IEEE Trans. Ind. Informatics |
2017 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen |
Model Learning and Knowledge Sharing for a Multiagent System With Dyna-Q Learning. |
IEEE Trans. Cybern. |
2015 |
DBLP DOI BibTeX RDF |
|
23 | Yi-Jia Tseng, Kao-Shing Hwang, Wei-Cheng Jiang, Tsung-Chuan Huang, Song-Shyong Chen |
An Improved Dyna-Q Algorithm Based in Reverse Model Learning. |
ICSSE |
2015 |
DBLP DOI BibTeX RDF |
|
23 | Shao-Ming Hung, Sidney Nascimento Givigi, Aboelmagd Noureldin |
A Dyna-Q (Lambda) Approach to Flocking with Fixed-Wing UAVs in a Stochastic Environment. |
SMC |
2015 |
DBLP DOI BibTeX RDF |
|
23 | Hoang Huu Viet, Sang Hyeok An, TaeChoong Chung |
Dyna-Q-based vector direction for path planning problem of autonomous mobile robots in unknown environments. |
Adv. Robotics |
2013 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen |
Adaptive Model Learning Based on Dyna-Q Learning. |
Cybern. Syst. |
2013 |
DBLP DOI BibTeX RDF |
|
23 | Kao-Shing Hwang, Wei-Cheng Jiang, Yu-Jen Chen, Wei-Han Wang |
Model-Based Indirect Learning Method Based on Dyna-Q Architecture. |
SMC |
2013 |
DBLP DOI BibTeX RDF |
|
20 | Arthur Plínio de S. Braga, Aluízio F. R. Araújo |
A topological reinforcement learning agent for navigation. |
Neural Comput. Appl. |
2003 |
DBLP DOI BibTeX RDF |
Latent learning, Neural networks, Navigation, Reinforcement learning, Topological maps |
Displaying result #1 - #35 of 35 (100 per page; Change: )
|
|