Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
96 | Ranjit Nair, Milind Tambe |
Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach. |
PROMAS |
2004 |
DBLP DOI BibTeX RDF |
|
77 | Arsalan Farrokh, Vikram Krishnamurthy |
Optimal Threshold Policies for Operation of a Dedicated-Platform with Imperfect State Information - A POMDP Framework. |
ECSQARU |
2005 |
DBLP DOI BibTeX RDF |
Partially Observable Markov Decision Process (POMDP), optimal threshold policies, Bellman equation, two-state Markov chain, overlook probability, dynamic programming, Hidden Markov Model (HMM), sufficient statistics, optimal search |
70 | Xin Li 0033, William K. Cheung 0001, Jiming Liu 0001 |
On Compressibility and Acceleration of Orthogonal NMF for POMDP Compression. |
ACML |
2009 |
DBLP DOI BibTeX RDF |
|
68 | Pradeep Varakantham, Ranjit Nair, Milind Tambe, Makoto Yokoo |
Winning back the CUP for distributed POMDPs: planning over continuous belief spaces. |
AAMAS |
2006 |
DBLP DOI BibTeX RDF |
continuous initial beliefs, distributed POMDP, partially observable Markov decision process (POMDP), multi-agent systems |
66 | Martijn C. Schut, Michael J. Wooldridge, Simon Parsons |
On Partially Observable MDPs and BDI Models. |
Foundations and Applications of Multi-Agent Systems |
2002 |
DBLP DOI BibTeX RDF |
|
64 | Baris Eker, H. Levent Akin |
Using evolution strategies to solve DEC-POMDP problems. |
Soft Comput. |
2010 |
DBLP DOI BibTeX RDF |
DEC-POMDP, Multi-robot decision making, Evolution control, Evolution strategies |
64 | Qi Feng, Xuezhong Zhou, Houkuan Huang, Xiaoping Zhang |
An Uncertainty-Based Belief Selection Method for POMDP Value Iteration. |
ECSQARU |
2009 |
DBLP DOI BibTeX RDF |
value iteration, point-based algorithm, belief selection, uncertainty, POMDP |
60 | Sven R. Schmidt-Rohr, Steffen Knoop, Martin Lösch, Rüdiger Dillmann |
Reasoning for a multi-modal service robot considering uncertainty in human-robot interaction. |
HRI |
2008 |
DBLP DOI BibTeX RDF |
robot decision making, pomdp, HRI |
60 | Jeongsoo Han |
Network-Adaptive QoS Routing Using Local Information. |
APNOMS |
2006 |
DBLP DOI BibTeX RDF |
Localized Adaptive QoS Routing, Exploration Bonus, Certainty Equivalency Approximation, Edge-disjoint multi-path, Reinforcement Learning, POMDP |
60 | Shin Ishii, Hajime Fujita 0001, Masaoki Mitsutake, Tatsuya Yamazaki, Jun Matsuda, Yoichiro Matsuno |
A Reinforcement Learning Scheme for a Partially-Observable Multi-Agent Game. |
Mach. Learn. |
2005 |
DBLP DOI BibTeX RDF |
multi-agent system, reinforcement learning, model-based, POMDP, card game |
60 | Pradeep Varakantham, Rajiv T. Maheswaran, Milind Tambe |
Exploiting belief bounds: practical POMDPs for personal assistant agents. |
AAMAS |
2005 |
DBLP DOI BibTeX RDF |
meeting rescheduling, partially observable markov decision process (POMDP), task allocation |
60 | Milind Tambe, Emma Bowring, Hyuckchul Jung, Gal A. Kaminka, Rajiv T. Maheswaran, Janusz Marecki, Pragnesh Jay Modi, Ranjit Nair, Stephen Okamoto, Jonathan P. Pearce, Praveen Paruchuri, David V. Pynadath, Paul Scerri, Nathan Schurr, Pradeep Varakantham |
Conflicts in teamwork: hybrids to the rescue. |
AAMAS |
2005 |
DBLP DOI BibTeX RDF |
game theory, BDI, POMDP, DCOP |
56 | David Hsu, Wee Sun Lee, Nan Rong |
A point-based POMDP planner for target tracking. |
ICRA |
2008 |
DBLP DOI BibTeX RDF |
|
56 | Xin Li 0033, William K. Cheung 0001, Jiming Liu 0001 |
Integrating Value-Directed Compression and Belief Space Analysis for POMDP Decomposition. |
IAT |
2006 |
DBLP DOI BibTeX RDF |
|
56 | Xin Li 0033, William K. Cheung 0001, Jiming Liu 0001 |
Decomposing Large-Scale POMDP Via Belief State Analysis. |
IAT |
2005 |
DBLP DOI BibTeX RDF |
|
56 | Iadine Chades, Bruno Scherrer, François Charpillet |
A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem. |
SAC |
2002 |
DBLP DOI BibTeX RDF |
decision theoretic agents, multiagent systems |
53 | Jesse Hoey, James J. Little |
Decision Theoretic Modeling of Human Facial Displays. |
ECCV (3) |
2004 |
DBLP DOI BibTeX RDF |
|
51 | Jaeyoung Park, Kee-Eung Kim, Sungho Jo |
A POMDP approach to P300-based brain-computer interfaces. |
IUI |
2010 |
DBLP DOI BibTeX RDF |
brain-computer interface (bci), partially observable markov decision process (pomdp), P300 |
51 | Manuel Ocaña, Luis Miguel Bergasa, Miguel Ángel Sotelo, Ramón Flores, Elena López Guillén, Rafael Barea |
Comparison of WiFi Map Construction Methods for WiFi POMDP Navigation Systems. |
EUROCAST |
2007 |
DBLP DOI BibTeX RDF |
Navigation, Localization, WiFi, POMDP |
47 | Hao Zhao, Tao Luo, Guangxin Yue, Xin He |
Myopic sensing for opportunistic spectrum access using channel correlation. |
IWCMC |
2009 |
DBLP DOI BibTeX RDF |
channel correlation, myopic sensing, POMDP, opportunistic spectrum access |
47 | Maayan Roth, Reid G. Simmons, Manuela M. Veloso |
Reasoning about joint beliefs for execution-time communication decisions. |
AAMAS |
2005 |
DBLP DOI BibTeX RDF |
communication, POMDP, distributed execution, robot teams |
43 | Sylvie C. W. Ong, David Hsu, Wee Sun Lee, Hanna Kurniawati |
Partially Observable Markov Decision Process (POMDP) Technologies for Sign Language Based Human-Computer Interaction. |
HCI (7) |
2009 |
DBLP DOI BibTeX RDF |
human-computer interaction, Sign language recognition, planning under uncertainty |
43 | Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony |
Prioritizing Point-Based POMDP Solvers. |
IEEE Trans. Syst. Man Cybern. Part B |
2008 |
DBLP DOI BibTeX RDF |
|
43 | Shaohui Ma, Hao Zhang |
A Dynamic CRM Model Based on POMDP. |
FSKD (2) |
2008 |
DBLP DOI BibTeX RDF |
|
43 | Sven R. Schmidt-Rohr, Rainer Jäkel, Martin Lösch, Rüdiger Dillmann |
Compiling POMDP Models for a Multimodal Service Robot from Background Knowledge. |
EUROS |
2008 |
DBLP DOI BibTeX RDF |
|
43 | Qing Zhao 0001, Lang Tong, Ananthram Swami, Yunxia Chen |
Decentralized cognitive MAC for opportunistic spectrum access in ad hoc networks: A POMDP framework. |
IEEE J. Sel. Areas Commun. |
2007 |
DBLP DOI BibTeX RDF |
|
43 | Leigh A. Johnston, Vikram Krishnamurthy |
Opportunistic file transfer over a fading channel: A POMDP search theory formulation with optimal threshold policies. |
IEEE Trans. Wirel. Commun. |
2006 |
DBLP DOI BibTeX RDF |
|
43 | Ashok K. Karmokar, Dejan V. Djonin, Vijay K. Bhargava |
POMDP-Based Coding Rate Adaptation for Type-I Hybrid ARQ Systems over Fading Channels with Memory. |
IEEE Trans. Wirel. Commun. |
2006 |
DBLP DOI BibTeX RDF |
|
43 | Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony |
Prioritizing Point-Based POMDP Solvers. |
ECML |
2006 |
DBLP DOI BibTeX RDF |
|
43 | Sébastien Paquet, Ludovic Tobin, Brahim Chaib-draa |
An Online POMDP Algorithm Used by the PoliceForce Agents in the RoboCupRescue Simulation. |
RoboCup |
2005 |
DBLP DOI BibTeX RDF |
|
43 | Matthijs T. J. Spaan, Nikos Vlassis |
A Point-based POMDP Algorithm for Robot Planning. |
ICRA |
2004 |
DBLP DOI BibTeX RDF |
|
43 | Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasiewicz |
Formalizing Multi-Agent POMDP's in the context of network routing. |
HICSS |
2003 |
DBLP DOI BibTeX RDF |
|
41 | Pradeep Varakantham, Janusz Marecki, Yuichi Yabu, Milind Tambe, Makoto Yokoo |
Letting loose a SPIDER on a network of POMDPs: generating quality guaranteed policies. |
AAMAS |
2007 |
DBLP DOI BibTeX RDF |
distributed POMDP, globally optimal solution, partially observable markov decision process (POMDP), multi-agent systems |
41 | Praveen Paruchuri, Milind Tambe, Fernando Ordóñez, Sarit Kraus |
Security in multiagent systems by policy randomization. |
AAMAS |
2006 |
DBLP DOI BibTeX RDF |
decentralized POMDP, safety and security in agent systems, teamwork, POMDP, MDP |
40 | Ying Tan, Qinru Qiu |
A Framework of Stochastic Power Management Using Hidden Markov Model. |
DATE |
2008 |
DBLP DOI BibTeX RDF |
|
40 | Blaise Thomson, Jost Schatzmann, Steve J. Young |
Bayesian update of dialogue state for robust dialogue systems. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
40 | Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith |
Information state for Markov decision processes with network delays. |
CDC |
2008 |
DBLP DOI BibTeX RDF |
|
40 | Finale Doshi, Joelle Pineau, Nicholas Roy |
Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs. |
ICML |
2008 |
DBLP DOI BibTeX RDF |
|
40 | Jason D. Williams, S. Young |
Scaling POMDPs for Spoken Dialog Management. |
IEEE Trans. Speech Audio Process. |
2007 |
DBLP DOI BibTeX RDF |
|
40 | Mohammad Rezaeian |
Sensor Scheduling for Optimal Observability Using Estimation Entropy. |
PerCom Workshops |
2007 |
DBLP DOI BibTeX RDF |
|
40 | Masoumeh T. Izadi, Doina Precup |
Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes. |
ECML |
2005 |
DBLP DOI BibTeX RDF |
|
40 | Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony |
Model-Based Online Learning of POMDPs. |
ECML |
2005 |
DBLP DOI BibTeX RDF |
|
40 | Robin Jaulmes, Joelle Pineau, Doina Precup |
Active Learning in Partially Observable Markov Decision Processes. |
ECML |
2005 |
DBLP DOI BibTeX RDF |
|
37 | Sébastien Paquet, Ludovic Tobin, Brahim Chaib-draa |
An online POMDP algorithm for complex multiagent environments. |
AAMAS |
2005 |
DBLP DOI BibTeX RDF |
online search, POMDP |
34 | Augusto Cesar Espíndola Baffa, Angelo E. M. Ciarlini |
Modeling POMDPs for generating and simulating stock investment policies. |
SAC |
2010 |
DBLP DOI BibTeX RDF |
simulation, stock market, POMDP, technical analysis |
34 | Abir-Beatrice Karami, Laurent Jeanpierre, Abdel-Illah Mouaddib |
Human-robot collaboration for a shared mission. |
HRI |
2010 |
DBLP DOI BibTeX RDF |
human-robot collaboration, pomdp |
34 | Faustino J. Gomez, Jürgen Schmidhuber |
Co-evolving recurrent neurons learn deep memory POMDPs. |
GECCO |
2005 |
DBLP DOI BibTeX RDF |
recurrent neural networks, coevolution, POMDP |
34 | Juan Jesús Torre Tresols, Caroline P. C. Chanel, Frédéric Dehais |
POMDP-BCI: A Benchmark of (Re)Active BCI Using POMDP to Issue Commands. |
IEEE Trans. Biomed. Eng. |
2024 |
DBLP DOI BibTeX RDF |
|
34 | Aditi Ramachandran, Sarah Strohkorb Sebo, Brian Scassellati |
Personalized Robot Tutoring Using the Assistive Tutor POMDP (AT-POMDP). |
AAAI |
2019 |
DBLP DOI BibTeX RDF |
|
34 | Joni Pajarinen, Jaakko Peltonen |
Periodic Finite State Controllers for Efficient POMDP and DEC-POMDP Planning. |
NIPS |
2011 |
DBLP BibTeX RDF |
|
34 | Gabriel Corona |
Utilisation de croyances heuristiques pour la planification multi-agent dans le cadre des Dec-POMDP. (Using heuristic belief points for Dec-POMDP planning). |
|
2011 |
RDF |
|
34 | Eddy C. Borera, Larry D. Pyeatt, Arisoa S. Randrianasolo, Mahdi Naser-Moghadasi |
POMDP Filter: Pruning POMDP Value Functions with the Kaczmarz Iterative Method. |
MICAI (1) |
2010 |
DBLP DOI BibTeX RDF |
|
30 | Tarek Taha, Jaime Valls Miró, Gamini Dissanayake |
POMDP-based long-term user intention prediction for wheelchair navigation. |
ICRA |
2008 |
DBLP DOI BibTeX RDF |
|
30 | Tapana Gupta, Pradeep Varakantham, Timothy W. Rauenbusch, Milind Tambe |
Demonstration of teamwork in uncertain domains using hybrid BDI-POMDP systems. |
AAMAS |
2007 |
DBLP DOI BibTeX RDF |
multi-agent systems, teamwork, hybrids, personal assistants |
30 | Manuel Ocaña, Luis Miguel Bergasa, Miguel Ángel Sotelo, Ramón Flores, Elena López Guillén, Rafael Barea |
Training Method Improvements of a WiFi Navigation System Based on POMDP. |
IROS |
2006 |
DBLP DOI BibTeX RDF |
|
30 | Joelle Pineau, Geoffrey J. Gordon |
POMDP Planning for Robust Robot Control. |
ISRR |
2005 |
DBLP DOI BibTeX RDF |
|
30 | Hyuckchul Jung, Milind Tambe |
Performance models for large scale multiagent systems: using distributed POMDP building blocks. |
AAMAS |
2003 |
DBLP DOI BibTeX RDF |
|
30 | Hyuckchul Jung, Ranjit Nair, Milind Tambe, Stacy Marsella |
Computational Models for Multiagent Coordination Analysis: Extending Distributed POMDP Models. |
FAABS |
2002 |
DBLP DOI BibTeX RDF |
|
26 | Richard Seymour, Gilbert L. Peterson |
A Trust-Based Multiagent System. |
CSE (3) |
2009 |
DBLP DOI BibTeX RDF |
|
26 | Akshat Kumar, Shlomo Zilberstein |
Constraint-based dynamic programming for decentralized POMDPs with structured interactions. |
AAMAS (1) |
2009 |
DBLP BibTeX RDF |
DEC-POMDPs, multiagent planning |
26 | Jilles Steeve Dibangoye, Abdel-Illah Mouaddib, Brahim Chaib-draa |
Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs. |
AAMAS (1) |
2009 |
DBLP BibTeX RDF |
decentralized pomdps, point-based solver, artificial intelligence, branch-and-bound, planning under uncertainty |
26 | Yunxia Chen, Qing Zhao 0001, Ananthram Swami |
Joint Design and Separation Principle for Opportunistic Spectrum Access in the Presence of Sensing Errors. |
IEEE Trans. Inf. Theory |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Nicholas Armstrong-Crews, Manuela M. Veloso |
An approximate algorithm for solving oracular POMDPs. |
ICRA |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chaib-draa |
Prediction-Directed Compression of POMDPs. |
ICMLA |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Mei Si, Stacy C. Marsella, David V. Pynadath |
Modeling Appraisal in Theory of Mind Reasoning. |
IVA |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Alexander W. Min, Kang G. Shin |
An Optimal Transmission Strategy for IEEE 802.11 Wireless LANs: Stochastic Control Approach. |
SECON |
2008 |
DBLP DOI BibTeX RDF |
|
26 | François Laviolette, Ludovic Tobin |
A Stochastic Point-Based Algorithm for POMDPs. |
Canadian AI |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Lei Zheng, Siu-Yeung Cho, Chai Quek |
A memory-based reinforcement learning algorithm for partially observable Markovian decision processes. |
IJCNN |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Jesse Hoey, James J. Little |
Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes. |
IEEE Trans. Pattern Anal. Mach. Intell. |
2007 |
DBLP DOI BibTeX RDF |
machine learning, dynamic programming, motion, video analysis, statistical models, clustering algorithms, control theory, Face and gesture recognition, parameter learning |
26 | Shihao Ji, Ronald Parr, Lawrence Carin |
Nonmyopic Multiaspect Sensing With Partially Observable Markov Decision Processes. |
IEEE Trans. Signal Process. |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Vikram Krishnamurthy, Dejan V. Djonin |
Structured Threshold Policies for Dynamic Sensor Scheduling - A Partially Observed Markov Decision Process Approach. |
IEEE Trans. Signal Process. |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Robin Jaulmes, Joelle Pineau, Doina Precup |
A formal framework for robot learning and control under model uncertainty. |
ICRA |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Pantelis Elinas, James J. Little |
Decision theoretic task coordination for a visually-guided interactive mobile robot. |
IROS |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Sudipto Guha, Kamesh Munagala |
Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards. |
FOCS |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Dejan V. Djonin, Qing Zhao, Vikram Krishnamurthy |
Optimality and Complexity of Opportunistic Spectrum Access: A Truncated Markov Decision Process Formulation. |
ICC |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Michael Groble, Will Thompson |
Supporting Multiple User Types with a Multimodal Dialog Agent. |
Web Intelligence/IAT Workshops |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Daan Wierstra, Jürgen Schmidhuber |
Policy Gradient Critics. |
ECML |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Xin Li 0033, William Kwok-Wai Cheung, Jiming Liu 0001, Zhili Wu |
A novel orthogonal NMF-based belief compression for POMDPs. |
ICML |
2007 |
DBLP DOI BibTeX RDF |
|
26 | Deepak Verma, Rajesh P. N. Rao |
Planning and Acting in Uncertain Environments using Probabilistic Inference. |
IROS |
2006 |
DBLP DOI BibTeX RDF |
|
26 | José Antonio Montero, Luis Enrique Sucar |
A Decision-Theoretic Video Conference System Based on Gesture Recognition. |
FGR |
2006 |
DBLP DOI BibTeX RDF |
|
26 | Christopher Amato, Daniel S. Bernstein, Shlomo Zilberstein |
Solving POMDPs using quadratically constrained linear programs. |
AAMAS |
2006 |
DBLP DOI BibTeX RDF |
optimization, POMDPs, planning under uncertainty |
26 | Pradeep Varakantham, Rajiv T. Maheswaran, Milind Tambe |
Implementation Techniques for Solving POMDPs in Personal Assistant Agents. |
PROMAS |
2005 |
DBLP DOI BibTeX RDF |
|
26 | Georgios Theocharous, Kevin P. Murphy, Leslie Pack Kaelbling |
Representing Hierarchical POMDPs as DBNs for Multi-scale Robot Localization. |
ICRA |
2004 |
DBLP DOI BibTeX RDF |
|
26 | Kaustav Das, Andrew W. Moore 0001, Jeff G. Schneider |
Belief state approaches to signaling alarms in surveillance systems. |
KDD |
2004 |
DBLP DOI BibTeX RDF |
scan statistic, signaling alarms, probabilistic model, surveillance systems |
26 | Arthur Plínio de S. Braga, Aluízio F. R. Araújo, Jeremy L. Wyatt |
Incremental topological reinforcement learning agent in non-structured environments. |
SMC (6) |
2004 |
DBLP DOI BibTeX RDF |
|
26 | Hiroshi Osada, Satoshi Fujita |
CHQ: A Multi-Agent Reinforcement Learning Scheme CHQ: A Multi-Agent Reinforcement Learning Scheme. |
IAT |
2004 |
DBLP DOI BibTeX RDF |
|
26 | Martijn C. Schut, Michael J. Wooldridge, Simon Parsons |
Reasoning about Intentions in Uncertain Domains. |
ECSQARU |
2001 |
DBLP DOI BibTeX RDF |
|
21 | Jun Na, Bin Zhang 0001, Yan Gao 0001, Li Zhang, Zhiliang Zhu 0001 |
Long-Term Benefit Driven Adaptation in Service-Based Software Systems. |
ICWS |
2011 |
DBLP DOI BibTeX RDF |
service-based software system, long-term benefit, sequential decision, adaptation, POMDP |
21 | Pengchao Xu, Shen Gu, Lin Gao 0001, Hui Yu 0002, Xinbing Wang, Xiaoying Gan, Shenglong Dong |
Myopic sensing for multiple SUs in multichannel opportunistic access. |
IWCMC |
2010 |
DBLP DOI BibTeX RDF |
multichannel MAC, multiple SUs, myopic policy, opportunistic access, cognitive radio, POMDP |
21 | Yong (Yates) Lin, Kyungseo Park, Fillia Makedon |
From dialogue management to pervasive interaction based assistive technology. |
PETRA |
2010 |
DBLP DOI BibTeX RDF |
pervasive interaction, multimodal, POMDP, assistive environment |
21 | Scott Blunsden, Brandi Richards, Jennifer Boger, Alex Mihailidis, Tom Bartindale, Daniel Jackson 0002, Patrick Olivier, Jesse Hoey |
Design and prototype of a device to engage cognitively disabled older adults in visual artwork. |
PETRA |
2009 |
DBLP DOI BibTeX RDF |
art therapy, computer vision, face detection, touch screen, POMDP, partially observable Markov decision process, MDP |
21 | Michael R. James 0001, Satinder Singh 0001 |
SarsaLandmark: an algorithm for learning in POMDPs with landmarks. |
AAMAS (1) |
2009 |
DBLP BibTeX RDF |
reinforcement learning, landmark, POMDP, partial observability |
21 | Joaquín Lopez Fernández, Rafael Sanz, Reid G. Simmons, Amador R. Diéguez |
Heuristic anytime approaches to stochastic decision processes. |
J. Heuristics |
2006 |
DBLP DOI BibTeX RDF |
Planning, Heuristic algorithms, POMDP, Partially observable Markov decision process, Decision Systems |
21 | David J. Montana, Eric Van Wyk, Marshall Brinn, Joshua Montana, Stephen Milligan |
Genomic computing networks learn complex POMDPs. |
GECCO |
2006 |
DBLP DOI BibTeX RDF |
POMDP, evolutionary neural networks |
21 | Jiaying Shen, Raphen Becker, Victor R. Lesser |
Agent interaction in distributed POMDPs and its implications on complexity. |
AAMAS |
2006 |
DBLP DOI BibTeX RDF |
distributed POMDP, interaction, complexity |
17 | Tarik Selimovic, Marijana Peti, Stjepan Bogdan |
Multi-Agent Active Perception Based on Reinforcement Learning and POMDP. |
IEEE Access |
2024 |
DBLP DOI BibTeX RDF |
|
17 | Mohammed Chelouati, Abderraouf Boussif, Julie Beugin, El-Miloudi El-Koursi |
A Risk-Based Decision-Making Process for Autonomous Trains Using POMDP: Case of the Anti-Collision Function. |
IEEE Access |
2024 |
DBLP DOI BibTeX RDF |
|
17 | Sushmita Bhattacharya, Siva Kailas, Sahil Badyal, Stephanie Gil, Dimitri P. Bertsekas |
Multiagent Reinforcement Learning: Rollout and Policy Iteration for POMDP With Application to Multirobot Problems. |
IEEE Trans. Robotics |
2024 |
DBLP DOI BibTeX RDF |
|
17 | Benjamin Kraske, Zakariya Laouar, Zachary Sunberg |
Leveraging Counterfactual Paths for Contrastive Explanations of POMDP Policies. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
17 | Paula Stocco, Suhas Chundi, Arec L. Jamgochian, Mykel J. Kochenderfer |
Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|