|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 1997 occurrences of 869 keywords
|
|
|
Results
Found 40620 publication records. Showing 40620 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
75 | James S. Duncan, Thomas Birkhölzer |
Reinforcement of Linear Structure using Parametrized Relaxation Labeling. |
IEEE Trans. Pattern Anal. Mach. Intell. |
1992 |
DBLP DOI BibTeX RDF |
linear structure reinforcement, label strength, information suppression, neighbourhood influence biassing, bar reinforcement, line segment reinforcement, parametrized relaxation labeling, local evidence, orientation labels, sigmoidal thresholding function, noise-suppressed labelings, edge reinforcement, computational complexity, computational complexity, picture processing, convergence, thinning, noisy images |
61 | SeungGwan Lee |
A Cooperation Online Reinforcement Learning Approach in Ant-Q. |
ICONIP (1) |
2006 |
DBLP DOI BibTeX RDF |
|
60 | Arthur Plínio de S. Braga, Aluízio F. R. Araújo |
A topological reinforcement learning agent for navigation. |
Neural Comput. Appl. |
2003 |
DBLP DOI BibTeX RDF |
Latent learning, Neural networks, Navigation, Reinforcement learning, Topological maps |
56 | Valdinei Freire da Silva, Pedro U. Lima, Anna Helena Reali Costa |
Eliciting preferences over observed behaviours based on relative evaluations. |
IROS |
2007 |
DBLP DOI BibTeX RDF |
|
55 | Chin-Teng Lin, Chong-Ping Jou |
Controlling chaos by GA-based reinforcement learning neural network. |
IEEE Trans. Neural Networks |
1999 |
DBLP DOI BibTeX RDF |
|
54 | Drew Mellor |
Model-Based Reinforcement Learning for Alternating Markov Games. |
Australian Conference on Artificial Intelligence |
2003 |
DBLP DOI BibTeX RDF |
machine learning, search, reinforcement learning, planning, Game playing |
53 | J. J. McDowell, Paul L. Soto, Jesse Dallery, Saule Kulubekova |
A computational theory of adaptive behavior based on an evolutionary reinforcement mechanism. |
GECCO |
2006 |
DBLP DOI BibTeX RDF |
Rescorla-Wagner rule, conditioned reinforcement, credit assignment, delay-reduction theory, matching theory, stimulus control, evolutionary algorithms, reinforcement learning, adaptive agents, adaptive behavior |
47 | Omid Aghazadeh, Maziar Ahmad Sharbafi, Abolfazl Toroghi Haghighat |
Implementing Parametric Reinforcement Learning in Robocup Rescue Simulation. |
RoboCup |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Decision Making, Multi Agent Coordination |
47 | Joost Broekens |
Emotion and Reinforcement: Affective Facial Expressions Facilitate Robot Learning. |
Artifical Intelligence for Human Computing |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Affect, Human-in-the-Loop |
47 | Kengo Katayama, Takahiro Koshiishi, Hiroyuki Narihisa |
Reinforcement learning agents with primary knowledge designed by analytic hierarchy process. |
SAC |
2005 |
DBLP DOI BibTeX RDF |
profit sharing, pursuit problem, sokoban, reinforcement learning, deadlock, analytic hierarchy process |
46 | Jing Shen, Guochang Gu, Haibo Liu |
Multi-Agent Hierarchical Reinforcement Learning by Integrating Options into MAXQ. |
IMSCCS (1) |
2006 |
DBLP DOI BibTeX RDF |
MAXQ, Options, hierarchical reinforcement learning, multi-agent reinforcement learning |
46 | Andrea Bonarini, Francesco Montrone, Marcello Restelli |
Reinforcement Distribution in Continuous State Action Space Fuzzy Q-Learning: A Novel Approach. |
WILF |
2005 |
DBLP DOI BibTeX RDF |
Fuzzy Q–learning, continuous state-action space, reinforcement distribution, Fuzzy logic, Reinforcement Learning |
42 | Dusko Katic |
New Trends in Robotic Reinforcement Learning: Single and Multi-robot Case. |
Towards Intelligent Engineering and Information Technology |
2009 |
DBLP DOI BibTeX RDF |
|
41 | Hisashi Handa |
EDA-RL: estimation of distribution algorithms for reinforcement learning problems. |
GECCO |
2009 |
DBLP DOI BibTeX RDF |
reinforcement learning problems, estimation of distribution algorithms, conditional random fields |
41 | Huang Zhen, Li Dongyu, Li Chengwei |
Implementation of Reinforcement and Reduction of Traditional Acupuncture and Moxibustion. |
BMEI (1) |
2008 |
DBLP DOI BibTeX RDF |
Reinforcement and Reduction, Laser Acupuncture |
41 | Sachin B. Patkar, H. Narayanan |
Fast On-Line/Off-Line Algorithms for Optimal Reinforcement of a Network and its Connections with Principal Partition. |
J. Comb. Optim. |
2003 |
DBLP DOI BibTeX RDF |
Principal Partition, network, graph, on-line algorithm, reinforcement, polymatroid, strength |
41 | Albert Y. Zomaya, Matthew Clements, Stephan Olariu |
A Framework for Reinforcement-Based Scheduling in Parallel Processor Systems. |
IEEE Trans. Parallel Distributed Syst. |
1998 |
DBLP DOI BibTeX RDF |
scheduling, Neural networks, parallel processing, reinforcement learning, randomization, task allocation |
41 | Alessandro Lazaric |
Transfer in Reinforcement Learning: A Framework and a Survey. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Ann Nowé, Peter Vrancx, Yann-Michaël De Hauwere |
Game Theory and Multi-agent Reinforcement Learning. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Hado van Hasselt |
Reinforcement Learning in Continuous State and Action Spaces. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Jens Kober, Jan Peters 0001 |
Reinforcement Learning in Robotics: A Survey. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Martijn van Otterlo, Marco A. Wiering |
Reinforcement Learning and Markov Decision Processes. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Shimon Whiteson |
Evolutionary Computation for Reinforcement Learning. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | István Szita |
Reinforcement Learning in Games. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Sascha Lange, Thomas Gabel, Martin A. Riedmiller |
Batch Reinforcement Learning. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Ashvin Shah |
Psychological and Neuroscientific Connections with Reinforcement Learning. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
41 | Nikos Vlassis, Mohammad Ghavamzadeh, Shie Mannor, Pascal Poupart |
Bayesian Reinforcement Learning. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
40 | Marc J. V. Ponsen, Tom Croonenborghs, Karl Tuyls, Jan Ramon, Kurt Driessens |
Learning with whom to communicate using relational reinforcement learning. |
AAMAS (2) |
2009 |
DBLP BibTeX RDF |
relational reinforcement learning, multi-agent systems, reinforcement learning |
40 | Carlos Diuk, Alexander L. Strehl, Michael L. Littman |
A hierarchical approach to efficient reinforcement learning in deterministic domains. |
AAMAS |
2006 |
DBLP DOI BibTeX RDF |
factored representations, reinforcement learning, hierarchical reinforcement learning, sample complexity |
36 | Lashon B. Booker |
Adaptive value function approximations in classifier systems. |
GECCO Workshops |
2005 |
DBLP DOI BibTeX RDF |
hyperplane coding, tile coding, reinforcement learning, learning classifier systems, function approximation |
36 | Gavin Taylor, Ronald Parr |
Kernelized value function approximation for reinforcement learning. |
ICML |
2009 |
DBLP DOI BibTeX RDF |
|
36 | Zahra Abbasi, Mohammad Ali Abbasi |
Reinforcement Distribution in a Team of Cooperative Q-learning Agents. |
SNPD |
2008 |
DBLP DOI BibTeX RDF |
Agent learning, evolution and adaptation, Cooperative distributed problem, solving, and teamwork, Coordination, Multiagent Systems, Multiagent Systems, cooperation, Multiagent learning |
36 | Hisashi Handa |
Evolutionary Computation on Multitask Reinforcement Learning Problems. |
ICNSC |
2007 |
DBLP DOI BibTeX RDF |
|
36 | Jian Fan, Yang Song 0003, Minrui Fei, Qijie Zhao |
A Novel Neural Network Based Reinforcement Learning. |
LSMS (1) |
2007 |
DBLP DOI BibTeX RDF |
|
36 | Guanghua Hu, Yuqin Qiu, Liming Xiang |
Kernel-Based Reinforcement Learning. |
ICIC (1) |
2006 |
DBLP DOI BibTeX RDF |
|
36 | Farhang Sahba, Hamid R. Tizhoosh, Magdy M. A. Salama |
A Reinforcement Learning Framework for Medical Image Segmentation. |
IJCNN |
2006 |
DBLP DOI BibTeX RDF |
|
36 | Chia-Feng Juang |
Combination of online clustering and Q-value based GA for reinforcement fuzzy system design. |
IEEE Trans. Fuzzy Syst. |
2005 |
DBLP DOI BibTeX RDF |
|
36 | Ben-Nian Wang, Yang Gao 0001, Zhaoqian Chen, Jun-Yuan Xie, Shifu Chen |
LMRL: A Multi-Agent Reinforcement Learning Model and Algorithm. |
ICITA (1) |
2005 |
DBLP DOI BibTeX RDF |
|
36 | Norihisa Sato, Masaharu Adachi, Makoto Kotani |
Control of Associative Chaotic Neural Networks Using a Reinforcement Learning. |
ISNN (1) |
2004 |
DBLP DOI BibTeX RDF |
|
36 | Elhadi M. Shakshuki, Karim Rahim |
Multiple Reinforcement Learning Agents in a Static Environment. |
IEA/AIE |
2004 |
DBLP DOI BibTeX RDF |
|
36 | Kurt Driessens |
Relational Reinforcement Learning. |
EASSS |
2001 |
DBLP DOI BibTeX RDF |
|
36 | Bernhard Hengst |
Hierarchical Approaches. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Frans A. Oliehoek |
Decentralized POMDPs. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Lihong Li 0001 |
Sample Complexity Bounds of Exploration. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Todd Hester, Peter Stone |
Learning and Using Models. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Martijn van Otterlo |
Solving Relational and First-Order Logical Markov Decision Processes: A Survey. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | David Wingate |
Predictively Defined Representations of State. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Matthijs T. J. Spaan |
Partially Observable Markov Decision Processes. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Marco A. Wiering, Martijn van Otterlo |
Conclusions, Future Directions and Outlook. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
36 | Lucian Busoniu, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Robert Babuska, Bart De Schutter |
Least-Squares Methods for Policy Iteration. |
Reinforcement Learning |
2012 |
DBLP DOI BibTeX RDF |
|
35 | Güray Erus, Faruk Polat |
A layered approach to learning coordination knowledge in multiagent environments. |
Appl. Intell. |
2007 |
DBLP DOI BibTeX RDF |
Reinforcement learning, Multiagent learning, Hierarchical reinforcement learning |
35 | Anthony Knittel |
An activation reinforcement based classifier system for balancing generalisation and specialisation (ARCS). |
GECCO (Companion) |
2010 |
DBLP DOI BibTeX RDF |
accessibility, reinforcement learning, memory, learning classifier systems, concepts |
35 | Jartuwat Rajruangrabin, Dan O. Popa |
Reinforcement learning of interface mapping for interactivity enhancement of robot control in assistive environments. |
PETRA |
2010 |
DBLP DOI BibTeX RDF |
reinforcement learning, human-robot interface |
35 | Mehrtash Tafazzoli Harandi, Majid Nili Ahmadabadi, Babak Nadjar Araabi |
Optimal Local Basis: A Reinforcement Learning Approach for Face Recognition. |
Int. J. Comput. Vis. |
2009 |
DBLP DOI BibTeX RDF |
Feature selection, Face recognition, Reinforcement learning |
35 | Peter Vamplew 0001, Richard Dazeley, Ewan Barker, Andrei V. Kelarev |
Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks. |
Australasian Conference on Artificial Intelligence |
2009 |
DBLP DOI BibTeX RDF |
scalarisation, reinforcement learning, Pareto fronts, multiobjective |
35 | Jian Fan, Minrui Fei, Likang Shao, Feng Huang |
A Novel Multi-robot Coordination Method Based on Reinforcement Learning. |
ICIC (1) |
2008 |
DBLP DOI BibTeX RDF |
behavior weight, role transformation, reinforcement learning, multi-robot |
35 | Furu Wei, Wenjie Li 0002, Qin Lu 0001, Yanxiang He |
Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization. |
SIGIR |
2008 |
DBLP DOI BibTeX RDF |
mutual reinforcement chain, query-sensitive similarity, ranking algorithms, query-oriented summarization |
35 | Mohammad Hossein Fazel Zarandi, Javid Jouzdani, I. Burhan Türksen |
Generalized Reinforcement Learning Fuzzy Control with Vague States. |
Analysis and Design of Intelligent Systems using Soft Computing Techniques |
2007 |
DBLP DOI BibTeX RDF |
reinforcement learning, fuzzy control, fuzzy systems |
35 | Xiaobei Cheng, Jing Shen, Haibo Liu, Guochang Gu |
Multi-robot Cooperation Based on Hierarchical Reinforcement Learning. |
International Conference on Computational Science (3) |
2007 |
DBLP DOI BibTeX RDF |
cooperation, multi-robot, hierarchical reinforcement learning |
35 | Luiz A. Celiberto, Jackson Paul Matsuura, Reinaldo A. C. Bianchi |
Heuristic Q-Learning Soccer Players: A New Reinforcement Learning Approach to RoboCup Simulation. |
EPIA Workshops |
2007 |
DBLP DOI BibTeX RDF |
RoboCup Simulation 2D, Reinforcement Learning, Cognitive Robotics |
35 | Quan Liu, Yang Gao 0001, Zhiming Cui, WangShu Yao, ZhongWen Chen |
An Tableau Automated Theorem Proving Method Using Logical Reinforcement Learning. |
ISICA |
2007 |
DBLP DOI BibTeX RDF |
logical reinforcement learning, tableau automated theorem proving, LOMDP |
35 | Mazda Ahmadi, Matthew E. Taylor, Peter Stone |
IFSA: incremental feature-set augmentation for reinforcement learning tasks. |
AAMAS |
2007 |
DBLP DOI BibTeX RDF |
reinforcement learning |
35 | Bernhard Hengst |
Safe State Abstraction and Reusable Continuing Subtasks in Hierarchical Reinforcement Learning. |
Australian Conference on Artificial Intelligence |
2007 |
DBLP DOI BibTeX RDF |
state abstraction, task hierarchies, decomposition, hierarchical reinforcement learning |
35 | Shimon Whiteson, Peter Stone |
On-line evolutionary computation for reinforcement learning in stochastic domains. |
GECCO |
2006 |
DBLP DOI BibTeX RDF |
neural networks, evolutionary computation, reinforcement learning, on-line learning |
35 | Daniela Pereira Alves, Li Weigang 0001, Bueno Borges de Souza |
Using Meta-Level Control with Reinforcement Learning to Improve the Performance of the Agents. |
FSKD |
2006 |
DBLP DOI BibTeX RDF |
Air Traffic Flow Management, Meta-Level Control, Reinforcement learning |
35 | Fernando Fernández 0001, Daniel Borrajo, Lynne E. Parker |
A Reinforcement Learning Algorithm in Cooperative Multi-Robot Domains. |
J. Intell. Robotic Syst. |
2005 |
DBLP DOI BibTeX RDF |
state space discretizations, collaborative multi-robot domains, reinforcement learning, function approximation |
35 | Atsushi Wada, Keiki Takadama, Katsunori Shimohara |
Learning classifier system equivalent with reinforcement learning with function approximation. |
GECCO Workshops |
2005 |
DBLP DOI BibTeX RDF |
reinforcement learning, learning classifier systems, function approximation, genetic-based machine learning |
35 | Katja Verbeeck, Ann Nowé, Karl Tuyls |
Coordinated exploration in multi-agent reinforcement learning: an application to load-balancing. |
AAMAS |
2005 |
DBLP DOI BibTeX RDF |
load-balancing, reinforcement learning, exploration |
35 | Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna Helena Reali Costa |
Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning. |
SBIA |
2004 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Cognitive Robotics |
35 | Fang Liu, Jianbo Su |
An Online Feature Learning Algorithm Using HCI-Based Reinforcement Learning. |
ISNN (1) |
2004 |
DBLP DOI BibTeX RDF |
Feature learning, HCI, reinforcement learning |
35 | Paul Juell, Patrick Paulson |
Using Reinforcement Learning for Similarity Assessment in Case-Based Systems. |
IEEE Intell. Syst. |
2003 |
DBLP DOI BibTeX RDF |
neural networks, learning, reinforcement learning, reasoning, case-based reasoning |
35 | Pier Luca Lanzi |
Learning classifier systems from a reinforcement learning perspective. |
Soft Comput. |
2002 |
DBLP DOI BibTeX RDF |
Genetic algorithms, Reinforcement learning, XCS, Q-learning |
35 | Nobuo Suematsu, Akira Hayashi |
A multiagent reinforcement learning algorithm using extended optimal response. |
AAMAS |
2002 |
DBLP DOI BibTeX RDF |
Markov games, reinforcement learning, Q-learning, stochastic games |
35 | Carlos H. C. Ribeiro |
Embedding a Priori Knowledge in Reinforcement Learning. |
J. Intell. Robotic Syst. |
1998 |
DBLP DOI BibTeX RDF |
Q-learning algorithm, experience generalisation, reinforcement learning |
35 | Cheng-Peng Kuan, Kuu-young Young |
Reinforcement Learning and Robust Control for Robot Compliance Tasks. |
J. Intell. Robotic Syst. |
1998 |
DBLP DOI BibTeX RDF |
compliance tasks, reinforcement learning, robust control |
34 | Dongbing Gu, Erfu Yang |
Fuzzy Policy Reinforcement Learning in Cooperative Multi-robot Systems. |
J. Intell. Robotic Syst. |
2007 |
DBLP DOI BibTeX RDF |
flocking behavior, policy gradient reinforcement learning, cooperative control, multi-agent reinforcement learning |
31 | Markus M. Geipel, Michael Beetz |
Learning to Shoot Goals Analysing the Learning Process and the Resulting Policies. |
RoboCup |
2006 |
DBLP DOI BibTeX RDF |
|
31 | Hwan-Chun Myung, Z. Zenn Bien |
Similarity between Fuzzy Multi-objective Control and Eligibility. |
AFSS |
2002 |
DBLP DOI BibTeX RDF |
|
30 | Haifeng Chen, Guofei Jiang, Hui Zhang 0002, Kenji Yoshihira |
Boosting the performance of computing systems through adaptive configuration tuning. |
SAC |
2009 |
DBLP DOI BibTeX RDF |
configuration tuning, reinforcement learning, system management |
30 | Olivier Pietquin |
Machine Learning for Spoken Dialogue Management: An Experiment with Speech-Based Database Querying. |
AIMSA |
2006 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Spoken Dialogue Systems, Dialogue Management |
30 | Ji Wu 0006, Chaoqun Ye, Shiyao Jin |
Opponent Learning for Multi-agent System Simulation. |
RSKT |
2006 |
DBLP DOI BibTeX RDF |
reinforcement learning, Markov decision processes, multi-agent simulation, Opponent modeling |
30 | Aram Galstyan, Karl Czajkowski, Kristina Lerman |
Resource Allocation in the Grid with Learning Agents. |
J. Grid Comput. |
2005 |
DBLP DOI BibTeX RDF |
multi-agent system, Grid, resource allocation, reinforcement learning |
30 | Xin-Jing Wang, Wei-Ying Ma, Lei Zhang 0001, Xing Li 0001 |
Iteratively clustering web images based on link and attribute reinforcements. |
ACM Multimedia |
2005 |
DBLP DOI BibTeX RDF |
iterative reinforcement, image clustering, link mining |
30 | Atsushi Wada, Keiki Takadama, Katsunori Shimohara |
Counter example for Q-bucket-brigade under prediction problem. |
GECCO Workshops |
2005 |
DBLP DOI BibTeX RDF |
reinforcement learning, convergence, learning classifier systems, function approximation, genetic-based machine learning |
30 | Reda Alhajj, Mehmet Kaya |
Multiagent Association Rules Mining in Cooperative Learning Systems. |
ADMA |
2005 |
DBLP DOI BibTeX RDF |
pursuit domain, data mining, association rules, reinforcement learning, multiagent systems |
30 | Alexandros M. Grigoriadis, Georgios Paliouras |
Focused Crawling Using Temporal Difference-Learning. |
SETN |
2004 |
DBLP DOI BibTeX RDF |
Machine learning, reinforcement learning, web mining, focused crawling |
30 | Kagan Tumer, Adrian K. Agogino, David H. Wolpert |
Learning sequences of actions in collectives of autonomous agents. |
AAMAS |
2002 |
DBLP DOI BibTeX RDF |
MAS, reinforcement learning, Q-learning |
30 | Reinaldo A. C. Bianchi, Raquel Ros, Ramón López de Mántaras |
Improving Reinforcement Learning by Using Case Based Heuristics. |
ICCBR |
2009 |
DBLP DOI BibTeX RDF |
|
30 | Fangju Wang, Kyle Swegles |
An Online Algorithm for Applying Reinforcement Learning to Handle Ambiguity in Spoken Dialogues. |
TAMC |
2009 |
DBLP DOI BibTeX RDF |
|
30 | Patrick D. Roberts, Roberto A. Santiago, Gerardo Lafferriere |
An implementation of reinforcement learning based on spike timing dependent plasticity. |
Biol. Cybern. |
2008 |
DBLP DOI BibTeX RDF |
Learning, Computational neuroscience, Synaptic plasticity, Spiking neuron model |
30 | Dimitri Ognibene, Christian Balkenius, Gianluca Baldassarre |
A Reinforcement-Learning Model of Top-Down Attention Based on a Potential-Action Map. |
The Challenge of Anticipation |
2008 |
DBLP DOI BibTeX RDF |
|
30 | Lasheng Yu, Alonso Marin, Fei Hong, Jian Lin |
Studies on Hierarchical Reinforcement Learning in Multi-Agent Environment. |
ICNSC |
2008 |
DBLP DOI BibTeX RDF |
|
30 | Yuxi Li, Dale Schuurmans |
Policy Iteration for Learning an Exercise Policy for American Options. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
30 | Takeshi Shibuya, Shingo Shimada, Tomoki Hamagami |
Experimental study of the eligibility traces in complex valued reinforcement learning. |
SMC |
2007 |
DBLP DOI BibTeX RDF |
|
30 | Toshihiko Watanabe, Yoshiya Takahashi |
Hierarchical reinforcement learning using a modular fuzzy model for multi-agent problem. |
SMC |
2007 |
DBLP DOI BibTeX RDF |
|
30 | Terrence S. T. Mak, Kai-Pui Lam, H. S. Ng, Guy Rachmuth, Chi-Sang Poon |
A Current-Mode Analog Circuit for Reinforcement Learning Problems. |
ISCAS |
2007 |
DBLP DOI BibTeX RDF |
|
30 | Erfu Yang, Dongbing Gu |
A Multiagent Fuzzy Policy Reinforcement Learning Algorithm with Application to Leader-Follower Robotic Systems. |
IROS |
2006 |
DBLP DOI BibTeX RDF |
|
30 | J. J. McDowell, Zahra Ansari |
The Quantitative Law of Effect is a Robust Emergent Property of an Evolutionary Algorithm for Reinforcement Learning. |
ECAL |
2005 |
DBLP DOI BibTeX RDF |
|
30 | SeungGwan Lee, TaeChoong Chung |
A Reinforcement Learning Algorithm Using Temporal Difference Error in Ant Model. |
IWANN |
2005 |
DBLP DOI BibTeX RDF |
|
30 | Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice Bruynooghe |
Multi-agent Relational Reinforcement Learning. |
LAMAS |
2005 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 40620 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ >>] |
|