Towards a Generic Architecture for Multi-Vehicle Autonomy (2008)
Malcolm Strens, Spiros Kapetanakis, Jeremy Baxter
We describe the motivation for a programme of research “MP012: Software Architecture for Hybrid Decision Making ” that is being undertaken in SEAS DTC Year 2. There are many ways to endow an...
Learning to Coordinate Using Commitment Sequences in Cooperative Multi-Agent Systems (2007)
Spiros Kapetanakis, Daniel Kudenko, Malcolm Strens
We report on an investigation of the learning of coordination in cooperative multi-agent systems. Specifically, we study solutions that are applicable to independent agents i.e. agents that do not...
Between collaboration and competition: An Initial Formalization using Distributed POMDPs (2007)
Praveen Paruchuri, Milind Tambe, Spiros Kapetanakis, Sarit Kraus
This paper presents an initial formalization of teamwork in multi-agent domains. Although analyses of teamwork already exist in the literature of multi-agent systems, almost no work has dealt with...
Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems (2004)
Spiros Kapetanakis, Daniel Kudenko
systems
Learning to coordinate using commitment sequences in cooperative multiagent-systems (2003)
Spiros Kapetanakis, Daniel Kudenko, Malcolm Strens
We report on an investigation of the learning of coordination in cooperative multiagent systems. Specifically, we study solutions that are applicable to independent agents, i.e., agents that do not...
Reinforcement Learning of Coordination in Cooperative Multi-Agent Systems (2002)
Spiros Kapetanakis, Daniel Kudenko
We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multiagent systems. Specifically, we focus on a novel action selection strategy for...
Improving on the Reinforcement Learning of Coordination in Cooperative Multi-Agent Systems (2002)
Spiros Kapetanakis, Daniel Kudenko
We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multiagent systems. These techniques are variants of Q-learning (Watkins, 1989) that...