TY - GEN
T1 - Distributed dynamic reinforcement of efficient outcomes in multiagent coordination
AU - Chasparis, Georgios C.
AU - Shamma, Jeff S.
N1 - Publisher Copyright:
© 2007 EUCA.
PY - 2007
Y1 - 2007
N2 - We consider the problem of achieving distributed convergence to coordination in a multiagent environment. Each agent is modeled as a learning automaton which repeatedly interacts with an unknown environment, receives a reward, and updates the probabilities of its next action based on its own previous actions and received rewards. In this class of problems, more than one stable equilibrium (i.e., coordination structure) exists. We analyze the dynamic behavior of the distributed system in terms of convergence to an efficient equilibrium, suitably defined. In particular, we analyze the effect of dynamic processing on convergence properties, where agents include the derivative of their own reward into the decision process (i.e., derivative action). We show that derivative action can be used as an equilibrium selection scheme by appropriately adjusting derivative feedback gains.
AB - We consider the problem of achieving distributed convergence to coordination in a multiagent environment. Each agent is modeled as a learning automaton which repeatedly interacts with an unknown environment, receives a reward, and updates the probabilities of its next action based on its own previous actions and received rewards. In this class of problems, more than one stable equilibrium (i.e., coordination structure) exists. We analyze the dynamic behavior of the distributed system in terms of convergence to an efficient equilibrium, suitably defined. In particular, we analyze the effect of dynamic processing on convergence properties, where agents include the derivative of their own reward into the decision process (i.e., derivative action). We show that derivative action can be used as an equilibrium selection scheme by appropriately adjusting derivative feedback gains.
UR - http://www.scopus.com/inward/record.url?scp=84927739360&partnerID=8YFLogxK
U2 - 10.23919/ecc.2007.7069003
DO - 10.23919/ecc.2007.7069003
M3 - Conference contribution
AN - SCOPUS:84927739360
T3 - 2007 European Control Conference, ECC 2007
SP - 2505
EP - 2512
BT - 2007 European Control Conference, ECC 2007
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2007 9th European Control Conference, ECC 2007
Y2 - 2 July 2007 through 5 July 2007
ER -