Powell, W. B., A. George, J. Berger, A. Boukhtouta, “An adaptive-learning Framework for Semi-cooperative Multi-agent coordination,” SSCI 2011 ADPRL – 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, Paris, April, 2011.
Complex problems involving multiple agents exhibit varying degrees of cooperation. The levels of cooperation might reflect both differences in information as well as differences in goals. In this research, we develop a general mathematical model for distributed, semi-cooperative planning and suggest a solution strategy which involves decomposing the system into subproblems, each of which is specified at a certain period in time and controlled by an agent. The agents communicate marginal values of resources to each other, possibly with distortion. We design experiments to demonstrate the benefits of communication between the agents and show that, with communication, the solution quality approaches that of the ideal situation where the entire problem is controlled by a single agent.
Topaloglu, H. and W.B. Powell, “A Distributed Decision-Making Structure for Dynamic Resource Allocation with Nonlinear Functional Approximations,” Operations Research, Vol. 53, No. 2, pp. 281-297 (2005) (c) Informs
Many complex problems in transportation are characterized by multiple decision-makers who control their own decisions with their own information. Coordinating these decision makers is a well-recognized challenge. This paper proposes to use a nonlinear approximation architecture, where the impact of decisions to move resources from one part of the network to another is captured using a nonlinear function. This introduces unique complications that do not arise with more traditional linear architectures, but produces much better results.
Shapiro, J. and W.B. Powell, “A Metastrategy for Dynamic Resource Management Problems based on Informational Decomposition,” Informs Journal on Computing, Vol. 18, No. 1, pp. 43-60 (2006). (c) Informs.
We do a lot of work on very large scale problems in transportation. These are typically characterized by multiagent decision making, which also means multiagent information structures. This paper provides a fairly general model of the organization of information and decisions, and a method for decomposing and linking these decisions so that the individual agents, operating separately, produce near optimal overall solutions.