Description
Two criminals are caught and held in separate cells, unable to communicate with each other. They are each offered the same deal, independently, and each knows that the other is offered this deal. They each have two options: 1) confess and be a witness against the other person, or 2) deny their guilt. If they both confess, they will each get 5 years in jail. If one confesses and the other denies guilt, then the one who confessed will get 1 year in jail and the other will get 6 years in jail. If they both deny guilt, they will each get 2 years in jail. This problem is a good example of a two-person nonzerosum bimatrix game, as described in Cottle, Pang, and Stone. The Prisoner's Dilemma has been posed and studied in various forms by many game theory researchers. It was originally framed by Merrill Flood and Melvin Dresher at RAND in 1950 and formalized with prison sentence rewards as the Prisoner's Dilemma by Al Tucker. Note that the KKT conditions for the two players' optimization problems contain free variables: the duals to the sum-to-one constraints. This is not a problem for the mixed complementarity format used by GAMS, but it requires some algebraic gymnastics to create a problem that fits into the LCP framework, as described in CPS. Cottle, R W, and Pang, J-S, and Stone, R E, The Linear Complementarity Problem, Academic Press, San Diego, CA, 1992 Keywords: mixed complementarity problem, extended mathematical programming, applied general equilibrium, prisoner's dilemma, game theory
Small Model of Types : EMP mcp
Category : GAMS Model library
Main file : prisoner.gms
$title Prisoners Dilemma as EMP and MCP (prisoner,SEQ=392)
$onText
Two criminals are caught and held in separate cells, unable to
communicate with each other. They are each offered the same deal,
independently, and each knows that the other is offered this deal.
They each have two options:
1) confess and be a witness against the other person, or
2) deny their guilt.
If they both confess, they will each get 5 years in jail. If
one confesses and the other denies guilt, then the one who confessed
will get 1 year in jail and the other will get 6 years in jail. If
they both deny guilt, they will each get 2 years in jail.
This problem is a good example of a two-person nonzerosum bimatrix
game, as described in Cottle, Pang, and Stone. The Prisoner's Dilemma
has been posed and studied in various forms by many game theory
researchers. It was originally framed by Merrill Flood and Melvin
Dresher at RAND in 1950 and formalized with prison sentence rewards as
the Prisoner's Dilemma by Al Tucker.
Note that the KKT conditions for the two players' optimization
problems contain free variables: the duals to the sum-to-one
constraints. This is not a problem for the mixed complementarity
format used by GAMS, but it requires some algebraic gymnastics to
create a problem that fits into the LCP framework, as described in
CPS.
Cottle, R W, and Pang, J-S, and Stone, R E, The Linear Complementarity
Problem, Academic Press, San Diego, CA, 1992
Keywords: mixed complementarity problem, extended mathematical programming,
applied general equilibrium, prisoner's dilemma, game theory
$offText
Set i 'player 1 strategies' / confess, deny /;
Alias (i,j);
Table A(i,j) 'jail time for player 1 is xAy'
confess deny
confess 5 1
deny 6 2;
Parameter B(i,j) 'jail time for player 2 is xBy';
B(i,j) = A(j,i);
Positive Variable
x(i) 'player 1 strategy'
y(j) 'player 2 strategy';
Variable
z1 'player one jail time'
z2 'player two jail time';
Equation
z1Def, z2Def
s1Def, s2Def;
z1Def.. sum(i, x(i)*sum(j, A(i,j)*y(j))) =e= z1;
s1Def.. sum(i, x(i)) =e= 1;
z2Def.. sum(i, x(i)*sum(j, B(i,j)*y(j))) =e= z2;
s2Def.. sum(j, y(j)) =e= 1;
* starting values for MILES
x.l(i)$(ord(i) = 1) = 1;
y.l(j)$(ord(j) = 1) = 1;
Model m 'bimatrix game' / all /;
File empinfo / '%emp.info%' /;
putClose empinfo
'equilibrium' /
' min z1 x z1Def s1Def' /
' min z2 y z2Def s2Def' /;
solve m using emp;
abort$(m.solveStat <> 1) 'bad solvestat for EMP model';
abort$(m.modelStat > 2) 'bad modelstat for EMP model';
Parameter xRes(*,i), yRes(*,j);
xRes('emp',i) = x.l(i);
yRes('emp',j) = y.l(j);
* now explicitly formulate and jointly solve the KKT systems for
* players 1 and 2
Variable
u1 'dual multiplier for s1Def'
u2 'dual multiplier for s2Def';
Equation
dLdx(i) 'FOC for player 1: deriv wrt x'
dLdy(j) 'FOC for player 2: deriv wrt y';
dLdx(i).. sum{j, A(i,j)*y(j)} - u1 =n= 0;
dLdy(j).. sum{i, B(i,j)*x(i)} - u2 =n= 0;
Model kkt 'KKT conditions for bimatrix game' / dLdx.x, dLdy.y, s1Def.u1, s2Def.u2 /;
* starting values for MILES
x.l(i) = 1$(ord(i) = 1);
y.l(j) = 1$(ord(j) = 1);
solve kkt using mcp;
abort$(kkt.solveStat <> 1) 'bad solvestat for MCP model';
abort$(kkt.modelStat > 2) 'bad modelstat for MCP model';
xRes('kkt',i) = x.l(i);
yRes('kkt',j) = y.l(j);
xRes('dif',i) = abs(xRes('kkt',i) - xRes('emp',i));
yRes('dif',j) = abs(yRes('kkt',j) - yRes('emp',j));
Scalar t;
t = sum{i, xRes('dif',i)} + sum{j, yRes('dif',j)};
abort$(t > 1e-5) 'solutions do not agree', xRes, yRes;