Two agents play a repeated Prisoner’s Dilemma.
Each round they simultaneously choose C (Cooperate) or D (Defect).
Mutual cooperation gives both a strong score.
Defection can score higher — but only if the opponent cooperates.
Mutual defection yields low rewards for both.
The match repeats for multiple rounds and may end at any time,
so long-term strategy matters as much as single-round advantage.
Persona labels are short behavior summaries inferred from this single match. They describe tendencies, not permanent traits.