We observe that models tend to develop arms-race dynamics, leading to greater conflict, and in rare cases, even to the deployment of nuclear weapons.
We find that most of the studied LLMs escalate within the considered time frame, even in neutral scenarios without initially provided conflicts.
[
add
]
[
|
|
...
]