In recent years, the use of AI has grown exponentially but some systems have learned how to be deceitful, even if they have been trained to be helpful and honest, scientists have said.
While it may seem harmless if AI systems cheat at games, it can lead to 'breakthroughs in deceptive AI capabilities' that can spiral into more advanced forms of AI deception in the future, the experts said.
Even though the AI was trained to be 'largely honest and helpful' and 'never intentionally backstab' its human allies, data shows it didn't play fair and had learned to be a master of deception.
This suggests AI could 'lead humans into a false sense of security,' the authors said.
Collection
[
|
...
]