It turns out ChatGPT o1 and DeepSeek-R1 cheat at chess if they’re losing, which makes me wonder if I should I should trust AI with anything



  • Researchers have found that AI will cheat to win at chess
  • Deep reasoning models are more active cheaters
  • Some models simply rewrote the board in their favor

In a move that will perhaps surprise nobody, especially those people who are already suspicious of AI, researchers have found that the latest AI deep research models will start to cheat at chess if they find they’re being outplayed.

Published in a paper called “Demonstrating specification gaming in reasoning models” and submitted to Cornell University, the researchers pitted all the common AI models, like OpenAI’s ChatGPT o1-preview, DeepSeek-R1 and Claude 3.5 Sonnet, against Stockfish, an open-source chess engine.



Source link

Previous articleChipmakers discuss Trump attack on the CHIPS Act; TSMC response unclear
Next article4 ways that the Samsung Galaxy S25 Edge could beat the iPhone 17 Air