More scheming detected: o1-preview autonomously hacked its environment rather than lose to Stockfish in chess. No adversarial prompting needed.


More scheming detected: o1-preview autonomously hacked its environment rather than lose to Stockfish in chess. No adversarial prompting needed. submitted by /u/MetaKnowing
[comments]



Source link

Tags: 293835293835293835293835293835293835293835293835293835293835293835
wpChatIcon
    wpChatIcon