ChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logic
www.tomshardware.com
@[email protected] to [email protected] • English • 1 month ago • 1735 upvotes • 125 comments
cross-posted to: [email protected], [email protected], [email protected], [email protected]
@[email protected] • English • 1 month ago • 7 points
They used ChatGPT 4o, instead of using o1 or o3. Obviously it was going to fail.
@[email protected] • English • 1 month ago (edited) • 1 point
Other studies (not all chess-based or against this old chess AI) show similarly lackluster results when using reasoning models.
Edit: That is, when comparing reasoning models to existing algorithmic solutions.