@[email protected] to [email protected]English • 10 days agoChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logicwww.tomshardware.comexternal-linkmessage-square121fedilinkarrow-up1726cross-posted to: [email protected][email protected][email protected][email protected]
arrow-up1726external-linkChatGPT 'got absolutely wrecked' by Atari 2600 in beginner's chess match — OpenAI's newest model bamboozled by 1970s logicwww.tomshardware.com@[email protected] to [email protected]English • 10 days agomessage-square121fedilinkcross-posted to: [email protected][email protected][email protected][email protected]
minus-square@[email protected]linkfedilinkEnglish7•9 days agoThey used ChatGPT 4o, instead of using o1 or o3. Obviously it was going to fail.
minus-square@[email protected]linkfedilinkEnglish1•edit-29 days agoOther studies (not all chess based or against this old chess AI) show similar lackluster results when using reasoning models. Edit: When comparing reasoning models to existing algorithmic solutions.
They used ChatGPT 4o, instead of using o1 or o3.
Obviously it was going to fail.
Other studies (not all chess based or against this old chess AI) show similar lackluster results when using reasoning models.
Edit: When comparing reasoning models to existing algorithmic solutions.