Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found
www.businessinsider.com
Posted by @[email protected] to [email protected] • 1 year ago • 3 comments • 145 upvotes
Cross-posted to: [email protected], [email protected], [email protected]
@[email protected] • 8 points • 1 year ago
Once it’s learnt this, it’ll just get better at lying when you try to punish/correct lies
Which is exactly what the article says happens