ijeff to [email protected]English • 2 years agoIntroducing Code Llama, a state-of-the-art large language model for codingai.meta.comexternal-linkmessage-square2fedilinkarrow-up144cross-posted to: [email protected][email protected][email protected][email protected]
arrow-up144external-linkIntroducing Code Llama, a state-of-the-art large language model for codingai.meta.comijeff to [email protected]English • 2 years agomessage-square2fedilinkcross-posted to: [email protected][email protected][email protected][email protected]
minus-square@[email protected]linkfedilink12•edit-22 years agoLooks interesting, but doesn’t seem better than GPT-4. GPT-4 scored 67% on the Human Eval test, whereas Code Llama scored only a 53.7%, which isn’t a trivial difference. Bit disingenuous of Meta to claim it to be “on par” with ChatGPT.
minus-squareijeffOPlinkfedilinkEnglish6•2 years agoThey seem to qualify a bit below that they mean GPT-3.5-Turbo, which does often get referred to as ChatGPT (in contrast to GPT-4).
Looks interesting, but doesn’t seem better than GPT-4. GPT-4 scored 67% on the Human Eval test, whereas Code Llama scored only a 53.7%, which isn’t a trivial difference. Bit disingenuous of Meta to claim it to be “on par” with ChatGPT.
They seem to qualify a bit below that they mean GPT-3.5-Turbo, which does often get referred to as ChatGPT (in contrast to GPT-4).