ChatGPT Answers Programming Questions Incorrectly 52% of the Time: Study

ForgottenFlux@lemmy.world · 6 months ago

ChatGPT Answers Programming Questions Incorrectly 52% of the Time: Study

NounsAndWords@lemmy.world · 6 months ago

GPT-2 came out a little more than 5 years ago, it answered 0% of questions accurately and couldn’t string a sentence together.

GPT-3 came out a little less than 4 years ago and was kind of a neat party trick, but I’m pretty sure answered ~0% of programming questions correctly.

GPT-4 came out a little less than 2 years ago and can answer 48% of programming questions accurately.

I’m not talking about mortality, or creativity, or good/bad for humanity, but if you don’t see a trajectory here, I don’t know what to tell you.

Eheran@lemmy.world · 6 months ago

The study is using 3.5, not version 4.

phoneymouse@lemmy.world · 6 months ago

4 produces inaccurate programming answers too

Eheran@lemmy.world · 6 months ago

Obviously. But it is FAR better yet again.

phoneymouse@lemmy.world · 6 months ago

Not really. I ask it questions all the time and it makes shit up.

Eheran@lemmy.world · 6 months ago

Yes. But it is better than 3.5 without any doubt.

egeres@lemmy.world · 6 months ago

Lemmy seems to be very near-sighted when it comes to the exponential curve of AI progress, I think this is an effect because the community is very anti-corp

Knock_Knock_Lemmy_In@lemmy.world · 6 months ago

In what year do you estimating AI will have 90% accuracy?

NounsAndWords@lemmy.world · 6 months ago

No clue? Somewhere between a few years (assuming some unexpected breakthrough) or many decades? The consensus from experts (of which I am not) seems to be somewhere in the 2030s/40s for AGI. I’m guessing accuracy probably will be more on a topic by topic basis, LLMs might never even get there, or only related to things they’ve been heavily trained on. If predictive text doesn’t do it then I would be betting on whatever Yann LeCun is working on.