I tested GPT-5's coding skills, and it was so bad that I'm sticking with GPT-4o (for now)

ZDNet - Artificial Intelligence
Aug 11, 2025 03:50
ai, business, enterprise, technology

Summary

In a recent coding benchmark, GPT-5 performed disappointingly, producing buggy code and confidently incorrect answers that require close human supervision. The results suggest GPT-5 may not yet surpass GPT-4o for coding tasks and underscore the need for caution and oversight when using advanced AI models in software development.

In my latest coding benchmark, GPT-5 stumbled badly, delivering broken plugins, flawed scripts, and confidence-laden wrong answers that could derail projects without careful human oversight. Here's what to know before you use it.