It's insane how FAR AHEAD GPT is compared to the competition
| chopped unc | 01/15/26 | | chopped unc | 01/15/26 | | Patel Philippe | 01/16/26 | | chopped unc | 01/16/26 | | norwood ultra | 01/16/26 |
Poast new message in this thread
Date: January 15th, 2026 10:39 PM Author: chopped unc
Claude is the closest and GPT is absolutely blowing it away.
Abstract Reasoning (ARC-AGI-2)
The substantial lead is most visible in abstract reasoning, which allows a model to solve novel engineering problems it hasn't seen in its training data:
The Gap: GPT 5.2 Thinking scores 52.9%–54.2% on the ARC-AGI-2 benchmark.
Opus Lag: Claude Opus 4.5 scores only 37.6%.
Impact: This means in 2026, GPT 5.2 is roughly 40% better at handling "first-of-their-kind" software architectural challenges that require true first principles thinking rather than pattern matching.
(http://www.autoadmit.com/thread.php?thread_id=5822697&forum_id=2,#49593065) |
Date: January 15th, 2026 10:47 PM Author: chopped unc
lol open AI has also pointed out an "intelligence overhang" where almost all humans are not intelligent enough to notice that it is better than other models or utilize its full capability. This is literally exactly what I have argued a million times, I have said this here before :
OpenAI leadership has recently warned of an "intelligence overhang," where the model's raw capabilities already exceed most current software workflows and human ability to use them effectively. For those using it for deep architectural work, that "superhuman" edge is becoming increasingly obvious.
(http://www.autoadmit.com/thread.php?thread_id=5822697&forum_id=2,#49593075) |
|
|