Apple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.

AbuTahir@lemm.ee · edit-2 2 months ago

MCasq_qsaCJ_234@lemmy.zip · 2 months ago

In fact, simple computer programs do a great job of solving these puzzles…

If an AI is trained to do this, it will be very good, like for example when a GPT-2 was trained to multiply numbers up to 20 digits.

Here they do the same test to GPT-4o, o1-mini and o3-mini