AI agents wrong ~70% of time: Carnegie Mellon study

Jaden Norman@lemmy.world · 2 months ago

AI agents wrong ~70% of time: Carnegie Mellon study

Punkie@lemmy.world · 2 months ago

I’d compare LLMs to a junior executive. Probably gets the basic stuff right, but check and verify for anything important or complicated. Break tasks down into easier steps.

zbyte64@awful.systems · edit-2 2 months ago

A junior developer actually learns from doing the job, an LLM only learns when they update the training corpus and develop an updated model.

jumping_redditor@sh.itjust.works · 2 months ago

an llm costs less, and won’t compain when yelled at

zbyte64@awful.systems · 2 months ago

Why would you ever yell at an employee unless you’re bad at managing people? And you think you can manage an LLM better because it doesn’t complain when you’re obviously wrong?