Jaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 8 months agoAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comexternal-linkmessage-square85linkfedilinkarrow-up1578arrow-down114
arrow-up1564arrow-down1external-linkAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comJaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 8 months agomessage-square85linkfedilink
minus-squareOutbound7404@lemmy.mllinkfedilinkEnglisharrow-up3arrow-down3·8 months agoA human can review something close to correct a lot better than starting the task from zero.
minus-squareDreamlandLividity@lemmy.worldlinkfedilinkEnglisharrow-up9·8 months agoIt is a lot harder to notice incorrect information in review, than making sure it is correct when writing it.
minus-squareMangoCats@feddit.itlinkfedilinkEnglisharrow-up2arrow-down1·8 months ago harder to notice incorrect information in review, than making sure it is correct when writing it. That depends entirely on your writing method and attention span for review. Most people make stuff up off the cuff and skim anything longer than 75 words when reviewing, so the bar for AI improving over that is really low.
minus-squareMangoCats@feddit.itlinkfedilinkEnglisharrow-up4·8 months agoIn University I knew a lot of students who knew all the things but “just don’t know where to start” - if I gave them a little direction about where to start, they could run it to the finish all on their own.
A human can review something close to correct a lot better than starting the task from zero.
It is a lot harder to notice incorrect information in review, than making sure it is correct when writing it.
That depends entirely on your writing method and attention span for review.
Most people make stuff up off the cuff and skim anything longer than 75 words when reviewing, so the bar for AI improving over that is really low.
In University I knew a lot of students who knew all the things but “just don’t know where to start” - if I gave them a little direction about where to start, they could run it to the finish all on their own.