
Hugin

(37,055 posts)
4. My own cursory queries bear out the approximate 30% error rate.
Fri Oct 31, 2025, 02:01 AM
However, the error rate is non-linear, especially when a query (a prompt, to use the lingo) involves context switching. In those cases the results are often little more than word salad.

Project management is littered with context switching.

Generative AI is incapable of chewing gum and walking at the same time and probably never will be. Sure, there are methods of mitigation, but they all involve HITL (Human In The Loop). The captains of industry are going to balk at having numerous humans prompting the AI systems when those people could be out building real-world buildings.
