Would you ever trust an AI agent running your business? As hilarious as this sma...

keymon-o · 2025-06-27T20:37:53 1751056673

I’ve just written a small anecdote with GPT3.5, where it lost count of a trivial item quantity it was incrementing, in just a few prompts. It might get better by orders of magnitude from now on, but who’s gonna pay for ‘that one eventual mistake’?

croemer · 2025-06-27T20:48:18 1751057298

GPT3.5? Did you mean to send this 2 years ago?

keymon-o · 2025-06-27T20:55:22 1751057722

Maybe. Did LLMs stop with hallucinations and errors 2 years ago?

marinmania · 2025-06-27T20:35:28 1751056528

It does seem far more straightforward to say "Write code that deterministically orders food items that people want and sends invoices etc."

I feel like that's more the future. Having an agent sorta make random choices feels like LLMs attempting to do math, instead of LLMs attempting to call a calculator.

keymon-o · 2025-06-27T20:50:23 1751057423

Every output that is going to be manually verified by a professional is a safe bet.

People forget that we use computers for accuracy, not smarts. Smarts make mistakes.

standardUser · 2025-06-27T21:13:13 1751058793

Right, but if we limit the scope too much we quickly arrive at the point where 'dumb' autonomy is sufficient instead of using the world's most expensive algorithms.

throwacct · 2025-06-27T20:41:14 1751056874

I don't think any decision maker will let LLMs run their business. If the LLMs fail, you could potentially lose your livelihood.