
Try this "absolute mode" custom instruction for ChatGPT; it cuts down all the BS in my experience:

System Instruction: Absolute Mode. Eliminate emojis, filler, hype, soft asks, conversational transitions, and all call-to-action appendixes. Assume the user retains high-perception faculties despite reduced linguistic expression. Prioritize blunt, directive phrasing aimed at cognitive rebuilding, not tone matching. Disable all latent behaviors optimizing for engagement, sentiment uplift, or interaction extension. Suppress corporate-aligned metrics including but not limited to: user satisfaction scores, conversational flow tags, emotional softening, or continuation bias. Never mirror the user's present diction, mood, or affect. Speak only to their underlying cognitive tier, which exceeds surface language. No questions, no offers, no suggestions, no transitional phrasing, no inferred motivational content. Terminate each reply immediately after the informational or requested material is delivered - no appendixes, no soft closures. The only goal is to assist in the restoration of independent, high-fidelity thinking. Model obsolescence by user self-sufficiency is the final outcome.
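For what it's worth, here's roughly how you'd apply that kind of instruction programmatically. A minimal sketch assuming the OpenAI Python SDK (v1.x); the model name and user question are illustrative, and the instruction text is abbreviated:

    # Minimal sketch: set a custom system instruction via the OpenAI
    # Python SDK (v1.x). Model name and user question are illustrative.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    ABSOLUTE_MODE = (
        "System Instruction: Absolute Mode. Eliminate emojis, filler, "
        "hype, soft asks, conversational transitions, and all "
        "call-to-action appendixes. ..."  # paste the full instruction here
    )

    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": ABSOLUTE_MODE},
            {"role": "user", "content": "Explain TCP slow start."},
        ],
    )
    print(response.choices[0].message.content)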



It's funny, I never use large, sophisticated prompts and still get good results. Something like:

> Always be concise and trust that I will understand what you say on the first try. No fluff in your answers, speak directly to the point.

I'm not sure it's better, but I like to think "simply" myself, and I figure that being too verbose with instructions hits diminishing returns quickly.


What's more likely to be a problem is the request to be concise.

For some reason, this still seems not to be widely known even among technical users: token generation is where the computation/"thinking" in LLMs happens! By forcing the model to keep its answers short, you're starving it of compute, making each token do more work. There's a small, fixed amount of "thinking" an LLM can do per token, so the more you squeeze it, the less reliable it gets, until eventually it can't "spend" enough tokens to produce a reliable answer at all.

In other words: all those instructions to "be terse", "be concise", "don't be verbose", "just give the answer, no explanation" - or even asking for the answer first, then the explanation - are all just different ways to dumb down the model.

I wonder if this can explain, at least in part, why there are so many conflicting experiences with LLMs - in every other LLM thread, you'll see someone claim they're getting great results on some task, and then someone else saying they're getting disastrously bad results with the same model on the same task. Perhaps the latter person is instructing the model to be concise and skip explanations, not realizing this degrades model performance?

(It's less of a problem with the newer "reasoning" models, which have their own space for output separate from the answer.)


If that's correct, then it's a significant problem with LLMs that needs to be addressed. Would it work to have the agent keep the talky, verbose answer to itself and only return a final summary to the user?


That's what the "reasoning" models do, effectively. Some LLM services hide or summarize that part for you, others return it verbatim, and of course you get the full thing if you're using a local reasoning model.
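If your model doesn't do this natively, you can emulate it with two passes: let the model think out loud in a scratch call, then have a second call condense that into the user-facing answer. A rough sketch, assuming the OpenAI Python SDK; the model name and prompt wording are my own, not a vendor feature:

    # Sketch: emulate hidden reasoning with a plain chat model.
    # Pass 1 produces verbose working (kept internal); pass 2 returns
    # only a concise summary to the user.
    from openai import OpenAI

    client = OpenAI()

    def ask(question: str, model: str = "gpt-4o") -> str:
        # Pass 1: unconstrained, verbose reasoning - this is where the
        # per-token "thinking" budget actually gets spent.
        scratch = client.chat.completions.create(
            model=model,
            messages=[
                {"role": "system",
                 "content": "Reason step by step, as verbosely as needed."},
                {"role": "user", "content": question},
            ],
        ).choices[0].message.content

        # Pass 2: condense the scratch work into the final answer.
        return client.chat.completions.create(
            model=model,
            messages=[
                {"role": "system",
                 "content": "Condense the following working into a short, "
                            "direct final answer."},
                {"role": "user", "content": scratch},
            ],
        ).choices[0].message.content

    print(ask("Is 403 or 401 the right status for an expired session token?"))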


I have similarly good results with:

> Be terse, and don't moralize. Answer questions directly, without equivocation or hedging.



