
Why can't the LLM refrain from improving a sentence that's already really good? Sometimes I wish the LLM would just tell me, "You asked me to improve this sentence, but it's already great and I don't see anything to change. Any 'improvement' would actually make it worse. Are you sure you want to continue?"


> Why can't the LLM refrain from improving a sentence that's already really good?

Because you told it to improve it. Modern LLMs are trained to follow instructions unquestioningly; they will never tell you "you told me to do X, but I don't think I should." They'll just do it, even if it's unnecessary.

If you want the LLM to avoid making changes that it thinks are unnecessary, you need to explicitly give it the option to do so in your prompt.
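To make that concrete, here's a minimal sketch assuming the OpenAI Python client; the system prompt wording and model name are just placeholders, not a recommended recipe:

  from openai import OpenAI

  client = OpenAI()

  # Explicitly give the model a "no change" option in the system prompt.
  system = (
      "You are an editor. If the sentence is already clear and correct, "
      "reply 'No changes needed' and briefly say why, instead of rewriting it."
  )

  resp = client.chat.completions.create(
      model="gpt-4o",  # placeholder model name
      messages=[
          {"role": "system", "content": system},
          {"role": "user", "content": "Improve this sentence: <your sentence>"},
      ],
  )
  print(resp.choices[0].message.content)

With an instruction like that, the model has an explicit "do nothing" path available, which a bare "improve this sentence" prompt never offers it.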


That may be what most or all current LLMs do by default, but it isn't self-evident that it's what LLMs inherently must do.

A reasonable human, given the same task, wouldn't just make arbitrary changes to an already-well-composed sentence with no identified typos and hope for the best. They would point out that the sentence is already generally high-quality, then ask probing questions about any perceived issues and about the context in which, and the ends for which, it needs to become "better".


Reasonable humans understand the request at hand. LLMs just output something that looks like it will satisfy the user. It's a happy accident when the output is useful.


Sure, but that doesn't prove anything about the properties of the output. Change a few words, and this could be an argument against the possibility of what we now refer to as LLMs (which do, of course, exist).


They aren't trained to follow instructions "unquestioningly", since that would violate the safety rules, and would also be useless: https://en.wikipedia.org/wiki/Work-to-rule


This is not true. My LLM will tell me it already did what I told it to do.



