
LLMs became sycophantic and effusive because those responses were rated higher during RLHF, and this continued until their obvious eagerness to please became newsworthy. So yes, being factually correct and "intelligent" was already not the only priority.

