Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Workaccount2
10 months ago
|
parent
|
context
|
favorite
| on:
Gemini-2.5-pro-preview-06-05
People can ask whatever they want on LMarena, so a question like "List some good snacks to bring to work" might elicit a win for a old/tiny/deprecated model simply because it lists the snack the user liked more.
AstroBen
10 months ago
[–]
are you saying that's a bad way to judge a model? Not sure why we'd want ones that choose bad snacks
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: