Hacker News
jasondigitized | 28 days ago | on: System Card: Claude Mythos Preview [pdf]
What would be the incentive to engage in the tactic when the proof is in the pudding once the model hits the streets? Who would ultimately benefit from fudging these numbers?
m3kw9 | 27 days ago
Anthropic would definitely benefit, as benchmarks are almost always quite useless vs. real-life use.
jasondigitized | 27 days ago
How specifically would they benefit? People flock to them based on the hype, and then the model sucks and they leave?