Tuned Qwen 3.5 27B beats Step 3.5 on almost all benchmarks, so the point about t...

tempaccount420 · 2026-04-01T21:21:01 1775078461

Benchmarks aren't relevant for deciding the "size class". Bigger size means more knowledge. Also, Qwen 3.5 27B is a dense model with all 27B parameters active, while StepFun 3.5 Flash has 11B active parameters.

lostmsu · 2026-04-01T21:55:14 1775080514

> Bigger size means more knowledge.

Qwen 3.5 27B beats StepFun 3.5 Flash on GPQA Diamond too, so probably not.

tarruda · 2026-04-02T11:23:28 1775129008

Benchmarks don't tell the whole story. For one-shot coding tasks, I found Step 3.5 Flash to be even stronger than Qwen 3.5 397B.

anentropic · 2026-04-02T18:02:42 1775152962

Benchmarks don't tell the whole story... for that you need anecdotes from random HN posters :)