I have 2 of them. I would advise against it if you want to run things like vllm. I've had the cards for months and I still haven't been able to create a uv env with trl and vllm. vllm works fine in Docker for some models: with one GPU, gpt-oss 20b decodes at a cumulative 600-800 tok/s with 32 concurrent requests, depending on context length, but I was getting trash performance out of qwen3.5 and Gemma4.
If I were to do it again, I’d probably just get a dgx spark. I don’t think it’s been worth the hassle.