
Ollama only uses llama.cpp for running legacy models. gpt-oss runs entirely in the Ollama engine.

You don't need to use Turbo mode; it's just there for people who don't have capable enough GPUs.


