Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
boroboro4
on April 8, 2024
|
parent
|
context
|
favorite
| on:
Groq CEO: 'We No Longer Sell Hardware'
Name one use case where there is a difference between latency of 200 t/s (fireworks.ai mixtral model) and 500 t/s (groq mixtral)? Not throughput and not time to first token, but latency.
Groq model shines at latency, not at the other two.
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
Groq model shines at latency, not at the other two.