Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

While your throughout is around 2x you still cost more then vercel ai model pricing for example for GLM-5: https://vercel.com/ai-gateway/models?q=glm

Is this a result of renting more expensive gpus?



Yes, we operate on GB200s and GH200s. Usually we are cheaper for many models and can get up to double the TPS.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: