Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

No idea, I just said I wanted to try this out and see how it performs.

Doesn’t VRAM amount limit the size of the model you can load? I’m not talking about training just inference. I also pointed out these are not the greatest GPUs available, just that the advantage they have is being able to address more memory since on those machines is a shared block between system and GPU.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: