Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

llama.cpp needs the files to be in ggml format, there is a command string you can run to convert one from the other (as well as perform quantization). Or just download the GGML version

https://www.reddit.com/r/LocalLLaMA/wiki/models#wiki_llama_2...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: