Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Another suggestion for optimizing local inference - the Hermes team talks a lot on X about how much better results are when you use custom parsers tuned to the nuances of each model. Some models might like to use a trailing `,` in JSON output, some don't - so if your parser can handle the quirks of the specific model, then you get higher-performing functionality.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: