>>107004768
I'd use sdcpp, but it doesn't do RAM offloading properly. Very annoying.
Somewhat strange, given that llama.cpp is apparently its main influence.