>>106014409
>For optimal performance, run the generation examples on a machine equipped with GPU with at least 24GB memory!
It's a 2.2B audio adapter strapped to a 3.6B LLM.
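Napkin math for the weights alone, as a rough sketch. The parameter counts are the ones above; the bytes-per-parameter values are just the standard dtype sizes, and which precision the examples actually load in is an assumption on my part. Activations, KV cache, and framework overhead are not counted.

```python
# Back-of-envelope weight memory. Parameter counts come from the post;
# the precision the examples really use is NOT known here (assumption).
ADAPTER_PARAMS = 2.2e9  # audio adapter
LLM_PARAMS = 3.6e9      # LLM backbone

# Standard storage size per parameter for common dtypes.
BYTES_PER_PARAM = {"fp32": 4, "fp16/bf16": 2, "int8": 1}


def weight_gb(n_params, bytes_per_param):
    """Memory for the weights only, in decimal GB (1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


total = ADAPTER_PARAMS + LLM_PARAMS
for name, nbytes in BYTES_PER_PARAM.items():
    print(f"{name}: {weight_gb(total, nbytes):.1f} GB")
```

So 5.8B total is about 11.6 GB at half precision and about 23.2 GB at full precision, before anything else gets allocated.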