Chroma really struggles to render a cage with a person in it. Should I just try and feed it whatever img2txt gives?