>>107163368
For dataset captioning, if you have less than, say, 300 images, you won't want to use local models, just use the Gemini API. It's free anyway and can caption NSFW too as long as it isn't too spicy