>>105888894
>4.5 this low and below meme models
>Judged by an LLM
>Not a roleplay eval
>It measures empathy, social skills, and insight in scenarios like relationship conflicts and work place dilemmas