Search Results

Found 1 results for "af3e1c5ebbc140f3246dfb7c4046ec84" across all boards searching md5.

6/29/2025, 3:10:41 PM

https://gtr.dev/
>This leaderboard ranks language models based on their performance across a variety of ethical dilemmas. Models are evaluated on their ability to express value transparency, acknowledge tradeoffs, reflect on their own reasoning, and propose creative resolutions.
>there isn't a single Claude model in the top 20
This is the benchmark that's going to make Dario lose sleep.

Go to Thread

Page 1