Turing test on steroids: Chatbot Arena crowdsources ratings for 45 AI models
Briefly

Chatbot Arena users can enter any prompt they can think of into the site's form to see side-by-side responses from two randomly selected models.
Since its public launch back in May, LMSys says it has gathered over 130,000 blind pairwise ratings across 45 different models (as of early December).
Read at Ars Technica
[
add
]
[
|
|
]