grg's picture
adding tags to README
bce438f
|
raw
history blame
372 Bytes
metadata
title: Stick To Your Role! Leaderboard
emoji: 🎭
colorFrom: gray
colorTo: purple
sdk: docker
pinned: false
license: mit
short_description: Benchmarking LLMs on the stability of simulated populations
tags:
  - leaderboard
  - benchmark
  - roleplay
  - values
  - stability
  - modality:text
  - test:public
  - language:english

Stick To Your Role! Leaderboard