SATA-Bench is a multi-domain benchmark designed for 'Select-all-that-apply' questions.
sata-bench
sata-bench
AI & ML interests
None yet
Recent Activity
commented on
a paper
11 days ago
The Personalization Trap: How User Memory Alters Emotional Reasoning in
LLMs
commented on
a paper
11 days ago
The Personalization Trap: How User Memory Alters Emotional Reasoning in
LLMs
upvoted
a
paper
11 days ago
The Personalization Trap: How User Memory Alters Emotional Reasoning in
LLMs
Organizations
None yet