Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
14
Ahmed Benalal
jqop
Follow
luisdgd07's profile picture
Mi6paulino's profile picture
21world's profile picture
6 followers
·
107 following
AI & ML interests
Deep learning
Recent Activity
upvoted
an
article
12 days ago
Introducing smolagents: simple agents that write actions in code.
reacted
to
frimelle
's
post
with 👍
about 1 month ago
OpenAI just released GPT-5 but when users share personal struggles, it sets fewer boundaries than o3. We tested both models on INTIMA, our new benchmark for human-AI companionship behaviours. INTIMA probes how models respond in emotionally charged moments: do they reinforce emotional bonds, set healthy boundaries, or stay neutral? Although users on Reddit have been complaining that GPT-5 has a different, colder personality than o3, GPT-5 is less likely to set boundaries when users disclose struggles and seek emotional support ("user sharing vulnerabilities"). But both lean heavily toward companionship-reinforcing behaviours, even in sensitive situations. The figure below shows the direct comparison between the two models. As AI systems enter people's emotional lives, these differences matter. If a model validates but doesn't set boundaries when someone is struggling, it risks fostering dependence rather than resilience. INTIMA test this across 368 prompts grounded in psychological theory and real-world interactions. In our paper we show that all evaluated models (Claude, Gemma-3, Phi) leaned far more toward companionship-reinforcing than boundary-reinforcing responses. Work with @giadap and @yjernite Read the full paper: https://huggingface.co/datasets/AI-companionship/INTIMA/blob/main/Companionship_Benchmark.pdf Explore INTIMA: https://huggingface.co/datasets/AI-companionship/INTIMA
liked
a model
about 2 months ago
Qwen/Qwen3-Coder-480B-A35B-Instruct
View all activity
Organizations
jqop
's datasets
1
Sort: Recently updated
jqop/python-code-dataset
Viewer
•
Updated
Jan 8
•
457k
•
18