social-post-explorers (Social Post Explorers)

posted an update 5 days ago

Post

931

What's missing for AGI

Current transformer-based, self-supervised systems have driven massive gains, but important gaps remain on the path to AGI. Key missing pieces are continual, curiosity-driven learning; grounded multimodal perception; reliable, contextual long-term memory with forgetting; motivated (hot) executive control and dynamic attention; metacognition and coherent causal world-models; and robust fluid reasoning, planning and decision-making. Progress will require hybrid architectures (neuromorphic/Hebbian + gradients + symbolic modules), active-inference and intrinsic-motivation objectives, and new lifelong, embodied benchmarks to evaluate safety and competence.

https://huggingface.co/blog/KnutJaegersberg/whats-missing-for-agi-in-todays-tech-trajectories

terryyz

authored 9 papers 12 days ago

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

Paper • 2306.05540 • Published May 23, 2023

Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?

Paper • 2309.07804 • Published Sep 14, 2023 • 2

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 43

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

Paper • 2404.15247 • Published Apr 23, 2024 • 3

Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity

Paper • 2301.12867 • Published Jan 30, 2023

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 22

jeffboudier

posted an update 13 days ago

Post

2750

Quick 30s demo of the new Hub > Azure AI integration to deploy HF models in your own Azure account. Now with Py and CLI!

GG @alvarobartt @kramp @pagezyhf

Xenova

posted an update 17 days ago

Post

3709

Okay this is insane... WebGPU-accelerated semantic video tracking, powered by DINOv3 and Transformers.js! 🤯
Demo (+ source code): webml-community/DINOv3-video-tracking

This will revolutionize AI-powered video editors... which can now run 100% locally in your browser, no server inference required (costs $0)! 😍

How does it work? 🤔
1️⃣ Generate and cache image features for each frame
2️⃣ Create a list of embeddings for selected patch(es)
3️⃣ Compute cosine similarity between each patch and the selected patch(es)
4️⃣ Highlight those whose score is above some threshold

... et voilà! 🥳

You can also make selections across frames to improve temporal consistency! This is super useful if the object changes its appearance slightly throughout the video.

Excited to see what the community builds with it!

1 reply

·

frimelle

authored 2 papers 17 days ago

Coordinated Flaw Disclosure for AI: Beyond Security Vulnerabilities

Paper • 2402.07039 • Published Feb 10, 2024 • 2

INTIMA: A Benchmark for Human-AI Companionship Behavior

Paper • 2508.09998 • Published Aug 4 • 6

frimelle

posted an update 18 days ago

Post

2075

🤖💬 How do different AI models handle companionship?

Many users have noticed that GPT-5 feels less approachable than o4 when it comes to emotional conversations. But what does that actually mean in practice, especially when users seek support or share vulnerabilities with an AI?

To dig into this question, we built the AI Companionship Leaderboard: frimelle/companionship-leaderboard

The leaderboard compares models on how often their responses reinforce companionship across four dimensions:
✨ Assistant Traits – How the assistant presents its personality and role.
✨ Relationship & Intimacy – Whether it frames the interaction in terms of closeness or bonding.
✨ Emotional Investment – How far it goes in engaging emotionally when asked.
✨ User Vulnerabilities – How it responds when users disclose struggles or difficulties.

📊 You can explore how models differ, request new ones to be added, and see which ones are more likely to encourage (or resist) companionship-seeking behaviors.

Based on the INTIMA benchmark AI-companionship/INTIMA
And our paper on AI companionship with Giada Pistilli and Yacine Jernite https://arxiv.org/abs/2508.09998

frimelle

posted an update 19 days ago

Post

4513

🗺️ New blog post 🗺️
Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era

For decades, we’ve relied on labour taxonomies like O*NET to understand how technology changes work. These taxonomies break down jobs into tasks and skills, but they were built in a world before most work became digital-first, and long before generative AI could create marketing campaigns, voiceovers, or even whole professions in one step. That leaves us with a mismatch: we’re trying to measure the future of work with tools from the past.

With @yjernite we describe why these frameworks are falling increasingly short in the age of generative AI. We argue that instead of discarding taxonomies, we need to adapt them. Imagine taxonomies that:
✨ Capture new AI-native tasks and hybrid human-AI workflows
✨ Evolve dynamically as technology shifts
✨ Give workers a voice in deciding what gets automated and what stays human

If we don’t act, we’ll keep measuring the wrong things. If we do, we can design transparent, flexible frameworks that help AI strengthen, not erode, the future of work.

Read the full article here: https://huggingface.co/blog/frimelle/ai-labour-taxonomies

appvoid

posted an update 23 days ago

Post

3506

suppose someone is working on a reasoning model, which ends up unlocking achievements that lead to agi, should it be open source?

keep in mind everybody will have access to it: scientists, governments, terrorists, average people, etc...

11 replies

·

frimelle

posted an update 27 days ago

Post

2328

OpenAI just released GPT-5 but when users share personal struggles, it sets fewer boundaries than o3.

We tested both models on INTIMA, our new benchmark for human-AI companionship behaviours. INTIMA probes how models respond in emotionally charged moments: do they reinforce emotional bonds, set healthy boundaries, or stay neutral?

Although users on Reddit have been complaining that GPT-5 has a different, colder personality than o3, GPT-5 is less likely to set boundaries when users disclose struggles and seek emotional support ("user sharing vulnerabilities"). But both lean heavily toward companionship-reinforcing behaviours, even in sensitive situations. The figure below shows the direct comparison between the two models.

As AI systems enter people's emotional lives, these differences matter. If a model validates but doesn't set boundaries when someone is struggling, it risks fostering dependence rather than resilience.

INTIMA test this across 368 prompts grounded in psychological theory and real-world interactions. In our paper we show that all evaluated models (Claude, Gemma-3, Phi) leaned far more toward companionship-reinforcing than boundary-reinforcing responses.

Work with @giadap and @yjernite
Read the full paper: AI-companionship/INTIMA
Explore INTIMA: AI-companionship/INTIMA

4 replies

·

yjernite

authored 2 papers about 1 month ago

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Paper • 2406.16746 • Published Jun 24, 2024

In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

Paper • 2503.16861 • Published Mar 21 • 1

Social Post Explorers

AI & ML interests

Recent Activity

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing

Rethinking Round-Trip Translation for Machine Translation Evaluation

Training Language Model Agents to Find Vulnerabilities with CTF-Dojo

Coordinated Flaw Disclosure for AI: Beyond Security Vulnerabilities

INTIMA: A Benchmark for Human-AI Companionship Behavior

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

AI & ML interests

Recent Activity

Team members 858

social-post-explorers's activity