Pierre Dulac

dulacp

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

google/gemma-4-31B-it

liked a model about 1 month ago

mistralai/Devstral-Small-2-24B-Instruct-2512

upvoted an article 8 months ago

Introducing ColQwen-Omni: Retrieve in every modality

upvoted an article 8 months ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

Jul 17, 2025

upvoted a paper 11 months ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

upvoted a paper 12 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99

upvoted 4 papers about 1 year ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 141

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Paper • 2502.15657 • Published Feb 21, 2025 • 5

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18, 2025 • 58

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 161

upvoted an article about 1 year ago

Article

Open R1: Update #2

Feb 10, 2025 • 218

upvoted a collection over 1 year ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 173

upvoted 2 articles over 1 year ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024 • 1.19k

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025 • 888

upvoted 3 papers over 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 448

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 86

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

upvoted a collection over 1 year ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 90

upvoted 5 papers over 1 year ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 39
