Nathan Lambert
natolambert
139 followers · 5 following
https://www.natolambert.com/
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
new activity • about 8 hours ago • allenai/Llama-3.1-Tulu-3-8B: Adding Evaluation Results
updated a model • 2 days ago • allenai/Llama-3.1-Tulu-3-70B
updated a model • 2 days ago • allenai/Llama-3.1-Tulu-3-8B
Articles
Ethics and Society Newsletter #4: Bias in Text-to-Image Models • Jun 26, 2023 • 2
Can foundation models label data like humans? • Jun 12, 2023 • 1
Creating a Coding Assistant with StarCoder • May 9, 2023 • 1
StackLLaMA: A hands-on guide to train LLaMA with RLHF • Apr 5, 2023 • 24
Red-Teaming Large Language Models • Feb 24, 2023 • 19
What Makes a Dialog Agent Useful? • Jan 24, 2023 • 1
Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022 • 130
Stable Diffusion with 🧨 Diffusers • Aug 22, 2022 • 43
natolambert's activity
New activity in allenai/Llama-3.1-Tulu-3-8B • about 8 hours ago
Adding Evaluation Results • #3 opened 16 days ago by T145
updated 2 models • 2 days ago
allenai/Llama-3.1-Tulu-3-70B • Text Generation • Updated 2 days ago • 4.65k • 46
allenai/Llama-3.1-Tulu-3-8B • Text Generation • Updated about 8 hours ago • 4.4k • 117
upvoted an article • 2 days ago
Article • Putting RL back in RLHF • Jun 12, 2024 • 69
updated a collection • 3 days ago
2025 Artifacts • Collection • 13 items • Updated 3 days ago • 2
liked 2 models • 3 days ago
internlm/internlm3-8b-instruct • Text Generation • Updated 2 days ago • 1.59k • 162
ibm-granite/granite-3.1-2b-instruct • Text Generation • Updated 29 days ago • 13.6k • 26
liked a model • 4 days ago
MiniMaxAI/MiniMax-Text-01 • Text Generation • Updated 1 day ago • 1.53k • 372
updated a collection • 4 days ago
2025 Artifacts • Collection • 13 items • Updated 3 days ago • 2
liked a model • 4 days ago
openbmb/MiniCPM-o-2_6 • Any-to-Any • Updated about 21 hours ago • 7.05k • 544
liked a model • 4 days ago
Qwen/Qwen2.5-Math-PRM-72B • Text Classification • Updated about 22 hours ago • 358 • 51
liked a dataset • 4 days ago
lmarena-ai/PPE-Human-Preference-V1 • Viewer • Updated Oct 22, 2024 • 16k • 130 • 7
updated a collection • 5 days ago
2025 Artifacts • Collection • 13 items • Updated 3 days ago • 2
liked a dataset • 5 days ago
PRIME-RL/Eurus-2-SFT-Data • Viewer • Updated 16 days ago • 230k • 205 • 9
liked a model • 5 days ago
kyutai/helium-1-preview-2b • Text Generation • Updated 4 days ago • 2.7k • 110
updated a dataset • 8 days ago
allenai/reward-bench-results • Updated 8 days ago • 10.9k • 2