Nathan Lambert
natolambert
139 followers · 5 following
https://www.natolambert.com/
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
new activity · about 9 hours ago · allenai/Llama-3.1-Tulu-3-8B: Adding Evaluation Results
updated a model · 2 days ago · allenai/Llama-3.1-Tulu-3-70B
updated a model · 2 days ago · allenai/Llama-3.1-Tulu-3-8B
Articles
Ethics and Society Newsletter #4: Bias in Text-to-Image Models
Jun 26, 2023 • 2
Can foundation models label data like humans?
Jun 12, 2023 • 1
Creating a Coding Assistant with StarCoder
May 9, 2023 • 1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023 • 24
Red-Teaming Large Language Models
Feb 24, 2023 • 19
What Makes a Dialog Agent Useful?
Jan 24, 2023 • 1
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Dec 9, 2022 • 130
Stable Diffusion with 🧨 Diffusers
Aug 22, 2022 • 43
natolambert's activity
New activity in allenai/Llama-3.1-Tulu-3-8B · about 9 hours ago
Adding Evaluation Results
#3 opened 16 days ago by T145
New activity in allenai/reward-bench · 8 days ago
multilingual • 2
#8 opened 14 days ago by ehartford
New activity in allenai/reward-bench · about 1 month ago
add more contaminated models to the list • 2
#7 opened 3 months ago by arielgera
New activity in allenai/Llama-3.1-Tulu-3-70B · about 1 month ago
Reason behind not using special tokens in the prompt format? • 2
#2 opened about 2 months ago by Doctor-Shotgun
New activity in allenai/OLMo-2-1124-13B-Instruct-preview · about 1 month ago
What is that instruction template? • 1
#1 opened about 2 months ago by SerialKicked
New activity in allenai/Llama-3.1-Tulu-3-70B · about 2 months ago
Why do you use pass@10 to test coding perfmance... • 1
#4 opened about 2 months ago by Leon-Leee
New activity in allenai/OLMo-2-1124-13B-Instruct-preview · about 2 months ago
Has the data set been expanded? • 1
#2 opened about 2 months ago by win10
New activity in allenai/tulu-3-sft-personas-algebra · about 2 months ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by librarian-bot
New activity in allenai/tulu-3-sft-personas-math · about 2 months ago
Add link to Tulu 3 paper
#2 opened about 2 months ago by gabrielmbmb
New activity in allenai/llama-3.1-tulu-3-70b-preference-mixture · about 2 months ago
Librarian Bot: Add language metadata for dataset
#1 opened about 2 months ago by librarian-bot
New activity in allenai/llama-3.1-tulu-3-8b-preference-mixture · about 2 months ago
Easy way to separate permissive samples • 1
#1 opened about 2 months ago by RASMUS
New activity in allenai/tulu-3-sft-mixture · about 2 months ago
recommend filter • 1
#2 opened about 2 months ago by ehartford
NuminaMath-TIR License (Apache 2, not CC-BY-NC-4.0) • 1
#3 opened about 2 months ago by rbattle
New activity in allenai/Llama-3.1-Tulu-3-8B-RM · about 2 months ago
Adding `safetensors` variant of this model
#2 opened about 2 months ago by SFconvertbot
New activity in allenai/Llama-3.1-Tulu-3-70B-SFT · about 2 months ago
Adding Evaluation Results
#2 opened about 2 months ago by leaderboard-pr-bot
New activity in allenai/Llama-3.1-Tulu-3-8B-DPO · about 2 months ago
Adding `safetensors` variant of this model
#2 opened about 2 months ago by SFconvertbot
New activity in allenai/Llama-3.1-Tulu-3-70B-DPO · about 2 months ago
Adding `safetensors` variant of this model
#3 opened about 2 months ago by SFconvertbot
New activity in allenai/Llama-3.1-Tulu-3-70B · about 2 months ago
Spelling Error in Section 5.4 - "then" should be "than" • 1
#3 opened about 2 months ago by eliuakk
New activity in allenai/Llama-3.1-Tulu-3-8B · about 2 months ago
Feedback • 1
#2 opened about 2 months ago by KeyboardMasher
New activity in allenai/Llama-3.1-Tulu-3-8B-RM · about 2 months ago
Update README.md
#1 opened about 2 months ago by reach-vb