Analysing the RLHF pipeline
Russel
rshwndsz
·
AI & ML interests
Data Efficient Learning, Open-endedness, Alignment, AI Safety, Mechanical Interpretability
Recent Activity
updated
a model
16 days ago
rshwndsz/gemma-3-4b-it-ckpt-int8
published
a model
16 days ago
rshwndsz/gemma-3-4b-it-ckpt-int8
updated
a model
20 days ago
rshwndsz/gemma-3-4b-it-ckpt
Organizations
None yet