agents-course/course-certificates-of-excellence Viewer • Updated about 1 hour ago • 3.75k • 775 • 5
huggingface-projects/Deep-RL-Course-Certification Viewer • Updated 1 day ago • 1.6k • 259 • 16
Post: Sharing the slides from yesterday's talk about "Fine Tuning with TRL" from the @TogetherAgent x @huggingface workshop we hosted in our Paris office! Link: https://github.com/sergiopaniego/talks/blob/main/fine_tuning_with_trl/Fine%20tuning%20with%20TRL%20(Oct%2025).pdf
Post: On-policy distillation is trendy and super useful! HuggingFaceH4/on-policy-distillation
Environment Hub Collection: A collection of OpenEnv-spec Environments • 5 items • Updated 9 days ago • 10