Testing
Python2231
ยท
AI & ML interests
None yet
Recent Activity
liked
a dataset
2 days ago
Flux9665/BibleMMS
reacted
to
s-emanuilov's
post
with ๐ฅ
2 days ago
Tutorial ๐ฅ Training a non-English reasoning model with GRPO and Unsloth
I wanted to share my experiment with training reasoning models in languages other than English/Chinese.
Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage.
Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/
The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1
I hope this helps anyone looking to build reasoning models in their language.
liked
a Space
3 days ago
ginigen/3D-LLAMA
Organizations
None yet
Python2231's activity
How can i run this model?
1
#1 opened 2 months ago
by
Python2231
Fail to create VirtualModel task
1
#7 opened about 1 year ago
by
Python2231