Low Horng Jiun
NickolasLow1
ยท
AI & ML interests
None yet
Recent Activity
reacted
to
sergiopaniego's
post
with ๐
about 1 month ago
Interested in RL training environments?
We just released a beginner-friendly walkthrough notebook!
Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM.
happy learning! ๐ฑ
Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb
OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv
updated
a model
about 1 month ago
NickolasLow1/Qwen2.5-7B-Instruct
updated
a Space
about 1 month ago
NickolasLow1/Qwen2.5-7B-Instruct
Organizations
None yet