GRPO trainer to tune Llama-3.1-Nemotron-Nano-8B-v1
#10 opened about 1 month ago by chandan-work1729
Tokenizer ERROR
#9 opened about 2 months ago by Mykhailo21

Which vLLM reasoning parser to use?
#8 opened about 2 months ago by Biggbran
Q4_K_M - .gguf for llama.cpp
#7 opened 2 months ago by PirateBayLoot

Enable GPU (torch cuda)
#6 opened 3 months ago by jr-researcher
Clarification for exact training data used for this model
#5 opened 4 months ago by ryanmarten

Great model, Seeking Advice on Fine-Tuning for Domain Reasoning Tasks
#4 opened 4 months ago by aaditya
