Chinese data could use this reward to filter?
#8
by
Yvonne1111
- opened
Shall I use this reward model to filter Chinese data?
I read the Nemotron 340B paper and I find that there are 15% multilingual natural language data in pretrain data.