NeMo
English
nvidia
steerlm
reward model

Can this reward model be used to filter Chinese data?

#8
by Yvonne1111 - opened

Can I use this reward model to filter Chinese data?
I read the Nemotron 340B paper and found that 15% of the pretraining data is multilingual natural language data.