Qwen2-VL-7B finetuned on a mini subset of SafeRLHF along with additional responses.
for more detailed about training data and parameters, please refer to our Paper
Chat template
Files info
Base model