pineapple-policy-oskar_006_grpo_training / pytorch_model.bin.index.json
Upload trained grpo model
d5b9dc5 verified