pineapple-policy-oskar_006a_grpo_training / pytorch_model.bin.index.json
skar0's picture
Upload trained grpo model
c13ff8c verified
File too large to display, you can check the raw version instead.