--- license: apache-2.0 datasets: - trl-lib/tldr base_model: - Qwen/Qwen3-0.6B pipeline_tag: summarization library_name: peft --- Source code available at https://github.com/phhusson/llm-rl/blob/main/grpo-tldr.py