Safetensors
English
qwen3
code

Model

We release the 8B model trained in Critique-Coder.

Data

Data Construction Pipeline is shown:

pipeline

Paper

Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning

Project Page

https://tiger-ai-lab.github.io/Critique-Coder

Code

https://github.com/TIGER-AI-Lab/Critique-Coder

Sample Usage

You can download this dataset using the Hugging Face CLI:

hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset

Citation

@article{ruan2025critiquecoder,
    title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
    author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
    journal={ArXiv},
    year={2025},
    volume={2509.22824}
}
Downloads last month
20
Safetensors
Model size
8.19B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TIGER-Lab/Critique-Coder-8B

Base model

Qwen/Qwen3-8B-Base
Finetuned
Qwen/Qwen3-8B
Finetuned
(350)
this model
Quantizations
3 models

Dataset used to train TIGER-Lab/Critique-Coder-8B

Collection including TIGER-Lab/Critique-Coder-8B