GRAM-Qwen3-1.7B-RewardModel-GGUF

GRAM-Qwen3-1.7B-RewardModel is a generative reward model developed by NiuTrans that follows a two-step training approach: it first pre-trains on a large amount of unlabeled data and then fine-tunes with supervised labeled data. This methodology, which incorporates label smoothing and a regularized ranking loss, enables effective reward generalization for large language models (LLMs). The model is built on the Qwen3-1.7B base, a compact language model with 1.7 billion parameters, 28 layers, and attention heads designed to handle long-context inputs (up to 32,768 tokens) and support both detailed reasoning and fast responses. GRAM-Qwen3-1.7B-RewardModel is intended for flexible application across diverse tasks, providing an open-source, plug-and-play reward model for aligning LLM outputs without requiring extensive task-specific retraining. It excels in evaluating and ranking the quality of AI-generated responses, operating effectively as a judge model in AI alignment scenarios.

Model Files

Model File name Size QuantType
GRAM-Qwen3-1.7B-RewardModel.BF16.gguf 3.45 GB BF16
GRAM-Qwen3-1.7B-RewardModel.F16.gguf 3.45 GB F16
GRAM-Qwen3-1.7B-RewardModel.F32.gguf 6.89 GB F32
GRAM-Qwen3-1.7B-RewardModel.Q2_K.gguf 778 MB Q2_K
GRAM-Qwen3-1.7B-RewardModel.Q3_K_L.gguf 1 GB Q3_K_L
GRAM-Qwen3-1.7B-RewardModel.Q3_K_M.gguf 940 MB Q3_K_M
GRAM-Qwen3-1.7B-RewardModel.Q3_K_S.gguf 867 MB Q3_K_S
GRAM-Qwen3-1.7B-RewardModel.Q4_K_M.gguf 1.11 GB Q4_K_M
GRAM-Qwen3-1.7B-RewardModel.Q4_K_S.gguf 1.06 GB Q4_K_S
GRAM-Qwen3-1.7B-RewardModel.Q5_K_M.gguf 1.26 GB Q5_K_M
GRAM-Qwen3-1.7B-RewardModel.Q5_K_S.gguf 1.23 GB Q5_K_S
GRAM-Qwen3-1.7B-RewardModel.Q6_K.gguf 1.42 GB Q6_K
GRAM-Qwen3-1.7B-RewardModel.Q8_0.gguf 1.83 GB Q8_0

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Downloads last month
186
GGUF
Model size
1.72B params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for prithivMLmods/GRAM-Qwen3-1.7B-RewardModel-GGUF

Finetuned
Qwen/Qwen3-1.7B
Quantized
(1)
this model