tensorblock/luckeciano_Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-GGUF
Tags: Transformers, GGUF, DigitalLearningGmbH/MATH-lighteval (dataset), Generated from Trainer, open-r1, trl, grpo, TensorBlock, conversational
Branch: main
1 contributor. History: 2 commits.
Latest commit: d7771b5 (verified) by morriszms, 1 day ago: Upload folder using huggingface_hub
File                                                                            Size     Storage  Last commit                          Updated
.gitattributes                                                                  2.89 kB  -        Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q2_K.gguf    3.02 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q3_K_L.gguf  4.09 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q3_K_M.gguf  3.81 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q3_K_S.gguf  3.49 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q4_0.gguf    4.43 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q4_K_M.gguf  4.68 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q4_K_S.gguf  4.46 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q5_0.gguf    5.32 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q5_K_M.gguf  5.44 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q5_K_S.gguf  5.32 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q6_K.gguf    6.25 GB  LFS      Upload folder using huggingface_hub  1 day ago
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-Q8_0.gguf    8.1 GB   LFS      Upload folder using huggingface_hub  1 day ago
README.md                                                                       9.38 kB  -        Upload folder using huggingface_hub  1 day ago