Gonçalo Faria
graf
AI & ML interests
NLP
Recent Activity
updated a dataset 4 days ago: graf/qwen_deepsr_train_no_tags
published a dataset 4 days ago: graf/qwen_deepsr_train_no_tags
updated a dataset 4 days ago: graf/qwen_deepsr_math_test_no_tags
models 89
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5-overfit • 2B • Updated • 670
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5 • 2B • Updated • 1.66k
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-6 • 2B • Updated • 839
graf/qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-7 • 2B • Updated • 1.24k
graf/qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-5 • 2B • Updated • 589
graf/qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-6 • 2B • Updated • 8
graf/qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-7 • 2B • Updated • 161
graf/qwen2.5-1.5b-instruct-sft-test-gt-lr1e-7 • 2B • Updated • 263
graf/qwen2.5-1.5b-instruct-sft-test-gt-lr1e-6 • 2B • Updated • 547
graf/qwen2.5-1.5b-instruct-sft-test-gt-lr1e-5 • 2B • Updated • 4
datasets 47
graf/qwen_deepsr_train_no_tags • Viewer • Updated • 24.3k • 62
graf/qwen_deepsr_math_test_no_tags • Viewer • Updated • 418 • 21
graf/qwen_deepsr_gsm8k_test_no_tags • Viewer • Updated • 1.28k • 11
graf/qwen_deepsr_fix_train_no_tags • Updated • 11
graf/qwen_deepsr_fix_train • Viewer • Updated • 24.3k • 64
graf/qwen_deepsr_train • Viewer • Updated • 24k • 125
graf/qwen_deepsr_gsm8k_test • Viewer • Updated • 1.27k • 24
graf/qwen_deepsr_math_test • Viewer • Updated • 415 • 55
graf/DeepScaleR-Preview-Dataset.gt.1.20000.ancestral.128.Qwen2.5-1.5B-Instruct.bon • Viewer • Updated • 12.6k • 24
graf/DeepScaleR-Preview-Dataset.gt.4.20000.ancestral.128.Qwen2.5-1.5B-Instruct.rwmv.0.5 • Viewer • Updated • 80k • 8