Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Organizations
models
14
peakji/qwen2.5-coder-7b-awq
2B
•
Updated
•
6
peakji/steiner-32b-preview-gguf
33B
•
Updated
•
157
•
23
peakji/steiner-32b-preview-awq
6B
•
Updated
•
2
•
4
peakji/steiner-32b-preview
33B
•
Updated
•
2
•
92
peakji/peak-reasoning-7b-gguf
8B
•
Updated
•
80
•
4
peakji/peak-reasoning-7b-awq
2B
•
Updated
•
4
peakji/peak-reasoning-7b
8B
•
Updated
•
5
peakji/qwen2.5-72b-instruct-trim
73B
•
Updated
•
1
peakji/qwen2.5-32b-instruct-trim
33B
•
Updated
•
6
peakji/qwen2.5-14b-instruct-trim
15B
•
Updated
•
4
datasets
8
peakji/peak-text-with-context-2m
Viewer
•
Updated
•
2.07M
•
138
peakji/peak-anchor-content-plain-20k
Viewer
•
Updated
•
20.1k
•
66
peakji/peak-search-content-plain-40k
Viewer
•
Updated
•
40.4k
•
13
peakji/peak-anchor-content-35k
Viewer
•
Updated
•
35.6k
•
50
peakji/peak-search-content-70k
Viewer
•
Updated
•
70.2k
•
37
peakji/peak-anchor-40k
Viewer
•
Updated
•
42.7k
•
38
peakji/peak-search-300k
Viewer
•
Updated
•
312k
•
19
peakji/peak-intent-50
Viewer
•
Updated
•
265k
•
25