Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked a model about 8 hours ago: Qwen/Qwen3-VL-235B-A22B-Thinking
liked a model about 8 hours ago: Qwen/Qwen3-VL-235B-A22B-Instruct
liked a model 2 days ago: jhu-clsp/mmBERT-base