Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
19
14
7
Xirui Li
PRO
AIcell
Follow
Dolphin42's profile picture
21world's profile picture
Gargaz's profile picture
4 followers
·
13 following
https://xirui-li.github.io/
xiruili7_li
xirui-li
AI & ML interests
Foundation LLM and VLM
Recent Activity
upvoted
a
paper
7 days ago
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
updated
a model
8 days ago
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority
published
a model
9 days ago
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority
View all activity
Organizations
AIcell
's models
26
Sort: Recently updated
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority
2B
•
Updated
8 days ago
•
25
AIcell/Qwen-1.5B-Instruct-GRPO-Majority
2B
•
Updated
10 days ago
•
8
AIcell/Qwen-1.5B-Instruct-GRPO-Random
2B
•
Updated
11 days ago
•
11
AIcell/Qwen-1.5B-Instruct-GRPO
2B
•
Updated
11 days ago
•
22
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-non-reasoning
2B
•
Updated
17 days ago
•
28
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-opposite
2B
•
Updated
18 days ago
•
9
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-random
2B
•
Updated
20 days ago
•
12
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
2B
•
Updated
25 days ago
•
34
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k
Text Generation
•
2B
•
Updated
29 days ago
•
32
AIcell/Qwen2.5-0.5B-Instruct-GRPO-gsm8k
Text Generation
•
0.5B
•
Updated
29 days ago
•
33
AIcell/Qwen2.5-3B-Instruct-GRPO-gsm8k
Updated
Oct 10
AIcell/Qwen2.5-1.5B-Instruct-GRPO-DAPO17k-thinking
2B
•
Updated
Oct 6
•
6
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math220k-thinking
Text Generation
•
2B
•
Updated
Oct 5
•
3
AIcell/Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k
Text Generation
•
2B
•
Updated
Sep 29
•
3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-random-reward
Text Generation
•
2B
•
Updated
Sep 26
•
3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-no-thinking
2B
•
Updated
Sep 26
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-monitor
Text Generation
•
2B
•
Updated
Sep 12
•
2
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-plain
Text Generation
•
2B
•
Updated
Sep 12
•
10
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-GPQA-Diamond-thinking
Updated
Aug 21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-MATH-500-thinking
Updated
Aug 21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-thinking
Updated
Aug 20
AIcell/Qwen2.5-1.5B-Base-GRPO-Math12k
Updated
Jul 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-no-thinkng
Updated
Jul 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k
Updated
Jul 1
AIcell/Qwen2.5-1.5B-Instruct-GRPO
Updated
Jul 1
AIcell/Qwen2.5-Math-1.5B-GRPO
Updated
Jun 1