Xirui Li's picture

Xirui Li PRO

AIcell

·

https://xirui-li.github.io/

AI & ML interests

Foundation LLM and VLM

Recent Activity

upvoted a paper 7 days ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

updated a model 8 days ago

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority

published a model 9 days ago

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority

View all activity

Organizations

AIcell 's models 26

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority

2B • Updated 8 days ago • 25

AIcell/Qwen-1.5B-Instruct-GRPO-Majority

2B • Updated 10 days ago • 8

AIcell/Qwen-1.5B-Instruct-GRPO-Random

2B • Updated 11 days ago • 11

AIcell/Qwen-1.5B-Instruct-GRPO

2B • Updated 11 days ago • 22

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-non-reasoning

2B • Updated 17 days ago • 28

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-opposite

2B • Updated 18 days ago • 9

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-random

2B • Updated 20 days ago • 12

AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

2B • Updated 25 days ago • 34

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k

Text Generation • 2B • Updated 29 days ago • 32

AIcell/Qwen2.5-0.5B-Instruct-GRPO-gsm8k

Text Generation • 0.5B • Updated 29 days ago • 33

AIcell/Qwen2.5-3B-Instruct-GRPO-gsm8k

AIcell/Qwen2.5-1.5B-Instruct-GRPO-DAPO17k-thinking

2B • Updated Oct 6 • 6

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math220k-thinking

Text Generation • 2B • Updated Oct 5 • 3

AIcell/Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k

Text Generation • 2B • Updated Sep 29 • 3

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-random-reward

Text Generation • 2B • Updated Sep 26 • 3

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-no-thinking

2B • Updated Sep 26

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-monitor

Text Generation • 2B • Updated Sep 12 • 2

AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-plain

Text Generation • 2B • Updated Sep 12 • 10

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-GPQA-Diamond-thinking

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-MATH-500-thinking

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-thinking

AIcell/Qwen2.5-1.5B-Base-GRPO-Math12k

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-no-thinkng

AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k

AIcell/Qwen2.5-1.5B-Instruct-GRPO

AIcell/Qwen2.5-Math-1.5B-GRPO