Shuaijie She's picture

Shuaijie She

kevinpro

·

https://ricardokevins.github.io/

AI & ML interests

Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization

Recent Activity

upvoted a paper about 1 month ago

How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data

liked a dataset 3 months ago

BAAI/Chinese-LiPS

liked a dataset 3 months ago

PleIAs/YouTube-Commons

View all activity

Organizations

Collections 2

Papers 6

arxiv:2508.14460

arxiv:2507.13618

arxiv:2505.21505

arxiv:2503.21295

spaces 6

DEMO

Verify math steps with explanations

Uni API

Geminitest

Enter token to log in

Pro2a

Testgeminispace

验证令牌并登录

Open Multilingual Reasoning Leaderboard

Display and search a leaderboard of math models

models 15

kevinpro/R-PRM-7B-DPO

Text Generation • 8B • Updated Mar 28, 2025 • 17 • 3

kevinpro/Hydra-LLaMA3-8B-0531-preview-Q4_K_M-GGUF

Text Generation • 8B • Updated May 31, 2024 • 11

kevinpro/MistralMathOctopus-7B

Text Generation • 7B • Updated Mar 26, 2024 • 919

kevinpro/MetaMathOctopus-MAPO-DPO-13B

Text Generation • 13B • Updated Mar 26, 2024 • 7

kevinpro/MathOctopus-MAPO-DPO-7B

Text Generation • 7B • Updated Mar 26, 2024 • 103

kevinpro/MetaMathOctopus-13B

Text Generation • 13B • Updated Mar 26, 2024 • 8

kevinpro/MetaMathOctopus-MAPO-DPO-7B

Text Generation • 7B • Updated Mar 26, 2024 • 7

kevinpro/MetaMathOctopus-7B

Text Generation • 7B • Updated Mar 26, 2024 • 11

kevinpro/MathOctopus-MAPO-DPO-13B

Text Generation • 13B • Updated Mar 26, 2024 • 5

kevinpro/MistralMathOctopus-MAPO-DPO-7B

Text Generation • 7B • Updated Mar 26, 2024

datasets 6

kevinpro/MM

Updated Oct 30, 2025 • 4

kevinpro/R-PRM

Viewer • Updated Mar 28, 2025 • 594k • 1.57k • 1

kevinpro/WildChat-1M-GPT4-1Turn

Viewer • Updated May 6, 2024 • 121k • 10

kevinpro/WildChat-1M-GPT4-strict

Updated May 6, 2024 • 5

kevinpro/WildChat-1M-GPT4

Viewer • Updated May 6, 2024 • 221k • 28

kevinpro/MNumGLUESub

Updated Mar 11, 2024 • 7