Shan Chen

shanchen

https://shanchen.dev/

AI & ML interests

I train and eval pretty ok

Recent Activity

upvoted a paper 12 days ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published 20 days ago • 105

authored 3 papers 23 days ago

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

Paper • 2505.14963 • Published May 20 • 1

Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

Paper • 2505.13774 • Published May 19 • 1

When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy

Paper • 2505.22888 • Published about 1 month ago • 6

upvoted 3 papers 24 days ago

Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

Paper • 2505.13774 • Published May 19 • 1

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

Paper • 2505.14963 • Published May 20 • 1

When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy

Paper • 2505.22888 • Published about 1 month ago • 6

updated 4 datasets 24 days ago

shanchen/combine_multilingual

Viewer • Updated 24 days ago • 2.1k • 54

shanchen/aime_2025_multilingual

Viewer • Updated 24 days ago • 330 • 395

shanchen/gpqa_diamond_mc_multilingual

Viewer • Updated 24 days ago • 2.18k • 393

shanchen/aime_2024_multilingual

Viewer • Updated 24 days ago • 330 • 381

liked a model 30 days ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated 30 days ago • 552k • 809

updated a collection about 1 month ago

XReasoning - models

https://arxiv.org/abs/2505.22888 • "ds" means continued post-training on DeepSeek-distilled Qwen math 7B; limo-{language}-{amount of data} • 19 items • Updated 24 days ago • 1