Shan Chen's picture

Shan Chen

shanchen

·

https://shanchen.dev/

AI & ML interests

I train and eval pretty ok

Recent Activity

upvoted a paper 12 days ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

authored a paper 23 days ago

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

authored a paper 23 days ago

Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

View all activity

Organizations

New activity in JoaquinVanschoren/croissant-checker about 2 months ago

Croissant checker hanging while local validation passes

#2 opened about 2 months ago by

New activity in AIM-Harvard/rabbits-leaderboard 8 months ago

When would it be open?

#1 opened 12 months ago by

commented a paper 8 months ago

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation

Paper • 2410.12722 • Published Oct 16, 2024 • 5 •

commented a paper about 1 year ago

Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks

Paper • 2406.12066 • Published Jun 17, 2024 • 8 •

New activity in m720/SHADR over 1 year ago

Update README.md

#2 opened over 1 year ago by