4 2 9

sunlin

lincharliesun

https://openreview.net/profile?id=~Lin_Sun15

AI & ML interests

None yet

Recent Activity

commented on a paper 29 days ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

upvoted a paper 30 days ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

authored a paper about 1 month ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

View all activity

Organizations

commented a paper 29 days ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5 • 19 •

upvoted a paper 30 days ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5 • 19

authored a paper about 1 month ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5 • 19

commented a paper about 1 month ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5 • 19 •

updated a collection 3 months ago

TinyR1

Collection

2 items • Updated Apr 21 • 3

updated a model 3 months ago

qihoo360/TinyR1-32B-Preview

Text Generation • 33B • Updated Apr 16 • 4k • • 328

updated a dataset 3 months ago

qihoo360/TinyR1-32B-Preview-datasets

Preview • Updated Apr 16 • 72 • 2

published a dataset 3 months ago

qihoo360/TinyR1-32B-Preview-datasets

Preview • Updated Apr 16 • 72 • 2

authored 5 papers 4 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Paper • 2502.20790 • Published Feb 28

upvoted a paper 4 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

updated a collection 4 months ago

TinyR1

Collection

2 items • Updated Apr 21 • 3

New activity in qihoo360/TinyR1-32B-Preview 4 months ago

Update README.md

#10 opened 4 months ago by

zhaoguangxiang

Repeated Thinking Tags in Output Generation

#2 opened 4 months ago by

xldistance

liked 2 models 4 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 787k • • 12.4k

qihoo360/TinyR1-32B-Preview

Text Generation • 33B • Updated Apr 16 • 4k • • 328

updated a collection 4 months ago

360Zhinao2

Collection

360Zhinao2 language model, include both base and chat model • 7 items • Updated Mar 5 • 1

sunlin

AI & ML interests

Recent Activity

Organizations

lincharliesun's activity

Update README.md

Repeated Thinking Tags in Output Generation