1 16 6

Shudong Liu

Sudanl

http://sudanl.github.io

AI & ML interests

NLP, LLM

Recent Activity

liked a model 19 days ago

opencompass/CompassVerifier-7B

liked a model 19 days ago

opencompass/CompassVerifier-32B

updated a dataset 22 days ago

opencompass/VerifierBench

View all activity

Organizations

liked 2 models 19 days ago

opencompass/CompassVerifier-7B

8B • Updated 22 days ago • 14 • 4

opencompass/CompassVerifier-32B

33B • Updated 22 days ago • 8 • 4

updated a dataset 22 days ago

opencompass/VerifierBench

Viewer • Updated 22 days ago • 2.82k • 102

upvoted a paper 23 days ago

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published 24 days ago • 28

upvoted a paper 24 days ago

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published 25 days ago • 20

upvoted a paper about 2 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 110

authored 3 papers 2 months ago

TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization

Paper • 2305.01951 • Published May 3, 2023 • 1

CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries

Paper • 2501.01282 • Published Jan 2

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published May 26 • 37

upvoted 2 papers 2 months ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 42

Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective

Paper • 2505.19815 • Published May 26 • 37

New activity in nvidia/Nemotron-CrossThink 2 months ago

It seems that the train_qa subset only contains multiple-choice questions

#4 opened 2 months ago by

Sudanl

upvoted a paper 2 months ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21 • 34

upvoted a paper 4 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

authored a paper 4 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 18

upvoted a paper 4 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 18

upvoted 2 papers 5 months ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 49

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 75

liked a dataset 5 months ago

HuggingFaceH4/MATH-500

Viewer • Updated Nov 15, 2024 • 500 • 69.8k • 168

updated a model 5 months ago

Sudanl/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Feb 24 • 4

Shudong Liu

AI & ML interests

Recent Activity

Organizations

Sudanl's activity

It seems that the train_qa subset only contains multiple-choice questions