Shaohang Wei

SylvainWei

https://sylvain-wei.github.io

AI & ML interests

NLP, LLM

Recent Activity

upvoted a paper 7 days ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

upvoted a paper 19 days ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

upvoted a paper 22 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Organizations

upvoted a paper 7 days ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published 11 days ago • 83

upvoted a paper 19 days ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Paper • 2510.07896 • Published Oct 9, 2025 • 8

upvoted 2 papers 22 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 261

upvoted a paper about 1 month ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Paper • 2602.07422 • Published Feb 7 • 22

authored a paper about 1 month ago

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Paper • 2602.01745 • Published Feb 2 • 7

upvoted 4 papers about 1 month ago

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Paper • 2602.01745 • Published Feb 2 • 7

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 42

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 32

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 218

upvoted a paper about 2 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 201

liked a model about 2 months ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 17 days ago • 2.89M • 2.29k

upvoted a paper about 2 months ago

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 49

liked a model about 2 months ago

mwhanna/qwen3-1.7b-transcoders-lowl0

Updated Aug 18, 2025 • 605 • 1

upvoted a paper 2 months ago

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published Jan 12 • 24

liked a dataset 4 months ago

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15, 2025 • 25k • 25k • 64

upvoted 2 papers 5 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97

Scaling Language-Centric Omnimodal Representation Learning

Paper • 2510.11693 • Published Oct 13, 2025 • 106

authored a paper 5 months ago

Mitigating Overthinking through Reasoning Shaping

Paper • 2510.09535 • Published Oct 10, 2025 • 5

commented on a paper 5 months ago

Mitigating Overthinking through Reasoning Shaping

Paper • 2510.09535 • Published Oct 10, 2025 • 5
