YangWang92

yangwang92

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

upvoted a collection about 1 hour ago

Skywork-Reward-V2

liked a dataset about 20 hours ago

HelpingAI/Dhanishtha-2.0-SUPERTHINKER

authored 2 papers 16 days ago

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published 23 days ago • 14

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published 17 days ago • 37

authored a paper 9 months ago

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Paper • 2409.17066 • Published Sep 25, 2024 • 29