YangWang92

yangwang92

AI & ML interests

None yet

Recent Activity

liked a dataset 8 days ago

liked a dataset 10 days ago

stepfun-ai/Step-3.5-Flash-SFT

upvoted an article 17 days ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Organizations

None yet

upvoted an article 17 days ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Article • 18 days ago • 866

upvoted a paper about 1 month ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 132

upvoted a paper 3 months ago

Controlled LLM Training on Spectral Sphere

Paper • 2601.08393 • Published Jan 13 • 2

upvoted an article 3 months ago

Introducing Falcon H1R 7B

Article • Jan 5 • 58

upvoted a collection 3 months ago

Spectral-Sphere-Optimizer

Paper-related Model Checkpoints for Reproduction • 4 items • Updated Jan 5 • 3

upvoted 4 papers 4 months ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published Dec 19, 2025 • 52

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 44

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 111

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

Paper • 2510.24821 • Published Oct 28, 2025 • 41

upvoted 2 papers 5 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 110

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19

upvoted a collection 5 months ago

Retrofitting Recurrence

20 items • Updated Mar 2 • 6

upvoted a paper 5 months ago

DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Paper • 2511.06307 • Published Nov 9, 2025 • 53

upvoted a collection 6 months ago

LLaDA 2.0

7 items • Updated 26 days ago • 41

upvoted 4 papers 6 months ago

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Paper • 2505.06708 • Published May 10, 2025 • 11

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 182

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 170

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published Jul 10, 2025 • 35

upvoted a collection 6 months ago

L1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13, 2025 • 9

upvoted a paper 6 months ago

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Paper • 2510.01179 • Published Oct 1, 2025 • 28