Peng

pandases

AI & ML interests

None yet

Recent Activity

liked a Space 23 days ago

nanotron/ultrascale-playbook

liked a Space 23 days ago

nanotron/predict_memory

upvoted an article 23 days ago

SmolLM3: smol, multilingual, long-context reasoner

upvoted an article 23 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

25 days ago

• 602

upvoted an article about 1 month ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

Jun 3

• 79

upvoted a collection about 1 month ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated 22 days ago • 157

upvoted 2 papers about 1 month ago

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Paper • 2506.18896 • Published Jun 23 • 28

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 124

upvoted a collection 2 months ago

LiveBench

Datasets for LiveBench • 8 items • Updated Mar 31 • 9

upvoted a paper 3 months ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published Mar 31 • 24

upvoted an article 3 months ago

Article

Qwen3 X ModelScope Toolkit: Faster Training + Comprehensive Evaluation

By

•

May 8

• 1

upvoted a paper 3 months ago

LongGenBench: Long-context Generation Benchmark

Paper • 2410.04199 • Published Oct 5, 2024 • 22

upvoted a collection 3 months ago

Qwen3

80 items • Updated 3 days ago • 978

upvoted an article 3 months ago

Article

WTF is Fine-Tuning? (intro4devs) | [2025]

By

•

Feb 16

• 7

upvoted a collection 7 months ago

Phi-4 (All Versions)

Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 3 days ago • 73

upvoted an article 11 months ago

Article

We are hiring interns!

By

and 1 other •

Nov 29, 2022

• 13

upvoted an article 12 months ago

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 401