Unchun Yang's picture

Unchun Yang

ucyang

·

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

Energy-Based Transformers are Scalable Learners and Thinkers

upvoted an article 1 day ago

SmolLM3: smol, multilingual, long-context reasoner

upvoted a paper 2 days ago

4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture

View all activity

Organizations

upvoted a paper about 21 hours ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published 7 days ago • 43

upvoted an article 1 day ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

2 days ago

• 413

upvoted a paper 2 days ago

4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture

Paper • 2507.05163 • Published 3 days ago • 37

upvoted an article 2 days ago

Article

Merge Large Language Models with mergekit

By

•

Jan 9, 2024

• 127

upvoted 2 articles 4 days ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

Jun 3

• 67

Article

🐯 Liger GRPO meets TRL

By

and 5 others •

May 25

• 45

upvoted 2 articles 5 days ago

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

14 days ago

• 105

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

May 12

• 474

upvoted a collection 5 days ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 7 days ago • 147

upvoted a paper 7 days ago

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 37

upvoted an article 8 days ago

Article

Deploying TensorFlow Vision Models in Hugging Face with TF Serving

By

•

Jul 25, 2022

• 2

upvoted 2 collections 9 days ago

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 9 items • Updated 14 days ago • 145

FLUX.1 ONNX

ONNX exports of our FLUX.1 models. • 5 items • Updated 14 days ago • 29

upvoted 2 papers 11 days ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 17 days ago • 56

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 14 days ago • 59

upvoted a paper 12 days ago

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Paper • 2506.17201 • Published 20 days ago • 52

upvoted an article 13 days ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

28 days ago

• 109

upvoted a paper 13 days ago

Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever

Paper • 2408.16672 • Published Aug 29, 2024 • 9

upvoted 2 collections 13 days ago

late interaction retrievers

This collection list our ColBERT like late interaction retriever models • 4 items • Updated Sep 17, 2024 • 2

Gemma 3n

4 items • Updated about 15 hours ago • 163