33 70 106

Li Dong

unilm

AI & ML interests

Language Model Pre-Training

Recent Activity

upvoted a collection about 1 month ago

VibeVoice Models

new activity about 1 month ago

microsoft/VibeVoice-Realtime-0.5B:Fixed tag typo (long-from -> long-form)

new activity about 1 month ago

microsoft/VibeVoice-Realtime-0.5B:add gradio app for this model

View all activity

Organizations

upvoted a collection about 1 month ago

VibeVoice Models

Collection

3 items • Updated Dec 6, 2025 • 5

New activity in microsoft/VibeVoice-Realtime-0.5B about 1 month ago

Fixed tag typo (long-from -> long-form)

#11 opened about 1 month ago by

mcfadyeni

add gradio app for this model

👍 3

#4 opened about 1 month ago by

akhaliq

liked a model about 1 month ago

microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 28 days ago • 305k • 1.05k

liked a dataset about 2 months ago

ytz20/LMSYS-Chat-GPT-5-Chat-Response

Viewer • Updated Nov 17, 2025 • 192k • 626 • 91

liked 5 models about 2 months ago

upvoted a collection about 2 months ago

GAD-Models

Collection

Model checkpoints of Black-Box On-Policy Distillation of Large Language Models • 5 items • Updated Nov 17, 2025 • 6

liked a model about 2 months ago

sentence-transformers/all-MiniLM-L6-v2

authored a paper about 2 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 49

upvoted a paper about 2 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 49

authored 6 papers 2 months ago

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

Paper • 2509.22613 • Published Sep 26, 2025 • 9

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published Oct 13, 2025 • 27

Information-Preserving Reformulation of Reasoning Traces for Antidistillation

Paper • 2510.11545 • Published Oct 13, 2025 • 1

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 57

Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs

Paper • 2510.24514 • Published Oct 28, 2025 • 21

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30, 2025 • 27