- Aligning Teacher with Student Preferences for Tailored Training Data Generation
  Paper • 2406.19227 • Published • 26
- Pre-training Distillation for Large Language Models: A Design Space Exploration
  Paper • 2410.16215 • Published • 16
- Baichuan Alignment Technical Report
  Paper • 2410.14940 • Published • 52
- MiniPLM: Knowledge Distillation for Pre-Training Language Models
  Paper • 2410.17215 • Published • 16
By
ByRookie
AI & ML interests
None yet
Recent Activity
new activity 7 days ago
nvidia/AceReason-1.1-SFT: will you release code rl dataset?
liked a dataset 7 days ago
zwhe99/DeepMath-103K
liked a dataset 17 days ago
open-thoughts/OpenThoughts3-1.2M