arxiv:2501.17116
Ziyue Yang
ziyueyang37
Recent Activity
authored a paper 1 day ago
Optimizing Large Language Model Training Using FP4 Quantization
authored a paper 7 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models