Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yushun zhang's picture
2 4

yushun zhang

yushun0410
AROOJ12's profile picture AH211's profile picture sted97's profile picture
·
https://zyushun.github.io/
  • zyushun

AI & ML interests

LLMs

Organizations

None yet

yushun0410's activity

upvoted a paper 5 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368
upvoted 2 collections 6 months ago

Qwen2.5

Collection
Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 26 days ago • 613

Qwen2.5-Math

Collection
Math-specific model series based on Qwen2.5 • 11 items • Updated 26 days ago • 81
upvoted a paper 11 months ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs