
ZiYi Yang

AALF
https://github.com/yangzy39
  • yangzy39

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago
Perception-Aware Policy Optimization for Multimodal Reasoning
upvoted a paper 25 days ago
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
authored a paper about 2 months ago
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Organizations

FuseAI, Sun Yat-Sen University, Tongyi-Zhiwen

Articles 2

Article
22

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Article
5

FuseChat-3.0: Preference Optimization for Implicit Model Fusion


Papers 4

arxiv:2505.17667
arxiv:2503.04222
arxiv:2412.03187
arxiv:2402.16107

Models 7

AALF/FuseR1-QwQ-R1-TinyR1-32B

33B • Updated Mar 7 • 3 • 1

AALF/FuseR1-QwQ-R1-LightR1-32B

33B • Updated Mar 7 • 3

AALF/FuseR1-QwQ-R1-32B

33B • Updated Mar 7 • 4

AALF/FuseR1-QwQ-R1-LightR1-TinyR1-32B

33B • Updated Mar 7 • 5

AALF/gemma-2-27b-it-SimPO-37K

Text Generation • 27B • Updated Dec 18, 2024 • 1.12k • 18

AALF/gemma-2-27b-it-SimPO-37K-100steps

Text Generation • 27B • Updated Dec 18, 2024 • 1.12k • 12

AALF/llama-3-8b-Instruct-simpo-beta10-gamma3-lr1e-6

8B • Updated Aug 16, 2024 • 2

Datasets 1

AALF/ultrafeedback_wrpo

Viewer • Updated Feb 28 • 59.9k • 29