Hugging Face
ZiYi Yang
AALF
22 followers · 9 following
https://github.com/yangzy39
yangzy39
AI & ML interests
None yet
Recent Activity
Upvoted a paper 14 days ago: Perception-Aware Policy Optimization for Multimodal Reasoning
Upvoted a paper 25 days ago: Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Authored a paper about 2 months ago: QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Organizations
Articles (2)
FuseO1-Preview: System-II Reasoning Fusion of LLMs • 22
FuseChat-3.0: Preference Optimization for Implicit Model Fusion • 5
Papers (4)
arxiv:2505.17667
arxiv:2503.04222
arxiv:2412.03187
arxiv:2402.16107
Models (7), most recently updated first
AALF/FuseR1-QwQ-R1-TinyR1-32B • 33B • Updated Mar 7 • 3 • 1
AALF/FuseR1-QwQ-R1-LightR1-32B • 33B • Updated Mar 7 • 3
AALF/FuseR1-QwQ-R1-32B • 33B • Updated Mar 7 • 4
AALF/FuseR1-QwQ-R1-LightR1-TinyR1-32B • 33B • Updated Mar 7 • 5
AALF/gemma-2-27b-it-SimPO-37K • Text Generation • 27B • Updated Dec 18, 2024 • 1.12k • 18
AALF/gemma-2-27b-it-SimPO-37K-100steps • Text Generation • 27B • Updated Dec 18, 2024 • 1.12k • 12
AALF/llama-3-8b-Instruct-simpo-beta10-gamma3-lr1e-6 • 8B • Updated Aug 16, 2024 • 2
Datasets (1)
AALF/ultrafeedback_wrpo • Updated Feb 28 • 59.9k • 29