Hugging Face
Lei Wang
demolei
9 followers · 5 following
https://demoleiwang.github.io/HomePage/
demo_lei_wang
lei-wang-0805831a2
AI & ML interests
LLMs
Recent Activity
Upvoted a paper (about 2 hours ago): DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation
Upvoted a paper (3 days ago): One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
Upvoted a paper (3 days ago): GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Organizations
demolei's models (6)
demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40 • 8B • Updated May 8, 2025
demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240 • 8B • Updated May 5, 2025
demolei/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated Feb 23, 2025 • 2
demolei/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • Updated Feb 23, 2025 • 1
demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Feb 23, 2025
demolei/sft_openassistant-guanaco • Updated Jun 28, 2024