Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper about 2 hours ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

upvoted a paper 3 days ago

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

upvoted a paper 3 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

demolei's models (6)

demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40

8B • Updated May 8, 2025

demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240

8B • Updated May 5, 2025

demolei/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Feb 23, 2025 • 2

demolei/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Feb 23, 2025 • 1

demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Feb 23, 2025

demolei/sft_openassistant-guanaco

Updated Jun 28, 2024