Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 3 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

upvoted a paper 3 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

upvoted a paper 3 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Organizations

Collections 3

Language Modeling Is Compression

SlimPajama-DC: Understanding Data Combinations for LLM Training

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Contrastive Decoding Improves Reasoning in Large Language Models

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Papers 14

models 6

demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40

demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240

demolei/Qwen2.5-1.5B-Open-R1-Distill

demolei/Qwen-2.5-7B-Simple-RL

demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

demolei/sft_openassistant-guanaco

datasets 0

None public yet
