Jingcheng Hu
reign12
AI & ML interests
Foundation models and alignment
Recent Activity
upvoted
a
paper
10 days ago
Group Sequence Policy Optimization
upvoted
a
paper
10 days ago
Step-3 is Large yet Affordable: Model-system Co-design for
Cost-effective Decoding