jiaxing's picture

jiaxing

huangjiaxing

AI & ML interests

None yet

Recent Activity

authored a paper 11 days ago

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

authored a paper 3 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

View all activity

Organizations

None yet

huangjiaxing's activity

authored a paper 11 days ago

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Paper • 2503.12937 • Published 12 days ago • 27

authored a paper 3 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 39