tongxiao

tongxiao2002

https://tongxiao2002.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Perception-Aware Policy Optimization for Multimodal Reasoning

upvoted a paper 2 days ago

Skywork-R1V3 Technical Report

updated a dataset 2 days ago

tongxiao2002/Perception-R1-Dataset

upvoted 3 papers 2 days ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published 4 days ago • 41

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published 5 days ago • 57

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published 3 days ago • 40

upvoted a paper 11 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 215

upvoted a paper about 1 month ago

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Paper • 2505.17018 • Published May 22 • 15

upvoted a paper 2 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 107

upvoted 2 papers 3 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 160

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 53

upvoted a collection 3 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 501

upvoted a collection 4 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Apr 28 • 626

upvoted an article 5 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 874

upvoted a collection 5 months ago

DeepSeek-R1

Collection

10 items • Updated May 29 • 751
