Haoran Wei

HaoranWei

AI & ML interests

LLM, CV, OVOD

Recent Activity

liked a model 2 days ago

stepfun-ai/step3-fp8

upvoted a collection 2 days ago

liked a model 2 days ago

stepfun-ai/step3

Organizations

None yet

authored a paper 3 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 270

authored 2 papers 6 months ago

Focus Anywhere for Fine-grained Multi-page Document Understanding

Paper • 2405.14295 • Published May 23, 2024 • 1

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published Dec 30, 2024 • 15

authored a paper 8 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

authored a paper 11 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 84

authored 6 papers over 1 year ago

OneChart: Purify the Chart Structural Extraction via One Auxiliary Token

Paper • 2404.09987 • Published Apr 15, 2024 • 2

Small Language Model Meets with Reinforced Vision Vocabulary

Paper • 2401.12503 • Published Jan 23, 2024 • 33

ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning

Paper • 2307.09474 • Published Jul 18, 2023 • 1

DreamLLM: Synergistic Multimodal Comprehension and Creation

Paper • 2309.11499 • Published Sep 20, 2023 • 59

Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models

Paper • 2312.06109 • Published Dec 11, 2023 • 21

Merlin: Empowering Multimodal LLMs with Foresight Minds

Paper • 2312.00589 • Published Nov 30, 2023 • 27