3 4

Zonghao Guo

guozonghao96

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

upvoted a paper 2 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

upvoted a paper 3 months ago

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

View all activity

Organizations

None yet

guozonghao96's activity

authored a paper 5 days ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published 8 days ago • 28

upvoted a paper 2 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10 • 29

upvoted a paper 3 months ago

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Paper • 2412.13871 • Published Dec 18, 2024 • 18

commented a paper 3 months ago

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Paper • 2412.13871 • Published Dec 18, 2024 • 18 •

upvoted a paper 5 months ago

LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models

Paper • 2410.09342 • Published Oct 12, 2024 • 39

upvoted a paper 8 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 82

updated a dataset 8 months ago

guozonghao96/ocr_vqa_image

Updated Aug 4, 2024 • 4

updated a model 8 months ago

guozonghao96/llava-uhd-144-13b

Text Generation • Updated Jul 30, 2024 • 46 • 1

updated a dataset 9 months ago

guozonghao96/objects365

Updated Jul 9, 2024 • 91

New activity in guozonghao96/objects365 9 months ago

Upload 2 files

#1 opened 9 months ago by

guozonghao96

authored a paper about 1 year ago

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Paper • 2403.11703 • Published Mar 18, 2024 • 17