Qingyun Li

https://scholar.google.com/citations?user=TvsTun4AAAAJ&hl=zh-CN

Li-Qingyun

AI & ML interests

Object Detection, Remote Sensing

Recent Activity

liked a dataset 9 days ago

OpenGVLab/MMBench-GUI

liked a model 14 days ago

rednote-hilab/dots.llm1.inst

upvoted a collection 17 days ago

OmniCorpus

A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text • 6 items • Updated Apr 20 • 2

upvoted a paper 17 days ago

VGR: Visual Grounded Reasoning

Paper • 2506.11991 • Published 20 days ago • 19

upvoted a paper about 1 month ago

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29 • 46

upvoted 2 papers 2 months ago

A Simple Aerial Detection Baseline of Multimodal Language Models

Paper • 2501.09720 • Published Jan 16 • 2

Scalable Vision Language Model Training via High Quality Data Curation

Paper • 2501.05952 • Published Jan 10 • 3

upvoted an article 3 months ago

Preference Optimization for Vision Language Models

Article • By … and 3 others • Jul 10, 2024 • 79

upvoted 2 collections 3 months ago

InternVL3

34 items • Updated Apr 20 • 72

OmniCorpus 🐳

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text https://github.com/OpenGVLab/OmniCorpus • 5 items • Updated May 14 • 1

upvoted a paper 3 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 69

upvoted a collection 8 months ago

InternVL Data

9 items • Updated Apr 20 • 8

upvoted a paper 11 months ago

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 31

upvoted a collection 11 months ago

🍃 MINT-1T

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 60

upvoted a collection about 1 year ago

Florence

9 items • Updated May 1 • 168

upvoted a paper almost 2 years ago

InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language

Paper • 2305.05662 • Published May 9, 2023 • 4