Conghui He

conghui

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

opendatalab/MinerU2.5-2509-1.2B

upvoted a paper 4 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

authored a paper 6 days ago

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning

Organizations

None yet

authored a paper 6 days ago

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning

Paper • 2509.21070 • Published 7 days ago • 9

authored a paper 9 days ago

From Uniform to Heterogeneous: Tailoring Policy Optimization to Every Token's Nature

Paper • 2509.16591 • Published 12 days ago • 2

authored a paper 4 months ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25 • 145

authored a paper 5 months ago

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

Paper • 2504.19093 • Published Apr 27 • 17

authored 5 papers 6 months ago

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Paper • 2504.09925 • Published Apr 14 • 38

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 292

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 57

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Paper • 2503.21758 • Published Mar 27 • 22

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs

Paper • 2503.17439 • Published Mar 21 • 15

authored 3 papers 7 months ago

MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion

Paper • 2503.16212 • Published Mar 20 • 25

MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer

Paper • 2503.14891 • Published Mar 19 • 22

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Paper • 2503.15264 • Published Mar 19 • 21

authored a paper 9 months ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 43

authored 6 papers 10 months ago

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

Paper • 2412.11863 • Published Dec 16, 2024 • 4

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98

Chimera: Improving Generalist Model with Domain-Specific Experts

Paper • 2412.05983 • Published Dec 8, 2024 • 9

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 26

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 24

authored a paper 11 months ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23, 2024 • 36