Zhe Chen's picture

On Vacation 🏝️

Zhe Chen

czczup

·

https://scholar.google.com/citations?hl=en&user=j1rq_lYAAAAJ

czczup

AI & ML interests

multimodal large language model, vision foundation model

Recent Activity

liked a model 11 days ago

tencent/Hy3-preview

liked a dataset 14 days ago

Kassadin88/GLM-5.1-1000000x

liked a dataset 14 days ago

Jackrong/GLM-5.1-Reasoning-1M-Cleaned

View all activity

Organizations

upvoted a collection about 2 months ago

MiroThinker-v1.5

MiroMind’s Open Source Research Agent for Prediction • 2 items • Updated Mar 2 • 25

upvoted a collection 2 months ago

MiMo-V2-Flash

MiMo-V2-Flash Series • 2 items • Updated Dec 17, 2025 • 28

upvoted a paper 6 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 195

upvoted 2 papers 11 months ago

AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

Paper • 2506.05328 • Published Jun 5, 2025 • 21

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29, 2025 • 45

upvoted 2 papers about 1 year ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21, 2025 • 68

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 308

upvoted 2 collections about 1 year ago

InternVL3

33 items • Updated Mar 2 • 84

VisualPRM

7 items • Updated Mar 2 • 4

upvoted 3 papers about 1 year ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13, 2025 • 36

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 84

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

Paper • 2503.01328 • Published Mar 3, 2025 • 16

upvoted a collection about 1 year ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

upvoted a paper over 1 year ago

FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

Paper • 2111.02394 • Published Nov 3, 2021 • 2

upvoted 6 collections over 1 year ago

VideoChat

Chat-Centric Video Understanding • 8 items • Updated Sep 28, 2025 • 3

V2PE

Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding • 3 items • Updated Sep 28, 2025 • 3

InternVL Adaptation

Adaptation Models for Specific Domains • 12 items • Updated Sep 28, 2025 • 2

InternVideo2

InternVideo2 • 21 items • Updated Sep 28, 2025 • 26

InternVL1.5

A Pioneering Open-Source Alternative to GPT-4V • 7 items • Updated Mar 2 • 10

Mono-InternVL

A Pioneering Monolithic MLLM • 8 items • Updated Sep 28, 2025 • 7