39 46 96

Di Zhang

di-zhang-fdu

https://scholar.google.com/citations?user=vxAO250AAAAJ&hl=en

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

liked a dataset about 3 hours ago

a-m-team/AM-DeepSeek-R1-0528-Distilled

new activity about 16 hours ago

XiaomiMiMo/MiMo-VL-7B-RL:license？Is this model available for commercial usage?

liked a model 2 days ago

nvidia/AceMath-7B-Instruct

View all activity

Organizations

di-zhang-fdu's activity

upvoted 2 papers 5 days ago

Control-R: Towards controllable test-time scaling

Paper • 2506.00189 • Published 12 days ago • 3

AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

Paper • 2506.05328 • Published 6 days ago • 20

upvoted a paper 8 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 12 days ago • 119

upvoted a paper 16 days ago

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Paper • 2505.17873 • Published 19 days ago • 30

upvoted a paper 23 days ago

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published 26 days ago • 55

upvoted a paper 29 days ago

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 37

upvoted 2 papers 30 days ago

A Preliminary Study for GPT-4o on Image Restoration

Paper • 2505.05621 • Published May 8 • 10

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 42

upvoted a paper about 1 month ago

AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG

Paper • 2504.14858 • Published Apr 21 • 3

upvoted 2 papers about 2 months ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21 • 65

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Paper • 2504.10368 • Published Apr 14 • 21

upvoted a paper 2 months ago

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

Paper • 2504.02949 • Published Apr 3 • 21

upvoted a paper 3 months ago

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10, 2024 • 43

upvoted 2 papers 4 months ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

upvoted a paper 5 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 100

upvoted 3 papers 6 months ago

Language Models as Inductive Reasoners

Paper • 2212.10923 • Published Dec 21, 2022 • 2

Logical Reasoning over Natural Language as Knowledge Representation: A Survey

Paper • 2303.12023 • Published Mar 21, 2023 • 2

MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

Paper • 2410.07076 • Published Oct 9, 2024 • 2

upvoted a paper 7 months ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 38