2 22 7

hcwei

AI & ML interests

Diffusion Model, Image Generation, ML, DL, CV

Recent Activity

upvoted a paper 1 day ago

Multimodal Referring Segmentation: A Survey

upvoted a paper 6 days ago

Agentic Reinforced Policy Optimization

commented on a paper about 2 months ago

Training-Free Reasoning and Reflection in MLLMs

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Multimodal Referring Segmentation: A Survey

Paper • 2508.00265 • Published 4 days ago • 6

upvoted a paper 6 days ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published 10 days ago • 125

commented a paper about 2 months ago

Training-Free Reasoning and Reflection in MLLMs

Paper • 2505.16151 • Published May 22 • 9 •

upvoted a paper 2 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 76

upvoted a collection 2 months ago

Multimodal Reasoning

Collection

108 items • Updated 1 day ago • 24

upvoted 2 papers 2 months ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 57

Training-Free Reasoning and Reflection in MLLMs

Paper • 2505.16151 • Published May 22 • 9

commented a paper 2 months ago

Training-Free Reasoning and Reflection in MLLMs

Paper • 2505.16151 • Published May 22 • 9 •

updated a model 3 months ago

hcwei/FRANK-ZERO-38B

38B • Updated Apr 25 • 3 • 3

upvoted 4 papers 4 months ago

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Paper • 2504.13122 • Published Apr 17 • 21

Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17 • 61

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17 • 39

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 52

liked a model 4 months ago

OpenGVLab/InternVL3-14B

Image-Text-to-Text • 15B • Updated May 29 • 652k • 68

upvoted a paper 4 months ago

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Paper • 2504.06514 • Published Apr 9 • 39

liked a model 4 months ago

hcwei/FRANK-ZERO-38B

38B • Updated Apr 25 • 3 • 3

published a model 5 months ago

hcwei/FRANK-ZERO-38B

38B • Updated Apr 25 • 3 • 3

liked a model 5 months ago

hy1111/CLIP-RS

Updated Feb 23 • 4

upvoted 2 papers 7 months ago

Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding

Paper • 2501.07888 • Published Jan 14 • 16

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 298

hcwei

AI & ML interests

Recent Activity

Organizations

hcwei's activity