SII-Yibin Wang's picture

SII-Yibin Wang

CodeGoat24

·

https://codegoat24.github.io/

CodeGoat24

AI & ML interests

I'm part of Shanghai Innovation Institute, focusing on Multimodal RL and Generation.

Recent Activity

authored a paper 20 days ago

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

upvoted a paper 20 days ago

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

submitted a paper 20 days ago

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

View all activity

Organizations

upvoted a paper 20 days ago

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

Paper • 2512.04753 • Published 27 days ago • 7

upvoted a paper 26 days ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published 27 days ago • 45

upvoted a paper about 1 month ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19 • 93

upvoted a paper about 2 months ago

UniREditBench: A Unified Reasoning-based Image Editing Benchmark

Paper • 2511.01295 • Published Nov 3 • 37

upvoted a collection about 2 months ago

UniREditBench

4 items • Updated Nov 3 • 1

upvoted 2 papers 2 months ago

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

Paper • 2510.17722 • Published Oct 20 • 19

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21 • 66

upvoted 5 papers 3 months ago

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11 • 35

G^2RPO: Granular GRPO for Precise Reward in Flow Models

Paper • 2510.01982 • Published Oct 2 • 5

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 54

InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles

Paper • 2508.16072 • Published Aug 22 • 4

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26 • 32

upvoted 2 collections 4 months ago

UnifiedReward 2.0 Qwen2.5VL Models

10 items • Updated Nov 6 • 1

Pref-GRPO & UniGenBench

7 items • Updated Nov 28 • 1

upvoted a paper 4 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

upvoted 2 papers 5 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6 • 52

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1 • 62

upvoted a collection 7 months ago

UnifiedReward 1.0 Qwen2.5 Models GGUF

9 items • Updated Nov 6 • 2

upvoted a paper 7 months ago

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Paper • 2506.07160 • Published Jun 8 • 3

upvoted a paper 8 months ago

Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models

Paper • 2505.02686 • Published May 5 • 16