SII-ZhijianZhou

Dexter9516

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

published a dataset about 1 month ago

inf-Prover/Dapo5k

liked a model about 1 month ago

infly/INF-AZ-7B-0524

upvoted a paper 5 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published 9 days ago • 36

upvoted 2 papers about 1 month ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 93

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 22

upvoted 2 papers 2 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 53

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 66

upvoted 2 papers 3 months ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 45

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 29

upvoted a paper 4 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

upvoted 2 papers 8 months ago

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published May 19, 2025 • 26

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published May 18, 2025 • 24

upvoted a collection 8 months ago

UnifiedReward 1.0 Qwen2.5VL Models

Collection

6 items • Updated Nov 6, 2025 • 10

upvoted a paper 9 months ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published Mar 24, 2025 • 30

upvoted 2 papers 10 months ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 55

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 122

upvoted a paper about 1 year ago

LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment

Paper • 2412.04814 • Published Dec 6, 2024 • 46
