SII-ZhijianZhou

Dexter9516

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

upvoted a paper 11 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

published a dataset about 1 month ago

inf-Prover/Dapo5k

View all activity

Organizations

upvoted a paper 5 days ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 10 days ago • 106

upvoted a paper 11 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published 15 days ago • 38

published a dataset about 1 month ago

inf-Prover/Dapo5k

Updated Aug 3, 2025 • 3

liked a model about 1 month ago

infly/INF-AZ-7B-0524

Image-to-Text • 8B • Updated May 25, 2025 • 27 • 3

upvoted 2 papers about 2 months ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 93

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 23

upvoted a paper 2 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 53

upvoted 3 papers 3 months ago

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 66

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 45

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 29

upvoted a paper 4 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

published a model 5 months ago

Dexter9516/test

Updated Aug 7, 2025

liked a model 5 months ago

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26, 2025 • 3.7M • • 851

updated a dataset 5 months ago

inf-Prover/Dapo5k

Updated Aug 3, 2025 • 3

liked a dataset 5 months ago

Saigyouji-Yuyuko1000/dapo17k

Viewer • Updated Jun 23, 2025 • 17.9k • 120 • 2

liked a model 6 months ago

Goedel-LM/Goedel-Prover-V2-32B

Text Generation • 33B • Updated Aug 27, 2025 • 5.96k • 60

upvoted 2 papers 8 months ago

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published May 19, 2025 • 26

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published May 18, 2025 • 24

upvoted a collection 8 months ago

UnifiedReward 1.0 Qwen2.5VL Models

Collection

6 items • Updated Nov 6, 2025 • 10

upvoted a paper 10 months ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published Mar 24, 2025 • 30

SII-ZhijianZhou

AI & ML interests

Recent Activity

Organizations

Dexter9516's activity