Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
Xizhou Zhu
Einsiedler
Follow
0 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
authored
a paper
3 months ago
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
authored
a paper
3 months ago
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
View all activity
Organizations
Papers
11
arxiv:
2504.15279
arxiv:
2503.19757
arxiv:
2503.10291
arxiv:
2501.07783
Expand 11 papers
models
0
None public yet
datasets
0
None public yet