1 5 16

zhang

landy123007

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

shaohao011/BrainMVP-16k

liked a model 8 days ago

black-forest-labs/FLUX.1-Kontext-dev

commented on a paper about 1 month ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

View all activity

Organizations

None yet

liked a dataset 1 day ago

shaohao011/BrainMVP-16k

Updated Mar 31 • 1.29k • 4

liked a model 8 days ago

black-forest-labs/FLUX.1-Kontext-dev

Image-to-Image • Updated 8 days ago • 155k • • 1.32k

commented a paper about 1 month ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 108 •

upvoted an article about 1 month ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

and 8 others •

Jun 3

• 175

liked a Space about 1 month ago

150

MedGemma - Radiology Explainer Demo

🩺

Radiology Image & Report Explainer Demo. Built with MedGemma

liked a Space about 2 months ago

281

vggt

🏆

VGGT (CVPR 2025)

liked a dataset about 2 months ago

ibrahimhamamci/CT-RATE

Viewer • Updated 18 days ago • 151k • 83.1k • 157

liked a model 2 months ago

TencentBAC/Conan-embedding-v2

published a Space 3 months ago

My Argilla

✍

liked a dataset 4 months ago

nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Updated May 15 • 475k • 133

liked a Space 4 months ago

168

Visualize Dataset (v2.0+ latest dataset format)

💻

Visualize LeRobot Datasets

liked a model 4 months ago

google/siglip2-base-patch16-224

Zero-Shot Image Classification • 0.4B • Updated Feb 21 • 120k • 53

liked a Space 4 months ago

2.75k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 7 months ago

Datou1111/shou_xin

Text-to-Image • Updated Mar 16 • 103 • • 877

liked a dataset 7 months ago

Spawning/PD12M

Viewer • Updated Jan 9 • 12.4M • 3.78k • 160

upvoted an article 10 months ago

Article

Scaling robotics datasets with video encoding

and 2 others •

Aug 27, 2024

• 40

upvoted 2 papers 11 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 92

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 62

liked a model 11 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated 8 days ago • 1.56M • • 10.8k

liked a dataset 11 months ago

tiange/Cap3D

Updated 1 day ago • 12.8k • 112