Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
7
xincheng yang
Vertax
Follow
0 followers
ยท
10 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
1 day ago
Vertax/xense_bi_arx5_tie_shoelaces_tactile
published
a dataset
1 day ago
Vertax/xense_bi_arx5_tie_shoelaces_tactile
reacted
to
andito
's
post
with โค๏ธ
1 day ago
Finally, our new paper is out! "๐๐ถ๐ป๐ฒ๐ฉ๐ถ๐๐ถ๐ผ๐ป: ๐ข๐ฝ๐ฒ๐ป ๐๐ฎ๐๐ฎ ๐๐ ๐๐น๐น ๐ฌ๐ผ๐ ๐ก๐ฒ๐ฒ๐ฑ"! ๐ฅณ https://huggingface.co/papers/2510.17269 If you've ever trained a VLM, you know this problem: nobody shares their data mixtures. It's a black box, making replicating SOTA work impossible. We wanted to change that. FineVision unifies 200 sources into 24 million samples. With 17.3 million images and 9.5 billion answer tokens, it's the largest open resource of its kind. In the paper, we share how we built it: ๐ finding and cleaning data at scale ๐งน removing excessive duplicates across sources ๐ค decontaminating against 66 public benchmarks My favorite part is Figure 6 (in the video!). It's our visual diversity analysis. It shows that FineVision isn't just bigger; it's more balanced and conceptually richer than other open datasets. NVIDIA's Eagle 2 paper highlighted just how critical this visual diversity is, and our results confirm it: models trained on FineVision consistently outperform those trained on any other open dataset on 11 benchmarks! ๐ To celebrate the paper, Iโm also releasing a concatenated and shuffled version of the full dataset! ๐`HuggingFaceM4/FineVision_full_shuffled` Itโs ready to stream, so you can start training your own models right away: from datasets import load_dataset d = load_dataset("HuggingFaceM4/FineVision_full_shuffled", split="train", streaming=True) print(next(iter(d))) A big shoutout to the first authors: Luis Wiedmann and Orr Zohar. They are rockstars!
View all activity
Organizations
models
6
Sort:ย Recently updated
Vertax/act_bi_arx5_pick_and_place_cube
Updated
24 days ago
โข
46
Vertax/diffusion_bi_arx5_pick_and_place_cube
Robotics
โข
Updated
24 days ago
โข
16
Vertax/bi_arx5_pick_and_place_cube
Robotics
โข
Updated
30 days ago
โข
16
Vertax/smolvla_xense-so101-place-by-colors_policy
Robotics
โข
Updated
Sep 3
โข
1
Vertax/act_xense-so101-place-by-colors_policy
Robotics
โข
Updated
Sep 3
โข
1
Vertax/act_xense-so101-test_policy
Robotics
โข
Updated
Aug 20
โข
1
datasets
13
Sort:ย Recently updated
Vertax/xense_bi_arx5_tie_shoelaces_tactile
Viewer
โข
Updated
1 day ago
โข
1.56k
โข
191
Vertax/xense_bi_arx5_insert_shoelaces_high_quality
Viewer
โข
Updated
3 days ago
โข
22.1k
โข
68
Vertax/xense_bi_arx5_tie_shoelaces
Viewer
โข
Updated
4 days ago
โข
515k
โข
735
โข
1
Vertax/xense_bi_arx5_pick_and_place_cube
Viewer
โข
Updated
11 days ago
โข
25.3k
โข
237
Vertax/eval_diffusion_bi_arx5_pick_and_place_cube
Updated
24 days ago
โข
53
Vertax/eval_bi_arx5_pick_and_place_cube
Updated
30 days ago
โข
116
Vertax/bi_arx5_demo
Viewer
โข
Updated
about 1 month ago
โข
1.35k
โข
65
Vertax/place_dual_shoes_demo_clean_arx5_repo
Viewer
โข
Updated
Sep 11
โข
15.1k
โข
6
Vertax/xense-so101-place-by-colors
Viewer
โข
Updated
Sep 3
โข
84.8k
โข
76
Vertax/xense-so101-test
Viewer
โข
Updated
Aug 20
โข
19.2k
โข
8
View 13 datasets