xincheng yang's picture

1 7

xincheng yang

Vertax

·

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

Vertax/xense_bi_arx5_tie_shoelaces_tactile

published a dataset 1 day ago

Vertax/xense_bi_arx5_tie_shoelaces_tactile

reacted to andito's post with ❤️ 1 day ago

Finally, our new paper is out! "𝗙𝗶𝗻𝗲𝗩𝗶𝘀𝗶𝗼𝗻: 𝗢𝗽𝗲𝗻 𝗗𝗮𝘁𝗮 𝗜𝘀 𝗔𝗹𝗹 𝗬𝗼𝘂 𝗡𝗲𝗲𝗱"! 🥳 https://huggingface.co/papers/2510.17269 If you've ever trained a VLM, you know this problem: nobody shares their data mixtures. It's a black box, making replicating SOTA work impossible. We wanted to change that. FineVision unifies 200 sources into 24 million samples. With 17.3 million images and 9.5 billion answer tokens, it's the largest open resource of its kind. In the paper, we share how we built it: 🔍 finding and cleaning data at scale 🧹 removing excessive duplicates across sources 🤗 decontaminating against 66 public benchmarks My favorite part is Figure 6 (in the video!). It's our visual diversity analysis. It shows that FineVision isn't just bigger; it's more balanced and conceptually richer than other open datasets. NVIDIA's Eagle 2 paper highlighted just how critical this visual diversity is, and our results confirm it: models trained on FineVision consistently outperform those trained on any other open dataset on 11 benchmarks! 🎉 To celebrate the paper, I’m also releasing a concatenated and shuffled version of the full dataset! 👉`HuggingFaceM4/FineVision_full_shuffled` It’s ready to stream, so you can start training your own models right away: from datasets import load_dataset d = load_dataset("HuggingFaceM4/FineVision_full_shuffled", split="train", streaming=True) print(next(iter(d))) A big shoutout to the first authors: Luis Wiedmann and Orr Zohar. They are rockstars!

View all activity

Organizations

models 6

Vertax/act_bi_arx5_pick_and_place_cube

Updated 24 days ago • 46

Vertax/diffusion_bi_arx5_pick_and_place_cube

Robotics • Updated 24 days ago • 16

Vertax/bi_arx5_pick_and_place_cube

Robotics • Updated 30 days ago • 16

Vertax/smolvla_xense-so101-place-by-colors_policy

Robotics • Updated Sep 3 • 1

Vertax/act_xense-so101-place-by-colors_policy

Robotics • Updated Sep 3 • 1

Vertax/act_xense-so101-test_policy

Robotics • Updated Aug 20 • 1

datasets 13

Vertax/xense_bi_arx5_tie_shoelaces_tactile

Viewer • Updated 1 day ago • 1.56k • 191

Vertax/xense_bi_arx5_insert_shoelaces_high_quality

Viewer • Updated 3 days ago • 22.1k • 68

Vertax/xense_bi_arx5_tie_shoelaces

Viewer • Updated 4 days ago • 515k • 735 • 1

Vertax/xense_bi_arx5_pick_and_place_cube

Viewer • Updated 11 days ago • 25.3k • 237

Vertax/eval_diffusion_bi_arx5_pick_and_place_cube

Updated 24 days ago • 53

Vertax/eval_bi_arx5_pick_and_place_cube

Updated 30 days ago • 116

Vertax/bi_arx5_demo

Viewer • Updated about 1 month ago • 1.35k • 65

Vertax/place_dual_shoes_demo_clean_arx5_repo

Viewer • Updated Sep 11 • 15.1k • 6

Vertax/xense-so101-place-by-colors

Viewer • Updated Sep 3 • 84.8k • 76

Vertax/xense-so101-test

Viewer • Updated Aug 20 • 19.2k • 8

View 13 datasets