5 26 1

weimeng

mengwei0427

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

upvoted a paper 27 days ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

upvoted a paper about 1 month ago

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

View all activity

Organizations

upvoted a paper 17 days ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published 18 days ago • 45

upvoted a paper 27 days ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 29 days ago • 49

upvoted a paper about 1 month ago

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Paper • 2603.25319 • Published Mar 26 • 32

upvoted a paper about 2 months ago

Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens

Paper • 2603.19232 • Published Mar 19 • 33

upvoted 2 papers 4 months ago

VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs

Paper • 2512.22342 • Published Dec 26, 2025 • 10

LoGoPlanner: Localization Grounded Navigation Policy with Metric-aware Visual Geometry

Paper • 2512.19629 • Published Dec 22, 2025 • 26

upvoted 2 papers 5 months ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 89

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

Paper • 2512.08186 • Published Dec 9, 2025 • 23

submitted a paper to Daily Papers 5 months ago

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

Paper • 2512.08186 • Published Dec 9, 2025 • 23

updated a model 5 months ago

InternRobotics/InternVLA-N1-DualVLN

8B • Updated Dec 10, 2025 • 210 • 4

published a model 5 months ago

InternRobotics/InternVLA-N1-DualVLN

8B • Updated Dec 10, 2025 • 210 • 4

updated a model 5 months ago

InternRobotics/InternVLA-N1-w-NavDP

8B • Updated Dec 10, 2025 • 46 • 2

published a model 5 months ago

InternRobotics/InternVLA-N1-w-NavDP

8B • Updated Dec 10, 2025 • 46 • 2

updated a model 5 months ago

InternRobotics/InternVLA-N1-System2

8B • Updated Dec 10, 2025 • 205 • 1

published a model 5 months ago

InternRobotics/InternVLA-N1-System2

8B • Updated Dec 10, 2025 • 205 • 1

upvoted a paper 6 months ago

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Paper • 2510.26800 • Published Oct 30, 2025 • 22

updated a model 7 months ago

mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln_v1_3

Text Generation • 8B • Updated Sep 28, 2025 • 404

published a model 7 months ago

mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln_v1_3

Text Generation • 8B • Updated Sep 28, 2025 • 404

upvoted 2 papers 8 months ago

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Paper • 2509.15185 • Published Sep 18, 2025 • 29

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Paper • 2509.09680 • Published Sep 11, 2025 • 44

weimeng

AI & ML interests

Recent Activity

Organizations

mengwei0427's activity