3 20 4

winston_ge

W1nst0nGe

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

Parallel Scaling Law for Language Models

upvoted a paper 2 months ago

Seed1.5-VL Technical Report

upvoted a paper 2 months ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

View all activity

Organizations

upvoted 4 papers 2 months ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15 • 82

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 148

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 67

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 82

updated 2 datasets 3 months ago

General-Level/General-Bench-Openset

Updated 8 days ago • 74k • 4

General-Level/General-Bench-Closeset

Updated about 21 hours ago • 4.05k • 2

New activity in General-Level/General-Bench-Closeset 3 months ago

create image folder

#2 opened 3 months ago by

W1nst0nGe

New activity in General-Level/General-Bench-Openset 3 months ago

create image folder

#7 opened 3 months ago by

W1nst0nGe

create image folder

#6 opened 3 months ago by

W1nst0nGe

liked a Space 8 months ago

UGround

📱

upvoted a paper 9 months ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 32

upvoted a paper 10 months ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 49

upvoted a paper 11 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 101

upvoted a paper 12 months ago

AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents

Paper • 2407.17490 • Published Jul 3, 2024 • 32

upvoted a paper about 1 year ago

Understanding Alignment in Multimodal LLMs: A Comprehensive Study

Paper • 2407.02477 • Published Jul 2, 2024 • 24

upvoted an article about 1 year ago

Article

Breaking resolution curse of vision-language models

•

Feb 24, 2024

• 19

upvoted a paper about 1 year ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 65

liked a dataset about 1 year ago

ONE-Lab/GUI-World

Preview • Updated Mar 26 • 4.05k • 31

upvoted 2 papers over 1 year ago

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

Paper • 2312.11370 • Published Dec 18, 2023 • 20

Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Paper • 2403.07750 • Published Mar 12, 2024 • 24

winston_ge

AI & ML interests

Recent Activity

Organizations

W1nst0nGe's activity

create image folder

create image folder

create image folder

UGround

Breaking resolution curse of vision-language models