6 27 123

Jaeyoon Jung PRO

lastdefiance20

AI & ML interests

multimodal

Recent Activity

liked a model 11 days ago

nari-labs/Dia-1.6B

upvoted a paper 12 days ago

TesserAct: Learning 4D Embodied World Models

upvoted a paper 13 days ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

View all activity

Organizations

lastdefiance20's activity

liked a model 11 days ago

nari-labs/Dia-1.6B

Text-to-Speech • Updated 6 days ago • 151k • • 2.06k

upvoted a paper 12 days ago

TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published 13 days ago • 20

upvoted a paper 13 days ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published 21 days ago • 73

updated a Space 18 days ago

KOFFVQA Leaderboard

🏆

Explore and filter a leaderboard of models

updated a dataset 18 days ago

maum-ai/KOFFVQA_Data

Viewer • Updated 18 days ago • 275 • 115 • 2

liked a model 18 days ago

naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B

Text Generation • Updated 6 days ago • 49.6k • 163

liked a model 19 days ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated 11 days ago • 87.5k • 962

upvoted a collection 24 days ago

Qwen3

Collection

37 items • Updated 3 days ago • 560

liked a dataset 26 days ago

KRX-Data/Won-Instruct

Viewer • Updated 24 days ago • 86k • 697 • 17

upvoted 2 papers about 1 month ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 181

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7 • 58

liked 2 models about 1 month ago

Skywork/SkyReels-A2

Updated Apr 8 • 1.2k • 127

weizhiwang/Open-Qwen2VL

Image-Text-to-Text • Updated 27 days ago • 370 • 15

authored a paper about 1 month ago

KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language

Paper • 2503.23730 • Published Mar 31 • 4

upvoted a paper about 1 month ago

KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language

Paper • 2503.23730 • Published Mar 31 • 4

commented a paper about 1 month ago

KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language

Paper • 2503.23730 • Published Mar 31 • 4 •

upvoted a paper about 1 month ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 50

liked 2 models about 2 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 12 days ago • 184k • 1.59k

ByteDance/InfiniteYou

Text-to-Image • Updated 26 days ago • 18.2k • 595

liked a dataset about 2 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated 4 days ago • 3.91M • 10.5k • 478