Yohan Na

nayohan

AI & ML interests

NLP, Dialogue systems

Recent Activity

published a dataset 29 days ago

rl-world/web-agent-trajectory-multimodal-test

updated a dataset 29 days ago

rl-world/web-agent-trajectory-multimodal-test

published a dataset 29 days ago

rl-world/web-agent-trajectory-test

upvoted a paper 3 months ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

upvoted a paper 4 months ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 30

upvoted a collection 4 months ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 3 days ago • 125

upvoted 2 articles 5 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 623

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

Article • Feb 4, 2025 • 34

upvoted a paper 5 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229

upvoted an article 5 months ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Article • Nov 3, 2025 • 65

upvoted a paper 6 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 132

upvoted a collection 6 months ago

KORMo pretraining datasets

The pretraining datasets for KORMo-10B were collected from diverse, publicly available sources. • 14 items • Updated Oct 13, 2025 • 22

upvoted a paper 6 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 87

upvoted a paper 7 months ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Paper • 2510.04230 • Published Oct 5, 2025 • 27

upvoted a collection 7 months ago

Qwen3-VL

37 items • Updated Dec 31, 2025 • 701

upvoted 2 articles 7 months ago

Introducing RTEB: A New Standard for Retrieval Evaluation

Article • Oct 1, 2025 • 141

mmBERT: ModernBERT goes Multilingual

Article • Sep 9, 2025 • 145

upvoted a collection 7 months ago

[Dataset] FineWeb2 Edu Korean

4 items • Updated Mar 2 • 2

upvoted an article 8 months ago

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Article • Aug 8, 2025 • 97

upvoted a collection 8 months ago

AI2 Safety Toolkit

Safety data, moderation tools and safe LLMs. • 6 items • Updated Dec 23, 2025 • 9

upvoted 2 papers 10 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17, 2025 • 46

Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions

Paper • 2506.00421 • Published May 31, 2025 • 5

upvoted a collection 12 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.75k