oh sehun

sehun

AI & ML interests

None yet

Recent Activity

liked a Space about 18 hours ago

multimodalart/lens

liked a dataset about 18 hours ago

armand0e/qwen3.7-max-pi-traces

upvoted a paper about 18 hours ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Organizations

upvoted a paper about 18 hours ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Paper • 2605.16928 • Published 8 days ago • 83

upvoted a paper 2 days ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published 6 days ago • 29

upvoted 2 papers 3 days ago

Aurora: Unified Video Editing with a Tool-Using Agent

Paper • 2605.18748 • Published 6 days ago • 29

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 5 days ago • 126

upvoted an article 4 days ago

OlmoEarth v1.1: A more efficient family of Earth observation models

Article • allenai • 4 days ago • 15

upvoted a paper 4 days ago

Process Rewards with Learned Reliability

Paper • 2605.15529 • Published 9 days ago • 51

upvoted 7 papers 6 days ago

FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization

Paper • 2605.15824 • Published 9 days ago • 59

Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models

Paper • 2605.07721 • Published 16 days ago • 29

upvoted 7 papers 10 days ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published 11 days ago • 217

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published 12 days ago • 60

Key-Value Means

Paper • 2605.09877 • Published 13 days ago • 25

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published 11 days ago • 96

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 11 days ago • 58

Teaching Language Models to Think in Code

Paper • 2605.07237 • Published 13 days ago • 30

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published 13 days ago • 74
