geonung kim's picture

29 4

geonung kim

comar

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

upvoted a paper 2 days ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

upvoted a paper 2 days ago

ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment

View all activity

Organizations

None yet

upvoted 3 papers 2 days ago

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Paper • 2507.22058 • Published 4 days ago • 33

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published 4 days ago • 94

ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment

Paper • 2507.19058 • Published 8 days ago • 12

upvoted 5 papers 8 days ago

TokensGen: Harnessing Condensed Tokens for Long Video Generation

Paper • 2507.15728 • Published 12 days ago • 6

Streaming 4D Visual Geometry Transformer

Paper • 2507.11539 • Published 18 days ago • 14

Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Paper • 2507.17745 • Published 10 days ago • 30

NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining

Paper • 2507.14119 • Published 15 days ago • 53

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published 10 days ago • 77

upvoted a paper 11 days ago

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models

Paper • 2507.13428 • Published 16 days ago • 15

upvoted 4 papers 12 days ago

Taming generative video models for zero-shot optical flow extraction

Paper • 2507.09082 • Published 22 days ago • 11

SpatialTrackerV2: 3D Point Tracking Made Easy

Paper • 2507.12462 • Published 17 days ago • 15

MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second

Paper • 2507.10065 • Published 19 days ago • 23

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published 16 days ago • 62

upvoted a paper 21 days ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published 25 days ago • 113

upvoted a paper 29 days ago

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Paper • 2507.02813 • Published 30 days ago • 59

upvoted 5 papers about 1 month ago

Learning to Skip the Middle Layers of Transformers

Paper • 2506.21103 • Published Jun 26 • 16

FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation

Paper • 2506.18899 • Published Jun 23 • 5

FaSTA^*: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published Jun 26 • 40

Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23 • 62

Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales

Paper • 2506.19713 • Published Jun 24 • 13