hjkim

hojie11

AI & ML interests

Computer Vision, 3D Vision, Anomaly Detection

Organizations

None yet

hojie11's activity

upvoted a paper 4 days ago

Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection

Paper • 2505.02393 • Published 7 days ago • 2

upvoted a paper 5 days ago

CORG: Generating Answers from Complex, Interrelated Contexts

Paper • 2505.00023 • Published 17 days ago • 8

upvoted 3 papers 13 days ago

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published 17 days ago • 41

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Paper • 2504.16656 • Published 19 days ago • 55

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published 20 days ago • 155

upvoted a paper 17 days ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published 18 days ago • 108

upvoted a paper 18 days ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper • 2504.16030 • Published 19 days ago • 34

upvoted 4 papers 19 days ago

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Paper • 2504.14899 • Published 21 days ago • 20

SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Paper • 2504.14396 • Published 22 days ago • 28

UFO2: The Desktop AgentOS

Paper • 2504.14603 • Published 22 days ago • 28

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

Paper • 2504.15281 • Published 20 days ago • 23

upvoted a paper 21 days ago

AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis

Paper • 2504.13157 • Published 24 days ago • 21

upvoted a paper 25 days ago

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

Paper • 2504.11427 • Published 26 days ago • 19

upvoted 7 papers about 1 month ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 73

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Paper • 2504.07083 • Published Apr 9 • 23

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 160

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 107

URECA: Unique Region Caption Anything

Paper • 2504.05305 • Published Apr 7 • 36

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Paper • 2504.04718 • Published Apr 7 • 41

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 103