Hyogun Lee
Haawron
AI & ML interests
Video understanding, multi-modal LLMs
Recent Activity
upvoted
a
paper
about 1 month ago
Florence-VL: Enhancing Vision-Language Models with Generative Vision
Encoder and Depth-Breadth Fusion
upvoted
a
paper
about 1 month ago
NVILA: Efficient Frontier Visual Language Models
upvoted
a
paper
about 1 month ago
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene
Understanding
Organizations
None yet
models
None public yet
datasets
None public yet