7 12 5

Junhyeok Kim

kjunh

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

upvoted a paper 6 days ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

new activity 8 days ago

kjunh/v1-7B:Improve model card with pipeline tag and library name

View all activity

Organizations

upvoted a paper 1 day ago

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Paper • 2507.07990 • Published 2 days ago • 29

upvoted a paper 6 days ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published 12 days ago • 76

New activity in kjunh/v1-7B 8 days ago

Improve model card with pipeline tag and library name

#1 opened about 1 month ago by

nielsr

upvoted 3 papers about 1 month ago

Language-Image Alignment with Fixed Text Encoders

Paper • 2506.04209 • Published Jun 4 • 11

Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Paper • 2506.00070 • Published May 29 • 28

Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues

Paper • 2506.00958 • Published Jun 1 • 20

authored a paper about 1 month ago

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24 • 37

upvoted a paper about 1 month ago

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24 • 37

commented a paper about 1 month ago

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24 • 37 •

updated a model about 2 months ago

kjunh/v1-7B

Image-Text-to-Text • 8B • Updated 8 days ago • 11

published a model about 2 months ago

kjunh/v1-7B

Image-Text-to-Text • 8B • Updated 8 days ago • 11

upvoted a paper about 2 months ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published May 21 • 103

New activity in Allen8/TVC-Data 4 months ago

dataset source in json file

#1 opened 4 months ago by

kjunh

upvoted a paper 4 months ago

VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms

Paper • 2503.14427 • Published Mar 18 • 19

upvoted a paper 5 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

liked a Space 5 months ago

439

AI Deadlines

⚡

Organize project deadlines with AI assistance

New activity in kjunh/EgoSpeak 5 months ago

Add video-classification task category and paper link

#1 opened 5 months ago by

nielsr

upvoted a paper 5 months ago

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild

Paper • 2502.14892 • Published Feb 17 • 6

commented a paper 5 months ago

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild

Paper • 2502.14892 • Published Feb 17 • 6 •

authored a paper 5 months ago

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild

Paper • 2502.14892 • Published Feb 17 • 6

Junhyeok Kim

AI & ML interests

Recent Activity

Organizations

kjunh's activity

Improve model card with pipeline tag and library name

dataset source in json file

AI Deadlines

Add video-classification task category and paper link