Sam Flin PRO

sflindrs

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

Favorites

liked a Space 3 days ago

moonshotai/Kimi-VL-A3B

liked a Space 3 days ago

moonshotai/Kimi-VL-A3B-Thinking

Organizations

None yet

sflindrs's activity

upvoted a collection 5 days ago

OmniCaptioner

Collection

OmniCaptioner • 8 items • Updated 6 days ago • 1

upvoted 9 papers 5 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 8 days ago • 69

WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments

Paper • 2504.03886 • Published 11 days ago • 9

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Paper • 2504.06958 • Published 6 days ago • 9

DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion

Paper • 2504.04010 • Published 11 days ago • 8

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published 8 days ago • 14

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published 6 days ago • 17

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Paper • 2504.04842 • Published 9 days ago • 29

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Paper • 2504.07083 • Published 6 days ago • 21

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published 6 days ago • 66

upvoted 3 papers 7 days ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 12 days ago • 67

SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning

Paper • 2504.00396 • Published 15 days ago • 4

HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration

Paper • 2504.03536 • Published 11 days ago • 11

upvoted a collection 7 days ago

VisionLM

Collection

929 items • Updated about 19 hours ago • 56

upvoted 2 papers 7 days ago

LiveVQA: Live Visual Knowledge Seeking

Paper • 2504.05288 • Published 8 days ago • 13

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 8 days ago • 158

upvoted 4 papers 11 days ago

SkyReels-A2: Compose Anything in Video Diffusion Transformers

Paper • 2504.02436 • Published 13 days ago • 35

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Paper • 2504.02542 • Published 13 days ago • 41

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published 13 days ago • 17

Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages

Paper • 2503.23542 • Published 16 days ago • 10