Delin Qu's picture

Delin Qu

delinqu

·

https://delinqu.github.io/

AI & ML interests

Embodied AI, 3D Vision

Recent Activity

upvoted a paper 15 days ago

Scaling RL to Long Videos

upvoted a collection 21 days ago

Libero Benchmark Dataset

upvoted a paper 21 days ago

Hume: Introducing System-2 Thinking in Visual-Language-Action Model

View all activity

Organizations

authored a paper about 1 month ago

Hume: Introducing System-2 Thinking in Visual-Language-Action Model

Paper • 2505.21432 • Published May 27 • 3

authored 9 papers 5 months ago

FreeGaussian: Annotation-free Controllable 3D Gaussian Splats with Flow Derivatives

Paper • 2410.22070 • Published Oct 29, 2024

Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

Paper • 2503.08120 • Published Mar 11 • 32

Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction

Paper • 2303.18125 • Published Mar 31, 2023

GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting

Paper • 2311.11700 • Published Nov 20, 2023 • 4

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control

Paper • 2406.16038 • Published Jun 23, 2024 • 1

Implicit Event-RGBD Neural SLAM

Paper • 2311.11013 • Published Nov 18, 2023

Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface

Paper • 2409.19499 • Published Sep 29, 2024

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model

Paper • 2501.15830 • Published Jan 27 • 14

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Paper • 2502.09620 • Published Feb 13 • 26