Lu Sheng

lsheng2024

https://lucassheng.github.io/

AI & ML interests

3D Vision, Embodied AI

Recent Activity

upvoted a collection 29 days ago

MV-Adapter Spaces

upvoted a paper about 1 month ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

upvoted a paper about 1 month ago

A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Organizations

upvoted a collection 29 days ago

MV-Adapter Spaces

Collection

5 items • Updated Mar 31 • 9

upvoted 2 papers about 1 month ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17 • 64

A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Paper • 2411.17265 • Published Nov 26, 2024 • 1

authored a paper about 2 months ago

Use Property-Based Testing to Bridge LLM Code Generation and Validation

Paper • 2506.18315 • Published Jun 23 • 10

upvoted a paper about 2 months ago

Use Property-Based Testing to Bridge LLM Code Generation and Validation

Paper • 2506.18315 • Published Jun 23 • 10

authored 4 papers about 2 months ago

A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Paper • 2411.17265 • Published Nov 26, 2024 • 1

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation

Paper • 2501.12612 • Published Jan 22

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4 • 43

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24 • 59

upvoted a paper about 2 months ago

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24 • 59

upvoted a paper 3 months ago

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4 • 43

authored a paper 5 months ago

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16 • 44

upvoted 2 papers 5 months ago

Video-T1: Test-Time Scaling for Video Generation

Paper • 2503.18942 • Published Mar 24 • 91

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16 • 44

upvoted 2 papers 7 months ago

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published Dec 5, 2024 • 39

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published Jan 7 • 23

liked a Space 8 months ago

MV Adapter T2MV Anime

Space • Generate anime-style multi-view images from texts • 106

authored 3 papers 9 months ago

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE

Paper • 2311.02684 • Published Nov 5, 2023

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Paper • 2203.07845 • Published Mar 15, 2022

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Paper • 2403.12037 • Published Mar 18, 2024 • 1
