Tianrui Zhu's picture

3 16 5

Tianrui Zhu

xilluill

·

https://github.com/Xilluill

xilluill

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

upvoted a paper 24 days ago

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions

liked a model about 1 month ago

ziqipang/RandAR

View all activity

Organizations

None yet

upvoted 2 papers 24 days ago

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published 26 days ago • 24

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions

Paper • 2506.09984 • Published 24 days ago • 15

upvoted a paper about 2 months ago

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

Paper • 2505.14135 • Published May 20 • 15

upvoted a paper 2 months ago

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

Paper • 2505.03730 • Published May 6 • 27

upvoted a paper 3 months ago

InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

Paper • 2503.16418 • Published Mar 20 • 36

upvoted 4 collections 4 months ago

Image Editting

30 items • Updated 3 days ago • 2

Gen AI Diffusion

97 items • Updated 1 day ago • 5

Makeup Transfer

2 items • Updated Feb 26 • 1

image

375 items • Updated 3 days ago • 4

upvoted 2 papers 4 months ago

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

Paper • 2406.08085 • Published Jun 12, 2024 • 17

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 25

upvoted a collection 4 months ago

manga_translation

29 items • Updated Mar 22 • 5

upvoted 3 papers 4 months ago

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model

Paper • 2404.19759 • Published Apr 30, 2024 • 28

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Paper • 2502.17363 • Published Feb 24 • 37

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18, 2024 • 32

upvoted a paper 5 months ago

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Paper • 2502.01105 • Published Feb 3 • 20