2 16 24

Huang

Jinfa

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

VideoAuteur: Towards Long Narrative Video Generation

upvoted a paper 1 day ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

liked a model 1 day ago

BestWishYsh/MagicTime

View all activity

Organizations

Jinfa's activity

upvoted 2 papers 1 day ago

VideoAuteur: Towards Long Narrative Video Generation

Paper • 2501.06173 • Published 8 days ago • 30

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 1 day ago • 40

liked a model 1 day ago

BestWishYsh/MagicTime

Text-to-Video • Updated Dec 3, 2024 • 27

upvoted a paper 1 day ago

Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion

Paper • 2501.09019 • Published 3 days ago • 10

liked a Space about 1 month ago

Running on Zero

💻

Newborn Article Impact Predict

Use title and abstract to predict future academic impact

liked a dataset about 1 month ago

BestWishYsh/ConsisID-preview-Data

Viewer • Updated 23 days ago • 31.9k • 775 • 19

liked a model about 2 months ago

BestWishYsh/ConsisID-preview

Image-to-Video • Updated 25 days ago • 730 • 25

liked a Space about 2 months ago

Running on L40S

🔥

ConsisID-preview

Identity-Preserving Text-to-Video Generation

upvoted a paper about 2 months ago

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published Nov 26, 2024 • 35

authored a paper about 2 months ago

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published Nov 26, 2024 • 35

liked a dataset about 2 months ago

Xkev/LLaVA-CoT-100k

Viewer • Updated Nov 27, 2024 • 98.6k • 1.23k • 65

upvoted 2 papers about 2 months ago

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

Paper • 2411.15411 • Published Nov 23, 2024 • 7

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 10

liked a model about 2 months ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 7.57k • 142

upvoted a paper 2 months ago

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8, 2024 • 17

commented a paper 2 months ago

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8, 2024 • 17 •

liked a model 3 months ago

genmo/mochi-1-preview

Text-to-Video • Updated about 1 month ago • 40.5k • 1.15k

upvoted a paper 3 months ago

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51

liked 2 datasets 3 months ago

BestWishYsh/ChronoMagic-ProH

Viewer • Updated Dec 3, 2024 • 145k • 196 • 15

BestWishYsh/ChronoMagic-Bench

Viewer • Updated 18 days ago • 1.8k • 83 • 10