ZhengHao's picture

31 14

ZhengHao

ZhengHao-L

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Qwen/Qwen2.5-VL-3B-Instruct

liked a model 1 day ago

stabilityai/stable-diffusion-xl-base-1.0

liked a model 1 day ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

View all activity

Organizations

None yet

ZhengHao-L's activity

liked 11 models 1 day ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 23 days ago • 907k • 253

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 4.09M • • 6.39k

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 14 days ago • 1.11M • 534

microsoft/Magma-8B

Image-Text-to-Text • Updated 4 days ago • 10.5k • 320

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 258k • • 1.7k

microsoft/OmniParser-v2.0

Image-Text-to-Text • Updated 20 days ago • 8.78k • 1.13k

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • Updated 15 days ago • 63.4k • • 512

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • Updated Jan 12 • 182k • • 1.7k

stabilityai/stable-diffusion-3.5-large

Text-to-Image • Updated Oct 22, 2024 • 158k • • 2.43k

hexgrad/Kokoro-82M

Text-to-Speech • Updated 6 days ago • 1.54M • 3.61k

THUDM/CogView4-6B

Text-to-Image • Updated 6 days ago • 6.84k • • 167

upvoted 9 papers 1 day ago

Koala: Key frame-conditioned long video-LLM

Paper • 2404.04346 • Published Apr 5, 2024 • 7

DATENeRF: Depth-Aware Text-based Editing of NeRFs

Paper • 2404.04526 • Published Apr 6, 2024 • 11

Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models

Paper • 2404.04478 • Published Apr 6, 2024 • 13

YaART: Yet Another ART Rendering Technology

Paper • 2404.05666 • Published Apr 8, 2024 • 17

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

Paper • 2404.05674 • Published Apr 8, 2024 • 15

Aligning Diffusion Models by Optimizing Human Utility

Paper • 2404.04465 • Published Apr 6, 2024 • 15

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

Paper • 2404.04421 • Published Apr 5, 2024 • 18

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 22

BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

Paper • 2404.04544 • Published Apr 6, 2024 • 23