The model zoo for "Video-As-Prompt: Unified Semantic Control for Video Generation"
ByteDance
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
spaces
12
pinned
Running
on
Zero
1.09k
InfiniteYou-FLUX
📸
Flexible Photo Recrafting While Preserving Your Identity
pinned
Runtime error
26
ID-Patch
📸
Robust ID Association for Group Photo Personalization.
pinned
Running
on
Zero
92
MegaTTS3 Demo
👋
Running
on
Zero
24
XVerse
🖼
Online demo for XVerse
Running
on
Zero
596
DreamO
🐨
A Unified Framework for Image Customization
Running
on
Zero
77
Dolphin
🦀
Dolphin Demo
models
41
ByteDance/Video-As-Prompt-Wan2.1-14B
Image-to-Video
•
Updated
•
36
•
17
ByteDance/Video-As-Prompt-CogVideoX-5B
Image-to-Video
•
Updated
•
26
•
5
ByteDance/Sa2VA-Qwen3-VL-4B
Image-Text-to-Text
•
5B
•
Updated
•
74
•
6
ByteDance/Dolphin-1.5
Image-Text-to-Text
•
0.4B
•
Updated
•
342
•
10
ByteDance/FaceCLIP
Text-to-Image
•
Updated
•
76
ByteDance/Sa2VA-InternVL3-14B
Image-Text-to-Text
•
15B
•
Updated
•
111
•
9
ByteDance/Sa2VA-Qwen2_5-VL-7B
Image-Text-to-Text
•
9B
•
Updated
•
262
•
1
ByteDance/Sa2VA-InternVL3-8B
Image-Text-to-Text
•
8B
•
Updated
•
91
•
3
ByteDance/Sa2VA-Qwen2_5-VL-3B
Image-Text-to-Text
•
4B
•
Updated
•
456
•
1
ByteDance/Sa2VA-InternVL3-2B
Image-Text-to-Text
•
2B
•
Updated
•
209
•
1
datasets
8
ByteDance/veAgentBench
Updated
•
50
•
1
ByteDance/AncientDoc
Viewer
•
Updated
•
3.44k
•
239
•
2
ByteDance/Attention2Probability
Preview
•
Updated
•
27
ByteDance/WildDoc
Viewer
•
Updated
•
35.8k
•
212
•
22
ByteDance/CloudTimeSeriesData
Viewer
•
Updated
•
11.5M
•
23
ByteDance/FullStackBench
Viewer
•
Updated
•
3.37k
•
98
•
20
ByteDance/ComTQA
Viewer
•
Updated
•
9.07k
•
25
•
19
ByteDance/MTVQA
Viewer
•
Updated
•
8.79k
•
216
•
38