VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Paper • 2412.21059 • Published Dec 30, 2024 • 18 • 2
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks Paper • 2412.15204 • Published Dec 19, 2024 • 33
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer Paper • 2408.06072 • Published Aug 12, 2024 • 37
CogAgent: A Visual Language Model for GUI Agents Paper • 2312.08914 • Published Dec 14, 2023 • 30
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation Paper • 2304.05977 • Published Apr 12, 2023 • 1
CogVLM: Visual Expert for Pretrained Language Models Paper • 2311.03079 • Published Nov 6, 2023 • 24