Getting it Right: Improving Spatial Consistency in Text-to-Image Models • arXiv:2404.01197 • Published Apr 1, 2024
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models • arXiv:2404.03118 • Published Apr 3, 2024
FastRM: An efficient and automatic explainability framework for multimodal generative models • arXiv:2412.01487 • Published Dec 2, 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models • arXiv:2408.02231 • Published Aug 5, 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation • arXiv:2404.08540 • Published Apr 12, 2024
DataComp: In search of the next generation of multimodal datasets • arXiv:2304.14108 • Published Apr 27, 2023