RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation
Paper
β’
2412.11919
β’
Published
β’
33
Extend images to new sizes using prompts
Upgraded to v1.0!
Generate captions for images in various styles
generated sound from video/text and search
Image Super-resolution via Diffusion Inversion
Ultra-high resolution image synthesis
Build datasets using natural language