https://huggingface.co/papers/2501.03006
A sample demonstration of building with thinking LLMs
Erase any object just by naming it!
3D Generation from text prompts
automated video and sound synthesis from images