Generate images based on text prompts and condition images
Co-Speech Gesture Video Generation
Review and analyze medical reports