Hexiang Hu

hexianghu

https://www.hexianghu.com/

AI & ML interests

Multimodal learning: Vision, Language, etc.

hexianghu's activity

liked a model about 1 month ago

HiDream-ai/HiDream-I1-Full

Text-to-Image • Updated 14 days ago • 42.1k • 832

liked a model about 2 months ago

valentinafeve/yolos-fashionpedia

Object Detection • Updated Mar 10, 2023 • 107k • 126

liked a model 2 months ago

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 540k • 3.76k

liked 2 datasets 2 months ago

DIS-CO/MovieTection

Viewer • Updated Mar 31 • 14k • 93 • 5

TheFusion21/PokemonCards

Viewer • Updated Nov 21, 2022 • 13.1k • 239 • 40

authored a paper 4 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

New activity in Spawning/PD12M 5 months ago

Is it possible to get metadata of the images?

#4 opened 5 months ago by hexianghu

liked a dataset 6 months ago

Spawning/PD12M

Viewer • Updated Jan 9 • 12.4M • 1.89k • 156

upvoted a paper 7 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 39

authored a paper 7 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 39

liked a model 9 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 2.63M • 10.1k

authored 3 papers 9 months ago

upvoted a paper 9 months ago

Imagen 3

Paper • 2408.07009 • Published Aug 13, 2024 • 62

authored 5 papers about 1 year ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28, 2024 • 22

Re-Imagen: Retrieval-Augmented Text-to-Image Generator

Paper • 2209.14491 • Published Sep 29, 2022

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

Paper • 2311.17136 • Published Nov 28, 2023 • 7

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

Paper • 2306.00245 • Published May 31, 2023

PreSTU: Pre-Training for Scene-Text Understanding

Paper • 2209.05534 • Published Sep 12, 2022