microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 2 days ago • 231k • 1.04k
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • Updated 4 days ago • 12.6M • • 804
DreamTeacher: Pretraining Image Backbones with Deep Generative Models Paper • 2307.07487 • Published Jul 14, 2023 • 20
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 30
H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models Paper • 2306.14048 • Published Jun 24, 2023 • 12
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Paper • 2306.14435 • Published Jun 26, 2023 • 20