ITVTON: Virtual Try-On Diffusion Transformer Model Based on Integrated Image and Text Paper • 2501.16757 • Published Jan 28 • 1
Qwen QwQ-32B Collection Collection Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions. • 13 items • Updated 13 days ago • 5
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Paper • 2503.16418 • Published 4 days ago • 31
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Paper • 2503.14487 • Published 6 days ago • 26
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published 4 days ago • 38
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 4 days ago • 38
Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated 4 days ago • 10
Process Reward Models Collection Model and Datasets for Qwen 2.5 Math PRM 7B • 6 items • Updated Feb 18 • 2
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline Paper • 2406.11939 • Published Jun 17, 2024 • 7
Slam Collection All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide the tokeniser, LM, and datasets • 6 items • Updated 28 days ago • 13
INF-Retriever-v1 Collection LLM-based dense retrieval models for EN & ZH (also effective in other languages) • 2 items • Updated 29 days ago • 1
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 5 days ago • 100
EgoLife Collection CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated 18 days ago • 16
UGround Collection UGround: Universal GUI Visual Grounding for GUI Agents (ICLR'25 Oral) • 9 items • Updated Feb 16 • 4