CompVis Community

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

authored a paper 5 months ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24, 2025 • 48

authored a paper 9 months ago

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12, 2025 • 37

authored a paper 9 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 153

authored 3 papers 11 months ago

MMDetection: Open MMLab Detection Toolbox and Benchmark

Paper • 1906.07155 • Published Jun 17, 2019

HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction

Paper • 2412.13187 • Published Dec 17, 2024

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110

authored a paper 11 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25, 2025 • 55

authored 7 papers 12 months ago

DataComp: In search of the next generation of multimodal datasets

Paper • 2304.14108 • Published Apr 27, 2023 • 2

Scalable Extraction of Training Data from (Production) Language Models

Paper • 2311.17035 • Published Nov 28, 2023 • 3

Query-Based Adversarial Prompt Generation

Paper • 2402.12329 • Published Feb 19, 2024

Git Re-Basin: Merging Models modulo Permutation Symmetries

Paper • 2209.04836 • Published Sep 11, 2022 • 2

Scalable Fingerprinting of Large Language Models

Paper • 2502.07760 • Published Feb 11, 2025

PLeaS -- Merging Models with Permutations and Least Squares

Paper • 2407.02447 • Published Jul 2, 2024

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17, 2025 • 13

authored a paper 12 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12, 2025 • 76

authored 3 papers about 1 year ago

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20, 2025 • 14

Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model

Paper • 2410.13882 • Published Oct 3, 2024

MiRAGeNews: Multimodal Realistic AI-Generated News Detection

Paper • 2410.09045 • Published Oct 11, 2024 • 4

authored a paper about 1 year ago

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published Feb 5, 2025 • 31

authored a paper about 1 year ago

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published Jan 31, 2025 • 10