Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published Jan 7 • 78
Unified Visual Relationship Detection with Vision and Language Models Paper • 2303.08998 • Published Mar 16, 2023
The iNaturalist Species Classification and Detection Dataset Paper • 1707.06642 • Published Jul 20, 2017
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception Paper • 2305.06324 • Published May 10, 2023 • 1
Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset Paper • 2004.12276 • Published Apr 26, 2020 • 1
Spatiotemporal Contrastive Video Representation Learning Paper • 2008.03800 • Published Aug 9, 2020
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation Paper • 2012.07177 • Published Dec 13, 2020
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model Paper • 2306.01736 • Published Jun 2, 2023 • 1
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation Paper • 2104.13921 • Published Apr 28, 2021
VideoGLUE: Video General Understanding Evaluation of Foundation Models Paper • 2307.03166 • Published Jul 6, 2023 • 5
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models Paper • 2302.06235 • Published Feb 13, 2023
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published Nov 11, 2024 • 31
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published 28 days ago • 45
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Paper • 2503.14492 • Published 28 days ago • 17
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published 28 days ago • 45
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published 28 days ago • 45 • 2
Running 2.46k 2.46k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published Jan 7 • 78