Ross Wightman

rwightman

AI & ML interests

Computer vision, transfer learning, semi/self supervised learning, robotics.

Recent Activity

new activity 5 days ago

timm/vit_small_patch16_dinov3_qkvb.lvd1689m:Regarding the Feature extractor

updated a model 5 days ago

timm/csatv2.r512_in1k

updated a model 5 days ago

timm/csatv2_21m.sw_r640_in1k

upvoted a collection 2 months ago

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 37 items • Updated Sep 18, 2025 • 57

upvoted a paper 3 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 165

upvoted a collection 4 months ago

timm DINOv3

Meta AI's DINOv3 weights in timm. ViTs with `qkvb` have a zero QV bias present, otherwise bias is disabled. QKV biases are all 0 in the original weights. • 18 items • Updated Sep 19, 2025 • 26

upvoted a collection 5 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 396

upvoted an article 5 months ago

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Article • Jun 11, 2024 • 67

upvoted 4 collections 5 months ago

MetaCLIP

MetaCLIP & MetaCLIP2 OpenCLIP and timm models. All models are dual timm + OpenCLIP (or just timm for specific vit encoders). • 24 items • Updated Sep 19, 2025 • 3

Perception Encoder

OpenCLIP (PE Core image + text) and timm PE Core, Spatial, Lang (ViT only) weights. NOTE: These weights do not work with original modeling code. • 19 items • Updated Sep 19, 2025 • 6

Meta CLIP 1

Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 7 items • Updated Nov 24, 2025 • 21

Perception Encoder

17 items • Updated Jul 11, 2025 • 73

upvoted a paper 6 months ago

RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations

Paper • 2412.19628 • Published Dec 27, 2024 • 2

upvoted 2 collections 6 months ago

RecNeXt

37 items • Updated Aug 1, 2025 • 2

Gemma 3n

4 items • Updated Jul 10, 2025 • 255

upvoted 2 collections 8 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11, 2025 • 369

OpenVision

27 items • Updated Aug 15, 2025 • 33

upvoted a paper 11 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 156

upvoted a collection 11 months ago

SigLIP 2

OpenCLIP and timm SigLIP 2 models • 47 items • Updated Sep 19, 2025 • 25

upvoted 3 articles 11 months ago

SigLIP 2: A better multilingual vision language encoder

Article • Feb 21, 2025 • 193

🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces

Article • Feb 2, 2025 • 6

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28, 2025 • 887

upvoted an article 12 months ago

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Article • Jan 15, 2025 • 48