Daniel Bourke's picture

Daniel Bourke PRO

mrdbourke

·

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

upvoted a collection about 2 hours ago

commented on an article about 2 hours ago

Gemma 3n fully available in the open-source ecosystem!

upvoted an article about 2 hours ago

Gemma 3n fully available in the open-source ecosystem!

View all activity

Organizations

None yet

upvoted a collection about 2 hours ago

Gemma 3n

4 items • Updated about 9 hours ago • 45

upvoted an article about 2 hours ago

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

1 day ago

• 28

upvoted a collection 2 days ago

GLiNER-X

The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type. • 6 items • Updated 3 days ago • 15

upvoted an article 3 days ago

Article

Transformers backend integration in SGLang

By

and 4 others •

4 days ago

• 35

upvoted a collection 7 days ago

BioCLIP

8 items • Updated 27 days ago • 1

upvoted a paper 11 days ago

RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformer

Paper • 2407.17140 • Published Jul 24, 2024 • 2

upvoted an article 14 days ago

Article

Featherless AI on Hugging Face Inference Providers 🔥

By

and 5 others •

15 days ago

• 41

upvoted a collection 22 days ago

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 14 items • Updated about 7 hours ago • 22

upvoted an article 23 days ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By

and 8 others •

24 days ago

• 164

upvoted 2 collections 27 days ago

MiMo

7 items • Updated 9 days ago • 6

MiMo-VL

3 items • Updated 9 days ago • 27

upvoted a collection 28 days ago

OpenVision

27 items • Updated May 8 • 28

upvoted a paper about 1 month ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 94

upvoted an article about 1 month ago

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

May 15

• 114

upvoted 2 papers about 1 month ago

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published May 12 • 29

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 145

upvoted an article about 1 month ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

By

and 14 others •

Dec 19, 2024

• 657

upvoted a paper about 1 month ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7 • 26

upvoted an article about 1 month ago

Article

Object Detection Leaderboard

By

and 1 other •

Sep 18, 2023

• 15

upvoted an article about 2 months ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

May 12

• 459