Daniel van Strien's picture

🏗️ Building on HF

Daniel van Strien PRO

davanstrien

huggingface

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a bucket 23 minutes ago

davanstrien/atlas-data

liked a dataset about 1 hour ago

open-thoughts/OpenThoughts-Agent-SFT-100K

liked a Space about 4 hours ago

small-models-for-glam/index-card-extractor

View all activity

Organizations

upvoted an article about 9 hours ago

Article

Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets

huggingface

•

about 12 hours ago

• 30

upvoted an article 1 day ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

+2

BenjaminB, sayakpaul, hubnemo, kashif

•

7 days ago

• 60

upvoted a collection 2 days ago

PP-OCRv6

From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks • 19 items • Updated 10 days ago • 91

upvoted an article 2 days ago

Article

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters

PaddlePaddle

•

2 days ago

• 22

upvoted an article 12 days ago

Article

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

Samoed

•

12 days ago

• 22

upvoted an article 14 days ago

Article

Any Custom Frontend with Gradio's Backend

ysharma, abidlabs

•

Apr 1

• 38

upvoted an article 15 days ago

Article

Migrating Your GitHub CI to Hugging Face Jobs

abidlabs

•

16 days ago

• 10

upvoted a paper 18 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 24 days ago • 56

upvoted an article 20 days ago

Article

Designing the hf CLI as an agent-optimized way to work with the Hub

celinah, Wauplin

•

21 days ago

• 58

upvoted a paper 21 days ago

Less Is More? When Dataset Context Hurts LLM-Generated Dataset Descriptions

Paper • 2606.02334 • Published 24 days ago • 1

upvoted an article 21 days ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

29 days ago

• 42

upvoted an article 23 days ago

Article

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

ibm-research

•

23 days ago

• 88

upvoted an article 27 days ago

Article

MONET: Lowering the Barrier to World Class Image Generation Research

jasperai

•

27 days ago

• 10

upvoted a collection 27 days ago

MONET - Massive Open Non-redundant, Enriched, Text-to-image

A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated 27 days ago • 11

upvoted a collection 30 days ago

MiniCPM5

A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 30 days ago • 27

upvoted an article about 1 month ago

Article

Why Open Models Are the Only Sustainable Way to Teach AI

penelopegittos

•

May 22

• 8

upvoted a paper about 1 month ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published May 19 • 137

upvoted a collection about 1 month ago

Hy-MT2

混元翻译模型2.0版本 • 11 items • Updated 30 days ago • 44

upvoted an article about 1 month ago

Article

The Open Agent Leaderboard

ibm-research

•

May 18

• 14

upvoted a collection about 2 months ago

OCR models

11 items • Updated May 21 • 14