Shikhar Singh's picture

110 528

Shikhar Singh

AxAI

·

axe--

AI & ML interests

Commonsense & Language Grounding

Recent Activity

liked a model 9 days ago

nvidia/NitroGen

liked a model 15 days ago

allenai/Molmo2-8B

liked a Space 19 days ago

opencv/edge_detection_dexined

View all activity

Organizations

None yet

upvoted a collection 19 days ago

DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 13 items • Updated Jun 27 • 30

upvoted a collection 20 days ago

SigLIP2

36 items • Updated Jul 10 • 101

upvoted a collection 21 days ago

Qwen3-VL

37 items • Updated about 13 hours ago • 552

upvoted an article 30 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

about 1 month ago

•

260

upvoted an article about 1 month ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

+10

Aug 5

•

508

upvoted a collection about 1 month ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 435

upvoted an article about 1 month ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

+3

Jul 29

•

206

upvoted a collection about 1 month ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 12 items • Updated 8 days ago • 140

upvoted an article about 1 month ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

Feb 20

•

320

upvoted a paper about 2 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 95

upvoted 4 articles 2 months ago

Article

Fine-Tune a Semantic Segmentation Model with a Custom Dataset

Mar 17, 2022

•

32

Article

Fine-Tune ViT for Image Classification with 🤗 Transformers

Feb 11, 2022

•

57

Article

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

Sep 10

•

108

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21

•

286

upvoted an article 4 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

May 21

•

247

upvoted a collection 4 months ago

August 29 Releases

40 items • Updated Sep 1 • 7

upvoted a paper 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 211

upvoted an article 5 months ago

Article

Make LLM Fine-tuning 2x faster with Unsloth and 🤗 TRL

Jan 10, 2024

•

74

upvoted 2 collections 5 months ago

Qwen2.5-VL (All Versions)

All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more! • 16 items • Updated 7 days ago • 22

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated about 13 hours ago • 550