Drishti Sharma

DrishtiSharma

https://scholar.google.com/citations?hl=en&user=9-GkrdkAAAAJ

AI & ML interests

None yet

Recent Activity

updated a dataset 13 days ago

DrishtiSharma/backtrans-model-eval-evy

published a dataset 13 days ago

DrishtiSharma/backtrans-model-eval-evy

upvoted a collection 13 days ago

SigLIP2

Organizations

upvoted a collection 13 days ago

SigLIP2

Collection • 36 items • Updated Jul 10 • 84

upvoted an article about 1 month ago

Vision Language Models (Better, Faster, Stronger)

Article • and 4 others • May 12 • 508

upvoted a collection about 1 month ago

Multimodal Benchmarks

Collection • 181 items • Updated 9 days ago • 19

upvoted an article about 1 month ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Article • and 1 other • Jul 9 • 649

upvoted a paper 4 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

upvoted an article 4 months ago

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Article • Oct 20, 2024 • 48

upvoted 2 papers 5 months ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published Feb 27 • 27

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 100

upvoted a paper 6 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted an article 6 months ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • and 2 others • Feb 19 • 71

upvoted 10 papers 6 months ago

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published Feb 16 • 23

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 23

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published Feb 13 • 32

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 36

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published Feb 12 • 44

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 90
