Dmitry Balobin's picture

Dmitry Balobin

d0rj

·

AI & ML interests

NLP and 🥴 tensors. MIPT 💙, 2GIS 💚

Recent Activity

liked a dataset 1 day ago

t-tech/T-Wix

liked a dataset 6 days ago

NousResearch/Hermes-3-Dataset

liked a dataset 11 days ago

togethercomputer/glaive-function-calling-v2-formatted

View all activity

Organizations

None yet

upvoted a collection 11 days ago

3-layer

очень быстрые модели • 3 items • Updated 11 days ago • 1

upvoted a paper 21 days ago

SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval

Paper • 2109.10086 • Published Sep 21, 2021 • 3

upvoted an article about 1 month ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

Jun 12

• 115

upvoted a paper about 1 month ago

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 135

upvoted a collection about 2 months ago

Eso-LMs

Esoteric Language Models • 3 items • Updated Jun 3 • 5

upvoted 3 collections 3 months ago

Qwen3

74 items • Updated about 5 hours ago • 870

blt

4 items • Updated Apr 17 • 26

SANA-Sprint

🏃SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated Apr 17 • 43

upvoted 6 collections 4 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated Apr 17 • 92

DRAMA

A collection of small (sub-1B) multilingual dense retrievers that generalize well across a number of tasks and languages. • 3 items • Updated Feb 26 • 7

SuperBPE

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

Reasoning Dataset

7 items • Updated Jun 19 • 3

Datasets [RU]

SFT / RL high-quality datasets • 10 items • Updated Jun 19 • 2

Gemma 3 Release

24 items • Updated 12 days ago • 417

upvoted a paper 4 months ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 233

upvoted a paper 5 months ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 90

upvoted a collection 6 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 161

upvoted a paper 6 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 94

upvoted a collection 6 months ago

Ru Dialogue Benchmarks

A collection of benchmarks for evaluating the quality of dialogue models in Russian. • 3 items • Updated 4 days ago • 2

upvoted a paper 7 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 106