Sarthak Malhotra
zarmalhotra
1 follower · 7 following
sarthakwer
sarthak-malhotra
AI & ML interests
None yet
Recent Activity
liked a dataset 14 minutes ago: UCSC-VLAA/MedReason
reacted to davanstrien's post 5 days ago
I've created a v1 dataset (https://huggingface.co/datasets/davanstrien/reasoning-required) and model (https://huggingface.co/davanstrien/ModernBERT-based-Reasoning-Required) to help curate "wild text" data for generating reasoning examples beyond the usual code/math/science domains.
- I developed a "Reasoning Required" dataset with a 0-4 scoring system for reasoning complexity
- I used educational content from https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu, adding annotations for domains, reasoning types, and example questions
My approach enables a more efficient workflow: filter text with small models first, then use LLMs only on high-value content. This significantly reduces computation costs while expanding reasoning dataset domain coverage.
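The filter-first workflow described in the post can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's code: `filter_for_reasoning`, `toy_score`, and the threshold of 3 are hypothetical names and values, and `toy_score` is a keyword-counting stand-in for the small classifier so the example runs without downloading a model.

```python
from typing import Callable, Iterable, List, Tuple


def filter_for_reasoning(
    texts: Iterable[str],
    score_fn: Callable[[str], float],
    min_score: float = 3.0,
) -> List[Tuple[str, float]]:
    """Keep only texts whose reasoning score (0-4 scale) meets the threshold.

    score_fn is assumed to be a cheap scorer, e.g. a small classifier;
    only the texts returned here would be passed on to a larger LLM.
    """
    kept = []
    for text in texts:
        score = score_fn(text)
        if score >= min_score:
            kept.append((text, score))
    return kept


def toy_score(text: str) -> float:
    # Hypothetical stand-in for the small model: count cue words, cap at 4.
    cues = ("because", "therefore", "however", "implies")
    lowered = text.lower()
    return float(min(4, sum(lowered.count(cue) for cue in cues)))


texts = [
    "The sky is blue.",
    "Prices rose because supply fell; therefore demand shifted, however wages lagged.",
]
high_value = filter_for_reasoning(texts, toy_score, min_score=3.0)
```

In practice `score_fn` would wrap the small model's prediction, and the expensive LLM step would run only over `high_value`.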
upvoted a collection 5 days ago: Reasoning Required?
Organizations
Articles
1
Article
14
Reasoning Datasets Competition
models
None public yet
datasets
None public yet