9 12 24

Shubham Toshniwal

stoshniwal

https://shtoshni.github.io/

shtoshni

AI & ML interests

NLP, LLM

Recent Activity

commented on a paper 11 days ago

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

updated a dataset 12 days ago

nvidia/OpenMathReasoning

liked a dataset 12 days ago

nvidia/OpenCodeReasoning-2

View all activity

Organizations

stoshniwal's activity

commented a paper 11 days ago

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Paper • 2505.17813 • Published 16 days ago • 55 •

updated a dataset 12 days ago

nvidia/OpenMathReasoning

Viewer • Updated 12 days ago • 5.68M • 28.6k • 274

liked a dataset 12 days ago

nvidia/OpenCodeReasoning-2

Viewer • Updated 23 days ago • 2.16M • 2.27k • 21

New activity in nvidia/OpenMathReasoning 13 days ago

[bot] Conversion to Parquet

#4 opened 16 days ago by

parquet-converter

some files error

#3 opened 26 days ago by

HERIUN

liked a Space 26 days ago

2.67k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

authored 6 papers about 1 month ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 35

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Paper • 2206.04615 • Published Jun 9, 2022 • 5

Nemotron-4 340B Technical Report

Paper • 2406.11704 • Published Jun 17, 2024

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Paper • 2410.01560 • Published Oct 2, 2024 • 4

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 13

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 21

liked 4 models about 1 month ago

upvoted a collection about 1 month ago

OpenMathReasoning

Collection

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 2 days ago • 40

liked a model about 2 months ago

nvidia/OpenMath-Nemotron-32B

Text Generation • Updated Apr 30 • 1.16k • • 27

upvoted a paper about 2 months ago

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 21

liked a dataset about 2 months ago

nvidia/OpenMathReasoning

Viewer • Updated 12 days ago • 5.68M • 28.6k • 274