1 17 7

Rishabh Maheshwary

rmahesh

https://rishabhmaheshwary.github.io/

AI & ML interests

NLP, Multimodal vision and language, AI robustness and safety

Recent Activity

authored a paper 19 days ago

Prompting with Phonemes: Enhancing LLM Multilinguality for non-Latin Script Languages

authored a paper 19 days ago

Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels

authored a paper 19 days ago

Apriel-Nemotron-15B-Thinker

View all activity

Organizations

upvoted a paper 23 days ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published 25 days ago • 72

upvoted a paper 24 days ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published 26 days ago • 61

upvoted 2 papers 2 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 98

upvoted an article 2 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI

•

Mar 24

• 95

upvoted a paper 3 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

upvoted an article 5 months ago

Article

PipelineRL

ServiceNow

•

Apr 25, 2025

• 45

upvoted an article 6 months ago

Article

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

ServiceNow-AI

•

Dec 23, 2025

• 49

upvoted a collection 8 months ago

Apriel-1.5-15B-Thinker

Collection

3 items • Updated Oct 2, 2025 • 76

upvoted a paper 8 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 125

upvoted a paper 9 months ago

GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning

Paper • 2508.15690 • Published Aug 21, 2025 • 8

upvoted an article 9 months ago

Article

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

ServiceNow-AI

•

Sep 22, 2025

• 14

upvoted a paper 9 months ago

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

upvoted a paper 11 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5, 2025 • 52

upvoted 2 papers over 1 year ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3, 2025 • 39

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20, 2024 • 12

Rishabh Maheshwary

AI & ML interests

Recent Activity

Organizations

rmahesh's activity

A New Framework for Evaluating Voice Agents (EVA)

PipelineRL

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs