25 20 6

Pasquale Minervini

pminervini

https://www.neuralnoise.com

AI & ML interests

NLP, ML, AI

Recent Activity

upvoted a paper about 2 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

authored a paper 3 months ago

Inverse Scaling in Test-Time Compute

commented on a paper 3 months ago

Inverse Scaling in Test-Time Compute

View all activity

Organizations

upvoted a paper about 2 months ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8 • 39

authored a paper 3 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

commented a paper 3 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27 •

upvoted a paper 3 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

upvoted 4 papers 4 months ago

Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models

Paper • 2506.06006 • Published Jun 6 • 13

Inference-Time Hyper-Scaling with KV Cache Compression

Paper • 2506.05345 • Published Jun 5 • 27

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published Jun 3 • 17

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 72

authored 2 papers 5 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 53

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19 • 35

upvoted 2 papers 5 months ago

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19 • 35

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15 • 53

authored 6 papers 8 months ago

FLARE: Faithful Logic-Aided Reasoning and Exploration

Paper • 2410.11900 • Published Oct 14, 2024 • 4

SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages

Paper • 2406.14425 • Published Jun 20, 2024 • 2

upvoted a paper 8 months ago

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

Paper • 2502.05092 • Published Feb 7 • 8

liked a dataset 9 months ago

edinburgh-dawg/mmlu-redux-2.0

Viewer • Updated Feb 25 • 5.7k • 9.78k • 30

Pasquale Minervini

AI & ML interests

Recent Activity

Organizations

pminervini's activity