AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy Paper • 2506.13284 • Published 17 days ago • 23
AR-RAG: Autoregressive Retrieval Augmentation for Image Generation Paper • 2506.06962 • Published 26 days ago • 28
LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer Paper • 2506.06952 • Published 26 days ago • 10
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published May 21 • 51
LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Paper • 2504.10430 • Published Apr 14 • 5
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning Paper • 2505.16400 • Published May 22 • 31
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published Apr 15 • 60
FoNE: Precise Single-Token Number Embeddings via Fourier Features Paper • 2502.09741 • Published Feb 13 • 15
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation Paper • 2503.09598 • Published Mar 12 • 2
Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding Paper • 2305.14592 • Published May 24, 2023
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations Paper • 2404.01266 • Published Apr 1, 2024 • 3