Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Thomas Ferraz's picture

Thomas Ferraz

thomas-ferraz

mzboito's profile picture

·

thomas-ferraz

AI & ML interests

NLP in portuguese

Organizations

thomas-ferraz 's collections 3

Retrieve-Reasoning

TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering

Paper • 2504.20114 • Published Apr 28 • 4

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published Feb 6 • 24
Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21 • 43
TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 118
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17 • 25

Reinforcement Learning

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published Apr 22 • 20
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 38

Retrieve-Reasoning

TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering

Paper • 2504.20114 • Published Apr 28 • 4

Reinforcement Learning

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published Apr 22 • 20
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 38

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published Feb 6 • 24
Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21 • 43
TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 118
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17 • 25

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs