Thomas Ferraz

thomas-ferraz

AI & ML interests

NLP in Portuguese

Recent Activity

updated a collection 9 days ago
Reasoning LLMs
updated a collection 9 days ago
Reinforcement Learning
updated a collection 17 days ago
Reasoning LLMs

Organizations

  • NLP Poli-USP
  • IAra Project

Collections 3

Retrieve-Reasoning
  • TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering

    Paper • 2504.20114 • Published 12 days ago • 5
Reinforcement Learning
  • LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

    Paper • 2504.16078 • Published 17 days ago • 20
  • Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

    Paper • 2504.20157 • Published 11 days ago • 34

Papers 5

arxiv:2410.06458
arxiv:2311.01070
arxiv:2308.02962
arxiv:2201.01337

Models 0

None public yet

Datasets 0

None public yet