Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ernanhughes 's Collections
code
datasets
financial
programmer.ie

programmer.ie

updated about 17 hours ago

Papers I have written about on my blog.

Upvote
-

  • MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

    Paper • 2503.16874 • Published Mar 21 • 44

  • System Prompt Optimization with Meta-Learning

    Paper • 2505.09666 • Published 28 days ago • 70

  • UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning

    Paper • 2505.23380 • Published 13 days ago • 23

  • DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

    Paper • 2505.23754 • Published 13 days ago • 15

  • Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence

    Paper • 2505.20325 • Published 19 days ago • 45

  • Reward Reasoning Model

    Paper • 2505.14674 • Published 22 days ago • 35

  • I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

    Paper • 2503.18878 • Published Mar 24 • 118

  • Pre-trained Large Language Models Learn Hidden Markov Models In-context

    Paper • 2506.07298 • Published 2 days ago • 17
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs