Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
galois77 's Collections
Multi-language
Agentic
Multimodal
Inference
Check-later
Videos
ahan
Image generation
Training optimization
RL
Reasoning
Benchmarks and challenges
Instructions
Evaluators

Agentic

updated 9 days ago
Upvote
-

  • Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

    Paper • 2505.01441 • Published Apr 28 • 36

  • LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

    Paper • 2504.16078 • Published Apr 22 • 20

  • Emergent Agentic Transformer from Chain of Hindsight Experience

    Paper • 2305.16554 • Published May 26, 2023

  • DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

    Paper • 2504.02882 • Published Apr 2 • 7

  • ATLAS: Learning to Optimally Memorize the Context at Test Time

    Paper • 2505.23735 • Published 15 days ago • 23

  • Self-Challenging Language Model Agents

    Paper • 2506.01716 • Published 12 days ago • 9
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs