Agentic - a galois77 Collection

updated 9 days ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
Paper • 2505.01441 • Published Apr 28 • 36

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
Paper • 2504.16078 • Published Apr 22 • 20

Emergent Agentic Transformer from Chain of Hindsight Experience
Paper • 2305.16554 • Published May 26, 2023

DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models
Paper • 2504.02882 • Published Apr 2 • 7

ATLAS: Learning to Optimally Memorize the Context at Test Time
Paper • 2505.23735 • Published 15 days ago • 23

Self-Challenging Language Model Agents
Paper • 2506.01716 • Published 12 days ago • 9