osanseviero's Collections: Papers I've read
Chain-of-Thought Reasoning Without Prompting
Paper
• 2402.10200
• Published
• 109
Large Language Models Cannot Self-Correct Reasoning Yet
Paper
• 2310.01798
• Published
• 36
Premise Order Matters in Reasoning with Large Language Models
Paper
• 2402.08939
• Published
• 28
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper
• 2402.12875
• Published
• 13
ReAct: Synergizing Reasoning and Acting in Language Models
Paper
• 2210.03629
• Published
• 33
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Paper
• 2207.01206
• Published
• 3
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs
Paper
• 2406.11695
• Published
• 2
Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together
Paper
• 2407.10930
• Published
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Paper
• 2405.15793
• Published
• 7
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper
• 2407.16741
• Published
• 76
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Paper
• 2403.07718
• Published
• 2
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Paper
• 2407.05291
• Published
• 2
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper
• 2402.14083
• Published
• 47
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
Paper
• 2410.09918
• Published
• 3
YuLan-Mini: An Open Data-efficient Language Model
Paper
• 2412.17743
• Published
• 66
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published
• 108