-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 85 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 95 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 91 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 24
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
liked
a Space
about 2 hours ago
hf-vision/object_detection_leaderboard
liked
a Space
about 2 hours ago
google/paligemma2-10b-mix
liked
a dataset
about 2 hours ago
microsoft/IMAGE_UNDERSTANDING
Organizations
Collections
4
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 78 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 55 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 106 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 95
models
2
datasets
None public yet