7 158 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

updated a collection 2 days ago

upvoted a paper 2 days ago

Towards General-Purpose Model-Free Reinforcement Learning

updated a collection 2 days ago

Agents

View all activity

Organizations

None yet

bfuzzy1's activity

updated a collection 2 days ago

RL

Collection

9 items • Updated 2 days ago • 1

upvoted a paper 2 days ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 3 days ago • 19

updated a collection 2 days ago

Agents

Collection

Collection of resources related to Agents. • 73 items • Updated 2 days ago • 5

upvoted a paper 2 days ago

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 7 days ago • 14

updated a collection 3 days ago

Agents

Collection

Collection of resources related to Agents. • 73 items • Updated 2 days ago • 5

upvoted a paper 3 days ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 8 days ago • 60

reacted to JingzeShi's post with 🤯 3 days ago

Post

2041

Only a single RTX 4090 running model pre-training is really slow, even for small language models!!! ( JingzeShi/doge-slm-677fd879f8c4fd0f43e05458)

2 replies

upvoted a paper 4 days ago

Control LLM: Controlled Evolution for Intelligence Retention in LLM

Paper • 2501.10979 • Published 11 days ago • 4

upvoted a paper 6 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 9 days ago • 20

updated a collection 6 days ago

RL

Collection

9 items • Updated 2 days ago • 1

upvoted a paper 6 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 9 days ago • 76

updated a collection 6 days ago

RL

Collection

9 items • Updated 2 days ago • 1

upvoted a paper 6 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 8 days ago • 267

upvoted a paper 8 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 10 days ago • 84

updated a collection 10 days ago

RL

Collection

9 items • Updated 2 days ago • 1

upvoted a paper 10 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 14 days ago • 100

upvoted a paper 14 days ago

MangaNinja: Line Art Colorization with Precise Reference Following

Paper • 2501.08332 • Published 16 days ago • 55

updated a collection 14 days ago

Agents

Collection

Collection of resources related to Agents. • 73 items • Updated 2 days ago • 5

upvoted 2 papers 14 days ago

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published 19 days ago • 8

WebWalker: Benchmarking LLMs in Web Traversal

Paper • 2501.07572 • Published 17 days ago • 19