Reasoning, Thinking and RL - a kaizuberbuehler Collection

kaizuberbuehler 's Collections

Reasoning, Thinking and RL

Vision Language Models

Foundation Models

Synthetic Data and Self-Improvement

Agents

LM Prompt Engineering

LM Capabilities and Scaling

LM Architectures

Code Generation

EXL2 Quantized Models

Reasoning, Thinking and RL

updated about 13 hours ago