Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Gabriel Bo's picture

1 2

Gabriel Bo

gabrielbo

·

https://www.gabrielbo.com/

gabrielkmbo
gabriel-bo

AI & ML interests

NLP, Scaling, Test-time Compute

Organizations

gabrielbo 's collections 1

combines reinforcement learning (RL) and large language models (LLMs) to improve exploration using diverse tool generation during inference

gabrielbo/explore-rl-hotpota-trajectories

Updated May 9 • 4
gabrielbo/swirl-trajectories-mmlu-pro

Viewer • Updated May 20 • 24.8k • 21 • 2
gabrielbo/spark-model-QLoRA

Text Generation • Updated May 24 • 1

combines reinforcement learning (RL) and large language models (LLMs) to improve exploration using diverse tool generation during inference

gabrielbo/explore-rl-hotpota-trajectories

Updated May 9 • 4
gabrielbo/swirl-trajectories-mmlu-pro

Viewer • Updated May 20 • 24.8k • 21 • 2
gabrielbo/spark-model-QLoRA

Text Generation • Updated May 24 • 1

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs