Hiroshi Yoshihara's picture

6 5 8

Hiroshi Yoshihara

RabotniKuma

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

upvoted a paper 13 days ago

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

updated a collection 13 days ago

View all activity

Organizations

upvoted a paper 12 days ago

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

Paper • 2503.04412 • Published Mar 6 • 4

upvoted a paper 13 days ago

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

Paper • 2507.08267 • Published 17 days ago • 10

upvoted 2 collections 3 months ago

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 7 days ago • 42

Cosmos

The collection of Cosmos models • 31 items • Updated 7 days ago • 292

upvoted a collection 5 months ago

Reasoning Vector

Reasoningモデルとベースモデルの重み差分 • 4 items • Updated Feb 18 • 3