reasoning-project

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Cartinoe5930 authored a paper 25 days ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

JW17 authored a paper 4 months ago

AlphaPO -- Reward shape matters for LLM alignment

JW17 authored a paper 4 months ago

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

View all activity

Cartinoe5930

authored a paper 25 days ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Paper • 2510.04230 • Published 29 days ago • 26

JW17

authored 2 papers 4 months ago

AlphaPO -- Reward shape matters for LLM alignment

Paper • 2501.03884 • Published Jan 7 • 2

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

Paper • 2504.03380 • Published Apr 4

JW17

authored a paper 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

amphora

authored a paper 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

Cartinoe5930

authored 2 papers 6 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 10

Won: Establishing Best Practices for Korean Financial NLP

Paper • 2503.17963 • Published Mar 23

amphora

authored a paper 8 months ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 26

Cartinoe5930

authored 2 papers 8 months ago

Multi-Step Reasoning in Korean and the Emergent Mirage

Paper • 2501.05712 • Published Jan 10

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 26

JW17

updated a model 9 months ago

reasoning-project/Q25M-1.5B-MR1-50k-SFT-v0.2-3epoch

Text Generation • 2B • Updated Feb 16

JW17

published a model 9 months ago

reasoning-project/Q25M-1.5B-MR1-50k-SFT-v0.2-3epoch

Text Generation • 2B • Updated Feb 16

JW17

updated a model 9 months ago

reasoning-project/Q25M-1.5B-Open-R1-55k-SFT-v0.1

Text Generation • 2B • Updated Feb 15 • 1

JW17

published a model 9 months ago

reasoning-project/Q25M-1.5B-Open-R1-55k-SFT-v0.1

Text Generation • 2B • Updated Feb 15 • 1

JW17

updated a model 9 months ago

reasoning-project/Q25-1.5B-PRIME-55K-GRPO-Acc2-format5e1

Updated Feb 14

JW17

published a model 9 months ago

reasoning-project/Q25-1.5B-PRIME-55K-GRPO-Acc2-format5e1

Updated Feb 14

JW17

updated a model 9 months ago

reasoning-project/Q25-1.5B-Open-R1-55K-GRPO-Acc2-format5e1

Updated Feb 14

JW17

published a model 9 months ago

reasoning-project/Q25-1.5B-Open-R1-55K-GRPO-Acc2-format5e1

Updated Feb 14

Cartinoe5930

authored 2 papers 10 months ago

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17, 2024 • 3

Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap

Paper • 2501.02448 • Published Jan 5

AI & ML interests

Recent Activity

Team members 3

reasoning-project's activity