Kuo-Hsin Tu's picture

166 54

Kuo-Hsin Tu

dapumptu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Distillation Scaling Laws

upvoted a paper 9 days ago

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

upvoted a paper 11 days ago

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

View all activity

Organizations

None yet

dapumptu's activity

upvoted 2 papers 9 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 10 days ago • 42

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published 10 days ago • 49

upvoted 3 papers 11 days ago

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

Paper • 2502.04689 • Published 15 days ago • 7

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published 16 days ago • 20

Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM

Paper • 2502.06635 • Published 12 days ago • 4

upvoted 7 papers 15 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 17 days ago • 13

Large Language Model Guided Self-Debugging Code Generation

Paper • 2502.02928 • Published 17 days ago • 11

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 17 days ago • 56

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 17 days ago • 188

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 17 days ago • 51

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published 16 days ago • 12

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published 16 days ago • 23

upvoted 5 papers 17 days ago

Learning to Generate Unit Tests for Automated Debugging

Paper • 2502.01619 • Published 19 days ago • 4

The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles

Paper • 2502.01081 • Published 19 days ago • 14

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published 20 days ago • 23

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published 19 days ago • 28

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 18 days ago • 13

upvoted a paper 19 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 22 days ago • 105

upvoted a collection 22 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 12 items • Updated 2 days ago • 78

upvoted a paper about 1 month ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 24