Jiaxi Li
plusn
·
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
about 1 month ago
Reinforcement Pre-Training
upvoted
a
paper
about 1 month ago
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
upvoted
a
paper
about 2 months ago
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth
Approach
Organizations
None yet