liu zh's picture

6

liu zh

morphism42

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

upvoted an article 4 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

upvoted an article 7 months ago

How NuminaMath Won the 1st AIMO Progress Prize

View all activity

Organizations

None yet

morphism42's activity

upvoted a paper 9 days ago

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Paper • 2502.02508 • Published 10 days ago • 19

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 218

upvoted 2 articles 7 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 115

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 153

upvoted an article 8 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 233

upvoted an article 10 months ago

Article

Personal Copilot: Train Your Own Coding Assistant

Oct 27, 2023

• 44