Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kun LI's picture
2 5 16

Kun LI

inNexus

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
upvoted a paper 18 days ago
Reinforcement Learning on Pre-Training Data
new activity 3 months ago
FR3E-Bytedance/FR3E-Math-7B:Any plans to release the training code?
View all activity

Organizations

None yet

Collections 1

NLP
  • Self-Evaluation Improves Selective Generation in Large Language Models

    Paper • 2312.09300 • Published Dec 14, 2023 • 16
  • Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward

    Paper • 2506.05433 • Published Jun 5 • 4
NLP
  • Self-Evaluation Improves Selective Generation in Large Language Models

    Paper • 2312.09300 • Published Dec 14, 2023 • 16
  • Prefix Grouper: Efficient GRPO Training through Shared-Prefix Forward

    Paper • 2506.05433 • Published Jun 5 • 4

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs