2 5 16

Kun LI

inNexus

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

upvoted a paper 19 days ago

Reinforcement Learning on Pre-Training Data

new activity 3 months ago

FR3E-Bytedance/FR3E-Math-7B:Any plans to release the training code?

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published 10 days ago • 39

upvoted a paper 19 days ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published 20 days ago • 65

New activity in FR3E-Bytedance/FR3E-Math-7B 3 months ago

Any plans to release the training code?

👀 2

#2 opened 3 months ago by

inNexus

upvoted a paper 4 months ago

Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers

Paper • 2406.10991 • Published Jun 16, 2024 • 1

updated a collection 4 months ago

NLP

Collection

2 items • Updated Jun 12

liked a model over 1 year ago

stabilityai/stablelm-2-12b

Text Generation • 12B • Updated Jul 10, 2024 • 71 • 120

liked a dataset over 1 year ago

laion/OIG

Viewer • Updated Mar 31, 2023 • 52.6M • 5.98k • 303

updated a collection almost 2 years ago

NLP

Collection

2 items • Updated Jun 12

New activity in SUSTech/SUS-Chat-34B almost 2 years ago

200K Version

👍 1

#7 opened almost 2 years ago by

brucethemoose

liked 7 models almost 2 years ago

upvoted 2 papers about 2 years ago

Long-range Language Modeling with Self-retrieval

Paper • 2306.13421 • Published Jun 23, 2023 • 16

FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation

Paper • 2310.03214 • Published Oct 5, 2023 • 20

liked a model about 2 years ago

TinyLlama/TinyLlama-1.1B-step-50K-105b

Text Generation • 1B • Updated Sep 16, 2023 • 10.5k • • 133