If this really help, please upvote for researchers' hardwork
Jiang Jiwen
jjw0126
AI & ML interests
RL, LLM
Recent Activity
upvoted
a
collection
23 days ago
Reasoning, Thinking, RL and Test-Time Scaling
liked
a dataset
about 1 month ago
Salesforce/xlam-function-calling-60k
new activity
2 months ago
PLM-Team/PLM-1.8B-Instruct:inappropriate tokenizer
Organizations
models
3
datasets
0
None public yet