17 12

林優奈

smoore2024

AI & ML interests

None yet

Recent Activity

liked a dataset about 8 hours ago

nvidia/Llama-Nemotron-Post-Training-Dataset

upvoted a paper 1 day ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

liked a dataset 1 day ago

Emmyc2/psp

View all activity

Organizations

None yet

liked a dataset about 8 hours ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 4.79k • 667

upvoted a paper 1 day ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 4 days ago • 139

liked a dataset 1 day ago

Emmyc2/psp

Updated Mar 6 • 521k • 12

liked a model 2 days ago

zhaohq/PureRL-1.5B-v7-s2-l2-kl-w3-b0

Text Generation • 2B • Updated 2 days ago • 106 • 1

liked a dataset 5 days ago

jjr1007/dagger_generalization_10

Viewer • Updated 5 days ago • 378 • 95 • 1

upvoted a paper 9 days ago

HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution

Paper • 2605.09942 • Published 13 days ago • 15

liked a dataset 12 days ago

proj-persona/PersonaHub

Viewer • Updated Sep 26, 2025 • 375k • 6.62k • 756

upvoted a paper 16 days ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published 18 days ago • 99

upvoted a paper 18 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 21 days ago • 162

liked a dataset 22 days ago

Tralalabs/cc-more-cleaned-2026-04

Viewer • Updated 22 days ago • 574 • 55 • 1

upvoted a paper 29 days ago

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

Paper • 2604.21375 • Published about 1 month ago • 18

liked 2 models about 1 month ago

OrcunAICovers/legacy_core_pretrain_v1.5

Updated about 1 month ago • 1

inclusionAI/LLaDA2.0-Uni

Any-to-Any • 16B • Updated 11 days ago • 4.67k • 247

upvoted a paper about 1 month ago

ViVa: A Video-Generative Value Model for Robot Reinforcement Learning

Paper • 2604.08168 • Published Apr 9 • 18

liked a model about 1 month ago

bigcode/starcoder

Text Generation • 16B • Updated Oct 8, 2024 • 23.1k • 2.95k

upvoted a paper about 1 month ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

liked a dataset about 1 month ago

hijbullahx/goemotions

Viewer • Updated Apr 12 • 211k • 25 • 1

upvoted 2 papers about 1 month ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

liked a model about 1 month ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 169k • • 2.49k

林優奈

AI & ML interests

Recent Activity

Organizations

smoore2024's activity